This the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

  • 1:
    • 1.1:
    • 2:
      • 2.1:
        • 2.2:
          • 2.3:
            • 2.4:
            • 3:
              • 3.1:
                • 3.2:
                  • 3.3:
                    • 3.4:
                      • 3.5:
                        • 3.6:
                          • 3.7:
                            • 3.8:
                            • 4:
                              • 4.1:
                                • 4.2:
                                  • 4.3:
                                    • 4.4:
                                      • 4.5:
                                        • 4.6:
                                          • 4.7:
                                          • 5:
                                            • 5.1:
                                              • 5.2:
                                                • 5.2.1:
                                                  • 5.2.2:
                                                    • 5.2.3:
                                                    • 5.3:
                                                      • 5.3.1:
                                                        • 5.3.2:
                                                          • 5.3.2.1:
                                                            • 5.3.2.2:
                                                            • 5.3.3:
                                                              • 5.3.3.1:
                                                                • 5.3.3.2:
                                                                • 5.3.4:
                                                                  • 5.3.5:
                                                                    • 5.3.6:
                                                                      • 5.3.7:
                                                                        • 5.3.8:
                                                                          • 5.3.9:
                                                                            • 5.3.10:
                                                                              • 5.3.11:
                                                                                • 5.3.12:
                                                                                  • 5.3.13:
                                                                                    • 5.3.14:
                                                                                      • 5.3.14.1:
                                                                                        • 5.3.14.2:
                                                                                          • 5.3.14.3:
                                                                                          • 5.3.15:
                                                                                            • 5.3.16:
                                                                                              • 5.3.17:
                                                                                                • 5.3.17.1:
                                                                                                  • 5.3.17.2:
                                                                                                  • 5.3.18:
                                                                                                    • 5.3.19:
                                                                                                      • 5.3.20:
                                                                                                        • 5.3.21:
                                                                                                          • 5.3.22:
                                                                                                            • 5.3.23:
                                                                                                              • 5.3.24:
                                                                                                                • 5.3.25:
                                                                                                                • 5.4:
                                                                                                                  • 5.4.1:
                                                                                                                    • 5.4.2:
                                                                                                                    • 5.5:
                                                                                                                      • 5.6:
                                                                                                                        • 5.6.1:
                                                                                                                          • 5.6.1.1:
                                                                                                                            • 5.6.1.2:
                                                                                                                              • 5.6.1.3:
                                                                                                                                • 5.6.1.4:
                                                                                                                                  • 5.6.1.5:
                                                                                                                                    • 5.6.1.6:
                                                                                                                                      • 5.6.1.7:
                                                                                                                                        • 5.6.1.8:
                                                                                                                                      • 5.7:
                                                                                                                                        • 5.8:
                                                                                                                                          • 5.9:
                                                                                                                                            • 5.10:
                                                                                                                                              • 5.11:
                                                                                                                                                • 5.11.1:
                                                                                                                                                  • 5.11.2:
                                                                                                                                                  • 5.12:
                                                                                                                                                    • 5.13:
                                                                                                                                                      • 5.14:
                                                                                                                                                        • 5.15:
                                                                                                                                                          • 5.16:
                                                                                                                                                          • 6:
                                                                                                                                                            • 6.1:
                                                                                                                                                              • 6.2:
                                                                                                                                                                • 6.3:
                                                                                                                                                                  • 6.4:
                                                                                                                                                                    • 6.4.1:
                                                                                                                                                                      • 6.4.2:
                                                                                                                                                                        • 6.4.3:
                                                                                                                                                                          • 6.4.4:
                                                                                                                                                                          • 6.5:
                                                                                                                                                                            • 6.5.1:
                                                                                                                                                                              • 6.5.2:
                                                                                                                                                                                • 6.5.2.1:
                                                                                                                                                                                  • 6.5.2.2:
                                                                                                                                                                                    • 6.5.2.3:
                                                                                                                                                                                      • 6.5.2.4:
                                                                                                                                                                                        • 6.5.2.5:
                                                                                                                                                                                          • 6.5.2.6:
                                                                                                                                                                                          • 6.5.3:
                                                                                                                                                                                            • 6.5.3.1:
                                                                                                                                                                                              • 6.5.3.2:
                                                                                                                                                                                                • 6.5.3.3:
                                                                                                                                                                                                  • 6.5.3.4:
                                                                                                                                                                                                    • 6.5.3.5:
                                                                                                                                                                                                  • 6.6:
                                                                                                                                                                                                    • 6.6.1:
                                                                                                                                                                                                      • 6.6.2:
                                                                                                                                                                                                        • 6.6.3:
                                                                                                                                                                                                          • 6.6.4:
                                                                                                                                                                                                            • 6.6.4.1:
                                                                                                                                                                                                              • 6.6.4.2:
                                                                                                                                                                                                            • 6.7:
                                                                                                                                                                                                              • 6.7.1:
                                                                                                                                                                                                                • 6.7.2:
                                                                                                                                                                                                                  • 6.7.3:
                                                                                                                                                                                                                    • 6.7.4:
                                                                                                                                                                                                                      • 6.7.5:
                                                                                                                                                                                                                        • 6.7.6:
                                                                                                                                                                                                                          • 6.7.7:
                                                                                                                                                                                                                            • 6.7.8:
                                                                                                                                                                                                                              • 6.7.9:
                                                                                                                                                                                                                                • 6.7.10:
                                                                                                                                                                                                                                  • 6.7.11:
                                                                                                                                                                                                                                • 7:
                                                                                                                                                                                                                                  • 7.1:
                                                                                                                                                                                                                                    • 7.2:
                                                                                                                                                                                                                                      • 7.3:
                                                                                                                                                                                                                                        • 7.4:
                                                                                                                                                                                                                                          • 7.5:
                                                                                                                                                                                                                                            • 7.6:
                                                                                                                                                                                                                                              • 7.7:
                                                                                                                                                                                                                                              • 8:
                                                                                                                                                                                                                                                • 8.1:
                                                                                                                                                                                                                                                  • 8.2:
                                                                                                                                                                                                                                                    • 8.3:
                                                                                                                                                                                                                                                      • 8.4:
                                                                                                                                                                                                                                                        • 8.5:
                                                                                                                                                                                                                                                          • 8.6:
                                                                                                                                                                                                                                                            • 8.7:
                                                                                                                                                                                                                                                              • 8.8:
                                                                                                                                                                                                                                                                • 8.9:
                                                                                                                                                                                                                                                                • 9:
                                                                                                                                                                                                                                                                  • 9.1:
                                                                                                                                                                                                                                                                    • 9.2:
                                                                                                                                                                                                                                                                    • 10:
                                                                                                                                                                                                                                                                      • 10.1:
                                                                                                                                                                                                                                                                        • 10.2:
                                                                                                                                                                                                                                                                          • 10.2.1:
                                                                                                                                                                                                                                                                            • 10.2.2:
                                                                                                                                                                                                                                                                              • 10.2.3:
                                                                                                                                                                                                                                                                                • 10.2.4:
                                                                                                                                                                                                                                                                                  • 10.2.5:
                                                                                                                                                                                                                                                                                    • 10.2.5.1:
                                                                                                                                                                                                                                                                                      • 10.2.5.2:
                                                                                                                                                                                                                                                                                      • 10.2.6:
                                                                                                                                                                                                                                                                                        • 10.2.7:
                                                                                                                                                                                                                                                                                          • 10.2.8:
                                                                                                                                                                                                                                                                                            • 10.2.9:
                                                                                                                                                                                                                                                                                            • 10.3:
                                                                                                                                                                                                                                                                                              • 10.4:
                                                                                                                                                                                                                                                                                                • 10.5:
                                                                                                                                                                                                                                                                                                  • 10.6:
                                                                                                                                                                                                                                                                                                    • 10.6.1:
                                                                                                                                                                                                                                                                                                      • 10.6.2:
                                                                                                                                                                                                                                                                                                        • 10.6.3:
                                                                                                                                                                                                                                                                                                          • 10.6.4:
                                                                                                                                                                                                                                                                                                            • 10.6.5:
                                                                                                                                                                                                                                                                                                              • 10.6.6:
                                                                                                                                                                                                                                                                                                                • 10.6.7:
                                                                                                                                                                                                                                                                                                                  • 10.6.8:
                                                                                                                                                                                                                                                                                                                    • 10.6.9:
                                                                                                                                                                                                                                                                                                                      • 10.6.10:
                                                                                                                                                                                                                                                                                                                        • 10.6.11:
                                                                                                                                                                                                                                                                                                                          • 10.6.12:
                                                                                                                                                                                                                                                                                                                            • 10.6.13:
                                                                                                                                                                                                                                                                                                                              • 10.6.14:
                                                                                                                                                                                                                                                                                                                                • 10.6.15:
                                                                                                                                                                                                                                                                                                                                  • 10.6.16:
                                                                                                                                                                                                                                                                                                                                    • 10.6.17:
                                                                                                                                                                                                                                                                                                                                      • 10.6.18:
                                                                                                                                                                                                                                                                                                                                        • 10.6.19:
                                                                                                                                                                                                                                                                                                                                          • 10.6.20:
                                                                                                                                                                                                                                                                                                                                            • 10.6.21:
                                                                                                                                                                                                                                                                                                                                              • 10.6.22:
                                                                                                                                                                                                                                                                                                                                                • 10.6.23:
                                                                                                                                                                                                                                                                                                                                                  • 10.6.24:
                                                                                                                                                                                                                                                                                                                                                    • 10.6.25:
                                                                                                                                                                                                                                                                                                                                                      • 10.6.26:
                                                                                                                                                                                                                                                                                                                                                        • 10.6.27:
                                                                                                                                                                                                                                                                                                                                                        • 10.7:
                                                                                                                                                                                                                                                                                                                                                          • 10.8:
                                                                                                                                                                                                                                                                                                                                                            • 10.9:

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor is part of Sysdig’s container intelligence platform. Sysdig uses a unified platform to deliver security, monitoring, and forensics in a container- and microservices-friendly architecture. Sysdig Monitor is a monitoring, troubleshooting, and alerting suite offering deep, process-level visibility into dynamic, distributed production environments. Sysdig Monitor captures, correlates, and visualizes full-stack data, and provides dashboards for monitoring.

                                                                                                                                                                                                                                                                                                                                                            In the background, the Sysdig agent lives on the hosts being monitored and collects the appropriate metrics and events. Out of the box, the agent reports on a wide variety of pre-defined metrics. Additional metrics and custom parameters are available via agent configuration files. For more information, see the Sysdig Agent Documentation.

                                                                                                                                                                                                                                                                                                                                                            Major Benefits

                                                                                                                                                                                                                                                                                                                                                            • Explore and monitor application performance at any level of the infrastructure stack

                                                                                                                                                                                                                                                                                                                                                            • Correlate metrics and events, and compare with past performance

                                                                                                                                                                                                                                                                                                                                                            • Observe platform state and health

                                                                                                                                                                                                                                                                                                                                                            • Auto-detect anomalies

                                                                                                                                                                                                                                                                                                                                                            • Visualize and share performance metrics with out-of-the-box and custom dashboards

                                                                                                                                                                                                                                                                                                                                                            • Powerful, tuned, and flexible alerts

                                                                                                                                                                                                                                                                                                                                                            • Proactively alert on incidents across services, hosts, containers and so on

                                                                                                                                                                                                                                                                                                                                                            • Trigger system captures for offline troubleshooting and forensics

                                                                                                                                                                                                                                                                                                                                                            • Analyze system call activity to accelerate problem resolution

                                                                                                                                                                                                                                                                                                                                                            Key Components

                                                                                                                                                                                                                                                                                                                                                            Monitor Interface

                                                                                                                                                                                                                                                                                                                                                            Log into the Sysdig Monitor interface, and get started with the basics.

                                                                                                                                                                                                                                                                                                                                                            Overview

                                                                                                                                                                                                                                                                                                                                                            It provides a unified view of the health, risk, and capacity of your Kubernetes infrastructure across a multi- and hybrid-cloud environment.

                                                                                                                                                                                                                                                                                                                                                            Explore the Infrastructure

                                                                                                                                                                                                                                                                                                                                                            Dive into Sysdig Monitor with a deeper understanding of the Explore module, data aggregation, and how to break down data.

                                                                                                                                                                                                                                                                                                                                                            This feature is available in the Enterprise tier of the Sysdig product. See https://sysdig.com/pricing for details, or contact sales@sysdig.com.

                                                                                                                                                                                                                                                                                                                                                            Metrics

                                                                                                                                                                                                                                                                                                                                                            The backbone of monitoring: learn more about metrics, integrate external platforms, and explore the complete metrics dictionary.

                                                                                                                                                                                                                                                                                                                                                            Alerts

                                                                                                                                                                                                                                                                                                                                                            Learn how to build alerts to notify users of infrastructure events, changes in behavior, and unauthorized access.

                                                                                                                                                                                                                                                                                                                                                            Dashboards

                                                                                                                                                                                                                                                                                                                                                            Learn how to build a custom dashboard, configure the default ones, or reconfigure panels to best suit your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            Events

                                                                                                                                                                                                                                                                                                                                                            Integrate Docker and Kubernetes events, customize event notifications, and review infrastructure history.

                                                                                                                                                                                                                                                                                                                                                            Captures

                                                                                                                                                                                                                                                                                                                                                            Create capture files containing system calls and other OS events to assist monitoring and troubleshooting the infrastructure.

                                                                                                                                                                                                                                                                                                                                                            1 -

                                                                                                                                                                                                                                                                                                                                                            Getting Started with Sysdig Monitor

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor allows you to maximize the visibility of your Kubernetes environments with native Prometheus support. You can troubleshoot issues faster with Sysdig’s eBPF derived metrics, out-of-the-box dashboards, and alerts.

                                                                                                                                                                                                                                                                                                                                                            You can choose Sysdig Monitor for a Free Trial option to quickly connect to a single cloud account with Sysdig and start with Prometheus-compatible Kubernetes and cloud monitoring.

                                                                                                                                                                                                                                                                                                                                                            Once connected, the Get Started page shows a subset of the options available in the 30-day trial or Enterprise.

                                                                                                                                                                                                                                                                                                                                                            Get Started Page (SaaS)

                                                                                                                                                                                                                                                                                                                                                            The Get Started page targets the key steps to ensure users are getting the most value out of Sysdig Monitor. The page is updated with new steps as users complete tasks and Sysdig adds new features to the product.

                                                                                                                                                                                                                                                                                                                                                            The Get Started page also serves as a linking page for

                                                                                                                                                                                                                                                                                                                                                            • Documentation

                                                                                                                                                                                                                                                                                                                                                            • Release Notes

                                                                                                                                                                                                                                                                                                                                                            • The Sysdig Blog

                                                                                                                                                                                                                                                                                                                                                            • Self-Paced Training

                                                                                                                                                                                                                                                                                                                                                            • Support

                                                                                                                                                                                                                                                                                                                                                            Users can access the Get Started page at any time by clicking the rocketship in the side menu.

                                                                                                                                                                                                                                                                                                                                                            Install the Agent

                                                                                                                                                                                                                                                                                                                                                            Installing the agent on your infrastructure allows Sysdig to collect data for monitoring and security purposes. For more information, see Quick Install Sysdig Agent on Kubernetes.

                                                                                                                                                                                                                                                                                                                                                            (Optional) Connect Your Prometheus Servers

                                                                                                                                                                                                                                                                                                                                                            Connecting your Prometheus servers to Sysdig-managed Prometheus Service helps leverage Sysdig for scalable long-term storage of your Prometheus metrics, PromQL dashboards, centralized querying, and PromQL-based alerting. For more information, see Collect Prometheus Metrics.

                                                                                                                                                                                                                                                                                                                                                            Invite Your Team

                                                                                                                                                                                                                                                                                                                                                            Invite someone in your team to use this Sysdig Monitor account. They will be notified with an email. A user will be created for them and will be added to the default team. They are automatically assigned to the Advanced User role.

                                                                                                                                                                                                                                                                                                                                                            Monitor Your Kubernetes Clusters

                                                                                                                                                                                                                                                                                                                                                            Get a unified view of the health, risk, and capacity of your Kubernetes infrastructure in a multi- and hybrid-cloud environment. For more information, see Dashboard Templates.

                                                                                                                                                                                                                                                                                                                                                            Workload Status & Performance

                                                                                                                                                                                                                                                                                                                                                            Get deep insight into your Kubernetes workloads faster with the Workload Status & Performance Dashboard.

                                                                                                                                                                                                                                                                                                                                                            Pod Status & Performance

                                                                                                                                                                                                                                                                                                                                                            Drill down to workload pods and monitor pod-level resource usage and troubleshoot performance issues with the Pod Status & Performance Dashboard.

                                                                                                                                                                                                                                                                                                                                                            Cluster Capacity Planning

                                                                                                                                                                                                                                                                                                                                                            Verify if your cluster is sized properly for existing deployed applications, identify over-commit on resources that can lead to pod evictions, discover unused requested resources or containers without limits defined with the Cluster Capacity Planning Dashboard.

                                                                                                                                                                                                                                                                                                                                                            Cluster/Namespace Available Resources

                                                                                                                                                                                                                                                                                                                                                            Determine if your cluster has the capacity to deploy a new workload and ascertain if increasing CPU or memory requests or placing limits on an existing application is necessary with the Cluster/Namespace Available Resources Dashboard.

                                                                                                                                                                                                                                                                                                                                                            Pod Rightsizing & Workload Capacity Optimization

                                                                                                                                                                                                                                                                                                                                                            Identify resource-hogging workloads while optimizing your capacity with the Pod Rightsizing & Workload Capacity Optimization Dashboard.

                                                                                                                                                                                                                                                                                                                                                            Set Up Alert

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor emits alerts to get proactive notification of events, anomalies, or any incident that requires attention. The alerting system provides out-of-the-box push gateways for regular email, Slack, Cloud-provider notification queues, and custom webhooks, among others. See .

                                                                                                                                                                                                                                                                                                                                                            Configure a Notification Channel

                                                                                                                                                                                                                                                                                                                                                            Alerts are used in Sysdig Monitor when Event thresholds have been crossed and can be sent over a variety of supported notification channels. Integrate Sysdig with your notification dispatchers and incident management workflows. See Set Up Notification Channels

                                                                                                                                                                                                                                                                                                                                                            Turn on Alerts

                                                                                                                                                                                                                                                                                                                                                            Turn on recommended alerts from our Alerts Library. Customize our recommendations or create your own alerts from scratch. See Alerts Library.

                                                                                                                                                                                                                                                                                                                                                            Monitor Your Services

                                                                                                                                                                                                                                                                                                                                                            Create a Dashboard

                                                                                                                                                                                                                                                                                                                                                            Create customized dashboards to display the most relevant views and metrics for the infrastructure in a single location. Each dashboard is comprised of a series of panels configured to display specific data in a number of different formats. See Dashboards.

                                                                                                                                                                                                                                                                                                                                                            Get Started with PromQL

                                                                                                                                                                                                                                                                                                                                                            Write PromQL queries easier with form-based querying available with Sysdig Monitor. All metrics are enriched with cloud and Kubernetes metadata avoiding complicated PromQL joins. See Using PromQL.

                                                                                                                                                                                                                                                                                                                                                            Monitoring Integrations

                                                                                                                                                                                                                                                                                                                                                            Sysdig discovers services running in infrastructure and recommends appropriate Monitoring Integrations that allow you to collect service-specific metrics. The integration bundle includes out-of-the-box dashboards and default alerts. See (Limited Availability) Configure Monitoring Integrations.

                                                                                                                                                                                                                                                                                                                                                            Advanced Actions

                                                                                                                                                                                                                                                                                                                                                            Integrate development tools:

                                                                                                                                                                                                                                                                                                                                                            1.1 -

                                                                                                                                                                                                                                                                                                                                                            Basic Onboarding

                                                                                                                                                                                                                                                                                                                                                            This section describes onboarding tips for Sysdig Monitor on-premises versions prior to 3.5.0.

                                                                                                                                                                                                                                                                                                                                                            Access the Sysdig Monitor Interface

                                                                                                                                                                                                                                                                                                                                                            To access the interface, the Sysdig Agent must be installed. For instructions on installing the Sysdig agent, see the Agent Installation documentation.

                                                                                                                                                                                                                                                                                                                                                            • The first user becomes the administrator.

                                                                                                                                                                                                                                                                                                                                                            • Once agent installation is complete, the Sysdig Monitor interface is available at https://app.sysdigcloud.com, if you are in the US East region. For other regions, the format is https://<region>.app.sysdig.com. Replace <region> with the region where your Sysidig application is hosted. For example, for Sysdig Monitor in the EU, you use https://eu1.app.sysdig.com.

                                                                                                                                                                                                                                                                                                                                                              See SaaS Regions and IP Ranges and identify the correct URLs associated with your Sysdig application and region.

                                                                                                                                                                                                                                                                                                                                                            • Subsequent users must also have user credentials defined, either through Sysdig Monitor or through an integrated authentication tool.

                                                                                                                                                                                                                                                                                                                                                              For information on creating users, see User and Team Administration.

                                                                                                                                                                                                                                                                                                                                                            The Sysdig Monitor Interface

                                                                                                                                                                                                                                                                                                                                                            The Explore module is the primary starting point for all troubleshooting with Sysdig Monitor, and the default homepage of the web interface for all users. An overview of the health of the entire infrastructure is displayed in a table, with various infrastructure components broken into a pre-configured hierarchical grouping.

                                                                                                                                                                                                                                                                                                                                                            Groupings are hierarchical aggregations that allow users to break down their infrastructure into logical sections to find and view issues. The default out-of-the-box grouping is host.hostName → container.name.

                                                                                                                                                                                                                                                                                                                                                            For more information on groupings, refer to the Grouping, Scoping, and Segmenting Metrics section.

                                                                                                                                                                                                                                                                                                                                                            The left-side menu provides access to the five primary modules within Sysdig Monitor, in addition to the user menus and Spotlight.

                                                                                                                                                                                                                                                                                                                                                            Module

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Overview

                                                                                                                                                                                                                                                                                                                                                            The Overview module provides a unified view of the health, risk, and capacity of your Kubernetes infrastructure.

                                                                                                                                                                                                                                                                                                                                                            Explore

                                                                                                                                                                                                                                                                                                                                                            The Explore module provides users with an in-depth overview of the entire infrastructure, and the ability to deep dive into metrics and dashboards to troubleshoot issues.

                                                                                                                                                                                                                                                                                                                                                            Dashboards

                                                                                                                                                                                                                                                                                                                                                            The Dashboards module allows users to create, modify, and share dashboards.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Dashboards documentation.

                                                                                                                                                                                                                                                                                                                                                            Alerts

                                                                                                                                                                                                                                                                                                                                                            The Alerts module allows users to create, modify, and enable/disable the current alerts configured in Sysdig Monitor, to ensure infrastructure issues are identified effectively.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Alerts documentation.

                                                                                                                                                                                                                                                                                                                                                            Events

                                                                                                                                                                                                                                                                                                                                                            The Events module provides a detailed list of any events that triggered an alert within the infrastructure.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Events documentation.

                                                                                                                                                                                                                                                                                                                                                            Captures

                                                                                                                                                                                                                                                                                                                                                            The Captures page provides a complete list of any capture files that have been created within Sysdig Monitor.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Captures documentation.

                                                                                                                                                                                                                                                                                                                                                            Manage Integrations with Sysdig Spotlight

                                                                                                                                                                                                                                                                                                                                                            Sysdig Spotlight provides an at a glance summary of the current state of the infrastructure and helps configure applications to be monitored.

                                                                                                                                                                                                                                                                                                                                                            You manage Sysdig Monitor integrations through Sysdig Spotlight, before being configured separately.

                                                                                                                                                                                                                                                                                                                                                            Add an Integration

                                                                                                                                                                                                                                                                                                                                                            To add new integrations:

                                                                                                                                                                                                                                                                                                                                                            1. Click the Spotlight (target) icon in the management section of the left-hand sidebar to open Sysdig Spotlight.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Manage Your Integrations link.

                                                                                                                                                                                                                                                                                                                                                            3. Choose the integrations to add by selecting the relevant icon, or by searching for the integration, and then selecting the relevant icon. Multiple integrations can be added at once.

                                                                                                                                                                                                                                                                                                                                                            4. Click the Add X Integration/s button to add the integrations to Sysdig Monitor.

                                                                                                                                                                                                                                                                                                                                                            Integrations whose configuration is pending are marked with a warning symbol.

                                                                                                                                                                                                                                                                                                                                                            For configuration and supported applications, refer to the Integrations for Sysdig Monitor documentation.

                                                                                                                                                                                                                                                                                                                                                            Delete an Integration

                                                                                                                                                                                                                                                                                                                                                            To delete existing integrations:

                                                                                                                                                                                                                                                                                                                                                            1. From the Spotlight page, select the integrations to remove.

                                                                                                                                                                                                                                                                                                                                                              Running integrations cannot be removed.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Remove X Integration(s) button.

                                                                                                                                                                                                                                                                                                                                                            3. Click the Remove link to confirm the changes.

                                                                                                                                                                                                                                                                                                                                                            2 -

                                                                                                                                                                                                                                                                                                                                                            Overview

                                                                                                                                                                                                                                                                                                                                                            Overview leverages Sysdig’s unified data platform to monitor, secure, and troubleshoot your hosts and Kubernetes clusters and workloads.

                                                                                                                                                                                                                                                                                                                                                            The module provides a unified view of the health, risk, and capacity of your Kubernetes infrastructure— a single pane of glass for host machines as well as Kubernetes Clusters, Nodes, Namespaces, and Workloads across a multi- and hybrid-cloud environment. You can easily filter by any of these entities and view associated events and health data.

                                                                                                                                                                                                                                                                                                                                                            Overview shows metrics prioritized by event count and severity, allowing you to get to the root cause of the problem faster. Sysdig Monitor polls the infrastructure data every 10 minutes and refreshes the metrics and events on the Overview page with the system health.

                                                                                                                                                                                                                                                                                                                                                            Key Benefits

                                                                                                                                                                                                                                                                                                                                                            Overview provides the following benefits:

                                                                                                                                                                                                                                                                                                                                                            • Show a unified view of the health, risk, resource use, and capacity of your infrastructure environment at scale

                                                                                                                                                                                                                                                                                                                                                              • Render metrics, security events, compliance CIS benchmark results, and contextual events in a single location

                                                                                                                                                                                                                                                                                                                                                              • Eliminate the need for stand-alone security, monitoring, and forensics tools

                                                                                                                                                                                                                                                                                                                                                              • View data on-the-fly by workload or by infrastructure

                                                                                                                                                                                                                                                                                                                                                            • Display contextual live event stream from alerts, Kubernetes, containers, policies, and image scanning results

                                                                                                                                                                                                                                                                                                                                                            • Surface entities intelligently based on event count and severity

                                                                                                                                                                                                                                                                                                                                                            • Drills down from Clusters to Nodes and Namespaces

                                                                                                                                                                                                                                                                                                                                                            • Support Infrastructure monitoring of multi- and hybrid- cloud environments

                                                                                                                                                                                                                                                                                                                                                            • Expose relevant information based on core operational users :

                                                                                                                                                                                                                                                                                                                                                              • DevOps / Platform Ops

                                                                                                                                                                                                                                                                                                                                                              • Security Analyst

                                                                                                                                                                                                                                                                                                                                                              • Service Owner

                                                                                                                                                                                                                                                                                                                                                            Accessing the Overview User Interface

                                                                                                                                                                                                                                                                                                                                                            You can access and set the scope of Overview in the Sysdig Monitor UI or with the URL:

                                                                                                                                                                                                                                                                                                                                                            Click Overview in the left navigation, then select one of the Kubernetes entities:

                                                                                                                                                                                                                                                                                                                                                            About the Overview User Interface

                                                                                                                                                                                                                                                                                                                                                            The Overview interface opens to the Cluster Overview page. This section describes the major components of the interface and the navigation options.

                                                                                                                                                                                                                                                                                                                                                            Overview Rows

                                                                                                                                                                                                                                                                                                                                                            Each row represents a Kubernetes entity: a cluster, node, namespace, or workload. In the screenshot above, each row shows a Kubernetes cluster.

                                                                                                                                                                                                                                                                                                                                                            • Navigating rows is easy

                                                                                                                                                                                                                                                                                                                                                              Click on the Overview icon in the left navigation and choose an Overview page, or drill down into the next Overview page to explore the next granular level of data. Each Overview page shows 10 rows by default and a maximum of 100 rows. Click Load More to display additional rows if there are more than 10 rows per page.

                                                                                                                                                                                                                                                                                                                                                            • Ability to select a specific row in an Overview page

                                                                                                                                                                                                                                                                                                                                                              Each row contains the scope of the relevant entity that it is showing data for. Clicking a specific row leads to deselecting the rest of the rows (for instance, selecting staging deselects all other rows in the screenshot above) to focus on the scope of the selected entity, including the events which are scoped out by that row. Further, the Live badge will change to Paused, implying rows will not be updated regardless of the new data coming in. Pausing to focus on a single row provides a snapshot of what is going on until at the moment with the entity under purview.

                                                                                                                                                                                                                                                                                                                                                            • Entities are listed according to the severity and number of events detected in them, not by how new the events are

                                                                                                                                                                                                                                                                                                                                                              Rows are sorted by the count and severity level of the events associated with the entity and are displayed in descending order. The items with the highest number of high severity events are shown first, followed by medium, low, and info. This organization helps to highlight events demanding immediate attention and to streamline troubleshooting efforts, in environments that may include thousands of entities.

                                                                                                                                                                                                                                                                                                                                                            Scope Editor

                                                                                                                                                                                                                                                                                                                                                            Scope Editor allows targeting down to a specific entity, such as a particular workload or namespace, from environments that may include thousands of entities. The levels of scope, determined by Kubernetes hierarchy, progresses from Workload to Cluster where Cluster being at the top level. In smaller environments, using the Scope Editor is equivalent to clicking a single row in an Overview page where no scope has been applied.

                                                                                                                                                                                                                                                                                                                                                            Cluster: The highest level in the hierarchy. The only scope applied to the page is Cluster. It allows you to select a specific cluster from a list of available ones.

                                                                                                                                                                                                                                                                                                                                                            Node: The second level in the hierarchy. The scope is determined by Cluster and Node. Selection is narrowed down to a specific node in a selected cluster.

                                                                                                                                                                                                                                                                                                                                                            Namespace: The third level in the hierarchy. The scope is determined by Cluster and Namespace. Selection is narrowed down to a specific namespace in a selected cluster.

                                                                                                                                                                                                                                                                                                                                                            Workloads: The last entity in the hierarchy. The scope is initially determined by Cluster and Namespace, then the selection is narrowed to a specific Deployment, Service, or StatefulSet. Choosing all three options are not allowed.

                                                                                                                                                                                                                                                                                                                                                            Time Navigation

                                                                                                                                                                                                                                                                                                                                                            The Overview feature is based around time. Sysdig Monitor polls the infrastructure data every 10 minutes and refreshes the metrics and events on the Overview page with the system health. You select how to view this gathered data by choosing a Preset interval and a time Range.

                                                                                                                                                                                                                                                                                                                                                            Presets

                                                                                                                                                                                                                                                                                                                                                            Presets are a way of visualizing data that Sysdig Monitor gathers every 10 minutes. A preset that is 10 minutes or less is refreshed every 30 seconds. A preset that is greater than 10 minutes is refreshed every 1 minute. Select a preset to determine the data sample to be displayed. Overview supports the following presets:

                                                                                                                                                                                                                                                                                                                                                            • 1 Hour: Data polled for the last one hour. This is the default value.

                                                                                                                                                                                                                                                                                                                                                            • 6 Hour: Data polled for the last six-hour.

                                                                                                                                                                                                                                                                                                                                                            • 1 Day: Data polled for the last day.

                                                                                                                                                                                                                                                                                                                                                            Presets work in conjunction with Range selections. Selecting a particular preset interval refreshes Range selection and reloads the Overview rows and events subsequently. For example:

                                                                                                                                                                                                                                                                                                                                                            10 Minutes: Resets the Range to Jul 9, 2.20 pm - Jul 9, 2.30 pm.

                                                                                                                                                                                                                                                                                                                                                            6 Hour: Resets the Range to Jul 9, 8.30 am - Jul 9, 2.30 pm.

                                                                                                                                                                                                                                                                                                                                                            1 Day: Resets the Range to Jul 8, 2.30 pm - Jul 9, 2.30 pm.

                                                                                                                                                                                                                                                                                                                                                            Because metrics and events are refreshed every 10 minutes on the Overview page, if you stay for more then 10 minutes on the Overview page, the data will be updated to show the newly-computed values.

                                                                                                                                                                                                                                                                                                                                                            Presets are global throughout the Sysdig Monitor interface. For example, if you select 10 minutes in the Explore view, the Overview preset will also be 10 minutes, and vice versa. Choosing an unsupported Preset in Explorer falls back to 1 day in Overview.

                                                                                                                                                                                                                                                                                                                                                            Range

                                                                                                                                                                                                                                                                                                                                                            Range shows both date and time interval as well as the selected Presets in parenthesis. The Range indicated on the UI is determined by Presets. The time given is the closest time interval and by default, it is the current date and time preset by 1 hour. See Presets to understand how Range works with Presets.

                                                                                                                                                                                                                                                                                                                                                            Time Format

                                                                                                                                                                                                                                                                                                                                                            Overview supports UTC and PDT time formats. Use the toggle button next to Range to change the time format for the slot shown in Range. The default is PDT.

                                                                                                                                                                                                                                                                                                                                                            Live

                                                                                                                                                                                                                                                                                                                                                            The Live badge shows if the feed (Overview rows with data) is Live or Paused.

                                                                                                                                                                                                                                                                                                                                                            • Live: the data is continuously updating based on the 10-minute polling of the Sysdig back end. The Overview feed is normally always Live.

                                                                                                                                                                                                                                                                                                                                                            • Paused: When a specific row is selected, the data refresh pauses and the rows will not be updated with new data coming in.

                                                                                                                                                                                                                                                                                                                                                            Unified Stream of Events

                                                                                                                                                                                                                                                                                                                                                            The right panel of Overview provides a context-sensitive events feed.

                                                                                                                                                                                                                                                                                                                                                            Click an overview row to see relevant Events on the right. Each event is intelligently populated with end-to-end metadata to give context and enable troubleshooting.

                                                                                                                                                                                                                                                                                                                                                            Event Types

                                                                                                                                                                                                                                                                                                                                                            Overview renders the following event types:

                                                                                                                                                                                                                                                                                                                                                            • Alert: See Alerts.

                                                                                                                                                                                                                                                                                                                                                            • Custom: Ensure that Custom labels are enabled to view this type of events.

                                                                                                                                                                                                                                                                                                                                                            • Containers: Events associated with containers.

                                                                                                                                                                                                                                                                                                                                                            • Kubernetes: Events associated with Kubernetes infrastructure.

                                                                                                                                                                                                                                                                                                                                                            • Scanning: See Image Scanning.

                                                                                                                                                                                                                                                                                                                                                            • Policy: See Policies.

                                                                                                                                                                                                                                                                                                                                                            Event Statuses

                                                                                                                                                                                                                                                                                                                                                            Overview renders the following alert-generated event statuses:

                                                                                                                                                                                                                                                                                                                                                            • Triggered: The alert condition has been met and still persists.

                                                                                                                                                                                                                                                                                                                                                            • Resolved: A previously existed alert condition no longer persists.

                                                                                                                                                                                                                                                                                                                                                            • Acknowledged: The event has been acknowledged by the intended recipient.

                                                                                                                                                                                                                                                                                                                                                            • Un-acknowledged: The event has not been acknowledged by an intended recipient. All events are by default marked as Un-acknowledged.

                                                                                                                                                                                                                                                                                                                                                            • Silenced: The alert event has been silenced for a specified scope. No alert notification will be sent out to the channels during the silenced window.

                                                                                                                                                                                                                                                                                                                                                            General Guidelines

                                                                                                                                                                                                                                                                                                                                                            First-Time Usage

                                                                                                                                                                                                                                                                                                                                                            • If the environment is created for the first time, Sysdig Monitor fetches data and generates associated pages. The Overview feature is immediately enabled. However, wait for, at the maximum, 1 hour to see the Overview pages with the necessary data.

                                                                                                                                                                                                                                                                                                                                                            • Overview uses time windows in segments of 1H, 6H and 1D, and therefore wait respectively for 1H, 6H and 1D to be able to see data on the Overview pages.

                                                                                                                                                                                                                                                                                                                                                            • If enough data is not available for the first 1 hour, the “No Data Available” page will be presented until the first 1 hour passes.

                                                                                                                                                                                                                                                                                                                                                            Tuning Overview Data

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor leverages a caching mechanism to fetch pre-computed data for the Overview screens.

                                                                                                                                                                                                                                                                                                                                                            If pre-computed data is unavailable, data fetched will be non-computed data, which must be calculated before displaying. This additional computational time adds delays. Caching is enabled for Overview but for optimum performance, you must wait for 1H, 6H, and 1D windows the first time you use Overview. After the specified time has passed, the data will be automatically be cached with every passing minute.

                                                                                                                                                                                                                                                                                                                                                            Enabling Overview for On-Prem Deployments

                                                                                                                                                                                                                                                                                                                                                            The Overview feature is not available by default on On-Prem deployments. Use the following API to enable it:

                                                                                                                                                                                                                                                                                                                                                            1. Get the Beta settings as follows:

                                                                                                                                                                                                                                                                                                                                                              curl -X GET 'https://<Sysdig URL>/api/on-prem/settings/overviews' \
                                                                                                                                                                                                                                                                                                                                                              -H 'Authorization: Bearer <GLOBAL_SUPER_ADMIN_SDC_TOKEN>' \
                                                                                                                                                                                                                                                                                                                                                              -H 'X-Sysdig-Product: SDC' -k
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                              Replace <Sysdig URL> with the Sysdig URL associated with your deployment and <GLOBAL_SUPER_ADMIN_SDC_TOKEN> with the SDC token associated with your deployment.

                                                                                                                                                                                                                                                                                                                                                            2. Copy the payload and change the desired values in the settings.

                                                                                                                                                                                                                                                                                                                                                            3. Update the settings as follows:

                                                                                                                                                                                                                                                                                                                                                              curl X PUT 'https://<Sysdig URL>/api/on-prem/settings/overview' \
                                                                                                                                                                                                                                                                                                                                                              -H 'Authorization: Bearer <GLOBAL_SUPER_ADMIN_SDC_TOKEN>' \
                                                                                                                                                                                                                                                                                                                                                              -H 'X-Sysdig-Product: SDC' \
                                                                                                                                                                                                                                                                                                                                                              -d '{  "overviews": true,  "eventScopeExpansion": true}'
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                            Feature Flags

                                                                                                                                                                                                                                                                                                                                                            • overviews: Set overviews to true to enable the backend components and the UI.

                                                                                                                                                                                                                                                                                                                                                            • eventScopeExpansion: Set eventScopeExpansion to true to enable scope expansion for all the Event types.

                                                                                                                                                                                                                                                                                                                                                            2.1 -

                                                                                                                                                                                                                                                                                                                                                            Clusters Data

                                                                                                                                                                                                                                                                                                                                                            This topic discusses the Clusters Overview page and helps you understand its gauge charts and the data displayed on them.

                                                                                                                                                                                                                                                                                                                                                            About Clusters Overview

                                                                                                                                                                                                                                                                                                                                                            In Kubernetes, a pool of nodes combine together their resources to form a more powerful machine, that is a Cluster. The Cluster Overview page provides key metrics indicating the health, risk, capacity, and compliance of each cluster. Your cluster can reside in any cloud or multi-cloud environment of your choice.

                                                                                                                                                                                                                                                                                                                                                            Each row in the Clusters page represents a cluster. Clusters are sorted by the severity of corresponding events in order to highlight the area that needs attention. For example, a cluster with high severity events is bubbled up to the top of the page to highlight the issue. You can further drill down to the Nodes or Namespaces Overview page for investigating at each level.

                                                                                                                                                                                                                                                                                                                                                            In environments where no Sysdig Secure is enabled, Network I/O is shown instead of the Compliance score.

                                                                                                                                                                                                                                                                                                                                                            Interpret the Cluster Data

                                                                                                                                                                                                                                                                                                                                                            This topic gives insight into the metrics displayed on the Clusters Overview screen.

                                                                                                                                                                                                                                                                                                                                                            Node Ready Status

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by avg(min(kubernetes.node.ready)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The number shows the readiness for nodes to accept pods across the entire cluster. The numeric availability indicates the percentage of time the nodes are reported as ready by Kubernetes. For example:

                                                                                                                                                                                                                                                                                                                                                            • 100% is displayed when 10 out of 10 nodes are ready for the entire time window, say, for the last one hour.

                                                                                                                                                                                                                                                                                                                                                            • 95% is displayed when 9 out of 10 nodes are ready for the entire time window and one node is ready only for 50% of the time.

                                                                                                                                                                                                                                                                                                                                                            The bar chart displays the trend across the selected time window, and each bar represents a time slice. For example, selecting the last 1-hour window displays 6 bars, each indicating a 10-minute time slice. Each bar represents the availability across the time slice (green) or the unavailability (red).

                                                                                                                                                                                                                                                                                                                                                            For instance, the following image shows an average availability of 80% across the last 1-hour, and each 10-minute time slice shows a constant availability for the same time window:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            Expect a constant 100% at all times.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            If the value is less than 100%, determine whether a node is not available at all, or one or more nodes are partially available.

                                                                                                                                                                                                                                                                                                                                                            • Drill down either to the Nodes screen in Overview or to the “Kubernetes Cluster Overview” in Explore to see the list of nodes and their availability.

                                                                                                                                                                                                                                                                                                                                                            • Check the Kubernetes Node Overview dashboard in Explore to identify the problem that Kubernetes reports.

                                                                                                                                                                                                                                                                                                                                                            Pods Available vs Desired

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(kubernetes.namespace.pod.available.count)) / sum(avg(kubernetes.namespace.pod.desired.count)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart displays the ratio between available and desired pods, averaged across the selected time window, for all the pods in a given Cluster. The upper bound shows the number of desired pods in the Cluster.

                                                                                                                                                                                                                                                                                                                                                            For instance, the following image shows 42 desired pods are available to use:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            You should typically expect 100%.

                                                                                                                                                                                                                                                                                                                                                            If certain pods take a long time to be available you might temporarily see a value that is less than 100%. Pulling images, pod initialization, readiness probe, and so on causes such delays.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            Identify one or more Namespaces that have lower availability. To do so, drill down to the Namespaces screen, then drill down to the Workloads screen to identify the unavailable pods.

                                                                                                                                                                                                                                                                                                                                                            If the number of unavailable pods is considerably higher (the ratio is significantly low), check the status of the Nodes. A Node failure will cause several pods to become unavailable across most of the Namespaces.

                                                                                                                                                                                                                                                                                                                                                            Several factors could cause the pods to stuck in the Pending state:

                                                                                                                                                                                                                                                                                                                                                            • Pods make requests for resources that exceed what’s available across the nodes (the remaining allocatable pods).

                                                                                                                                                                                                                                                                                                                                                            • Pods make requests higher than the availability of every single node. For example, you have 8-core Nodes and you create a pod with a 16-core request. These pods might require reconfiguration and specific setup related to Node affinity and anti-affinity constraints.

                                                                                                                                                                                                                                                                                                                                                            • Namespace quota is reached before making a high resource request.

                                                                                                                                                                                                                                                                                                                                                              If a quota is enforced at the Namespace level, you may hit the limit independent of the resource availability across the Nodes.

                                                                                                                                                                                                                                                                                                                                                            CPU Requests vs Allocatable

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(kubernetes.pod.resourceRequests.cpuCores)) / sum(avg(kubernetes.node.allocatable.cpuCores)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart displays the ratio between CPU requests configured for all the pods in a selected Cluster and allocatable CPUs across all the nodes.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the number of allocatable CPU cores across all the nodes in the Cluster.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows that out of 620 available CPU cores across all the nodes (allocatable CPUs), 71% is requested by the pods:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            Your resource utilization strategy determines what ratio you can expect. A healthy ratio falls between 50% and 80%.

                                                                                                                                                                                                                                                                                                                                                            Assuming all the nodes have the same amount of allocatable resources, a reasonable upper bound is the value of (node_count - 1) / node_count x 100. For example, the ratio will be 90% if you have 9 nodes. Having this percentage protects you against a node becoming unavailable.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            A lower ratio indicates under-utilized resources (and corresponding cost) in your infrastructure. A higher ratio indicates insufficient resources. As a result

                                                                                                                                                                                                                                                                                                                                                            • Applications cannot be scheduled to be run.

                                                                                                                                                                                                                                                                                                                                                            • Pods might not start and remain in a Pending/Unscheduled state.

                                                                                                                                                                                                                                                                                                                                                            To triage, do the following:

                                                                                                                                                                                                                                                                                                                                                            • Drill down to the Nodes screen to get insights into how resources are utilized across all nodes.

                                                                                                                                                                                                                                                                                                                                                            • Drill down to the Namespaces screen to understand how resources are requested across Namespaces.

                                                                                                                                                                                                                                                                                                                                                            • Drill down to Explore and refer to the following dashboards:

                                                                                                                                                                                                                                                                                                                                                              • Kubernetes CPU Allocation Optimization: Evaluate whether a significant amount of resources are under-utilized in the infrastructure.

                                                                                                                                                                                                                                                                                                                                                              • Kubernetes Workloads CPU Usage and Allocation: Determine whether pods are properly configured and are using resources as expected.

                                                                                                                                                                                                                                                                                                                                                            Can the Value Be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Currently, the ratio accounts only for scheduled pods, while pending pods are excluded from the calculation. This means pods have been scheduled to run on Nodes out of the allocatable pods. Consequently, the ratio cannot be higher than 100%.

                                                                                                                                                                                                                                                                                                                                                            In the case of over-commitment (pods requesting for more resources than what’s available), you can expect a higher Requests vs Allocatable ratio and a lower Pods Available vs Desired ratio. What it indicates is that most of the available resources are being used, and what’s left is not enough to schedule additional pods. Therefore, the Available vs Desired ratio for pods will decrease.

                                                                                                                                                                                                                                                                                                                                                            When your environment has pods that are updated often or that are deleted and created often (for example, testing Clusters), the total requests might appear higher than what it is at any given time. Consequently, the ratio becomes higher across the selected time window, and you might see a value that is higher than 100%. This error is rendered due to how the data engine calculates the aggregated ratio.

                                                                                                                                                                                                                                                                                                                                                            Drill down to Kubernetes Cluster Overview to see the CPU Cores Usage vs Requests vs Allocatable time series to correctly evaluate the trend of the request commitments.

                                                                                                                                                                                                                                                                                                                                                            Listed below are some of the factors that could cause the pods to stuck in a Pending state:

                                                                                                                                                                                                                                                                                                                                                            • Pods make requests that exceed what’s available across the nodes (the remaining allocatable pods). The Requests vs Allocatable ratio is an indicator of this issue.

                                                                                                                                                                                                                                                                                                                                                            • Pods make requests that are higher than the availability of every single Node. For example, you have 8-core Nodes and you create a pod with a 16-core request. These pods might require reconfiguration and specific setup related to Node affinity and anti-affinity constraints.

                                                                                                                                                                                                                                                                                                                                                            • The Quota set at the Namespace level is reached before a request is configured. The Requests vs Allocatable ratio may not suggest the problem, but the Pods Available vs Desired ratio would decrease, especially for the specific Namespaces. See the Namespaces screen in Overview.

                                                                                                                                                                                                                                                                                                                                                            Memory Requests vs Allocatable

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(kubernetes.pod.resourceRequests.memBytes)) / sum(avg(kubernetes.node.allocatable.memBytes)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart displays the ratio between memory requests configured for all the pods in the Cluster and allocatable memory available across all the Nodes.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the allocatable memory available across all Nodes. The value is expressed in bytes, displayed in a specified unit.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows that out of 29.7 GiB available across all Nodes (allocatable memory), 35% is requested by the pods:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            Your resource utilization strategy determines what ratio you can expect. A healthy ratio falls between 50% and 80%.

                                                                                                                                                                                                                                                                                                                                                            Assuming all the nodes have the same amount of allocatable resources, a reasonable upper bound is the value of (node_count - 1) / node_count x 100. For example, 90% if you have 9 nodes. This ratio protects your system against a node becoming unavailable.

                                                                                                                                                                                                                                                                                                                                                            What to do Otherwise

                                                                                                                                                                                                                                                                                                                                                            A lower ratio indicates under-utilized resources (and corresponding cost) in your infrastructure. A higher ratio indicates insufficient resources. As a result

                                                                                                                                                                                                                                                                                                                                                            • Applications cannot be scheduled to be run.

                                                                                                                                                                                                                                                                                                                                                            • Pods might not start and remain in a Pending/Unscheduled state.

                                                                                                                                                                                                                                                                                                                                                            To troubleshoot, do the following:

                                                                                                                                                                                                                                                                                                                                                            • Drill down to the Nodes screen to get insights into how resources are utilized across all the Nodes.

                                                                                                                                                                                                                                                                                                                                                            • Drill down to the Namespaces screen to understand how resources are requested across Namespaces.

                                                                                                                                                                                                                                                                                                                                                            • Drill down to Explore and refer to the following dashboards:

                                                                                                                                                                                                                                                                                                                                                              • Kubernetes Memory Allocation Optimization: Evaluate whether a significant amount of resources are under-utilized in the infrastructure.

                                                                                                                                                                                                                                                                                                                                                              • Kubernetes Workloads Memory Usage and Allocation: Determine whether pods are properly configured and are using resources as expected.

                                                                                                                                                                                                                                                                                                                                                            Can the Value be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            The ratio currently accounts only for scheduled pods, while pending pods are excluded from the calculation. What this implies is that pods have been scheduled to run on Nodes out of the allocatable resources available. Consequently, the ratio cannot be higher than 100%.

                                                                                                                                                                                                                                                                                                                                                            In the case of over-commitment (pods requesting for more resources than what’s available), expect a higher Requests vs Allocatable ratio and a lower Pods Available vs Desired ratio. What it indicates is that most of the available resources have been used and what’s left is not enough to schedule additional pods. Therefore, the Pods Available vs Desired ratio will decrease.

                                                                                                                                                                                                                                                                                                                                                            When your environment has pods that are updated often or that are deleted and created often (for example, testing Clusters), the total requests might appear higher than what it is at any given time. Consequently, the ratio becomes higher across the selected time window, and you might see a value that is higher than 100%. This error is rendered due to how the data engine calculates the aggregated ratio.

                                                                                                                                                                                                                                                                                                                                                            Drill down to Kubernetes Cluster Overview to see the Memory Requests vs Allocatable time series to correctly evaluate the trend for the request commitments.

                                                                                                                                                                                                                                                                                                                                                            Listed are some of the factors that could cause your pods to stuck in a Pending state:

                                                                                                                                                                                                                                                                                                                                                            • Pods make requests that exceed what’s available across the nodes (the remaining allocatable pods). The Requests vs Allocatable ratio is an indicator of this issue.

                                                                                                                                                                                                                                                                                                                                                            • Pods make requests that are higher than the availability of every single Node. For example, you have 8-core nodes and you create a pod with a 16-core request. These pods might require configuration changes and specific setup related to node affinity and anti-affinity factors.

                                                                                                                                                                                                                                                                                                                                                            • The Quota set at the Namespace-level is reached before a high request is configured. The Requests vs Allocatable ratio might not suggest the problem, but the Pods Available vs Desired ratio would decrease, especially for the specific Namespaces. See the Namespaces screen in Overview.

                                                                                                                                                                                                                                                                                                                                                            Compliance Score

                                                                                                                                                                                                                                                                                                                                                            Docker: The latest value returned by avg(avg(compliance.k8s-bench.pass_pct)).

                                                                                                                                                                                                                                                                                                                                                            Kubernetes: The latest value returned by avg(avg(compliance.docker-bench.pass_pct)).

                                                                                                                                                                                                                                                                                                                                                            What Is it?

                                                                                                                                                                                                                                                                                                                                                            The numbers show the percentage of benchmarks that succeeded in the selected time window, respectively for Docker and Kubernetes entities.

                                                                                                                                                                                                                                                                                                                                                            What to Expect

                                                                                                                                                                                                                                                                                                                                                            If you do not have Sysdig Secure enabled, or you do not have benchmarks scheduled, then you should expect no data available.

                                                                                                                                                                                                                                                                                                                                                            Otherwise, the higher the score, the more compliant your infrastructure is.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            If the score is lower than expected, drill down to Docker Compliance Report or Kubernetes Compliance Report to see further details about benchmark checks and their results.

                                                                                                                                                                                                                                                                                                                                                            You may also want to use the Benchmarks / Results page in Sysdig Secure to see the history of checks.

                                                                                                                                                                                                                                                                                                                                                            2.2 -

                                                                                                                                                                                                                                                                                                                                                            Nodes Data

                                                                                                                                                                                                                                                                                                                                                            This topic discusses the Nodes Overview page and helps you understand its gauge charts and the data displayed on them.

                                                                                                                                                                                                                                                                                                                                                            About Nodes Overview

                                                                                                                                                                                                                                                                                                                                                            A node refers to a worker machine in Kubernetes. A physical machine or VM can represent a node. The Nodes Overview page provides key metrics indicating the health, capacity, and compliance of each node in your cluster.

                                                                                                                                                                                                                                                                                                                                                            In environments where no Sysdig Secure is enabled, Network I/O is shown instead of the Compliance score.

                                                                                                                                                                                                                                                                                                                                                            Interpret the Nodes Data

                                                                                                                                                                                                                                                                                                                                                            This topic gives insight into the metrics displayed on the Nodes Overview page.

                                                                                                                                                                                                                                                                                                                                                            Node Ready Status

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by avg(min(kubernetes.node.ready)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The number expresses the Node readiness to accept pods across the Cluster. The numeric availability indicates the percentage of time the Node is reported ready by Kubernetes. For example:

                                                                                                                                                                                                                                                                                                                                                            • 100% is displayed when a Node is ready for the entire time window, say, for the last one hour.

                                                                                                                                                                                                                                                                                                                                                            • 95% when the Node is ready for 95% of the time window, say, 57 out of 60 minutes.

                                                                                                                                                                                                                                                                                                                                                            The bar chart displays the trend across the selected time window, and each bar represents a time slice. For example, selecting “last 1 hour” displays 6 bars, each indicating a 10-minute time slice. Each bar shows the availability across the time slice (green) and the unavailability (red).

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below indicates the Node has not been ready for the entire last 1-hour time window:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The chart should show a constant 100% at all times.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            If the number is less than 100%, review the status reported by Kubernetes. Drill-down to the Kubernetes Node Overview Dashboard in Explore to see details about the Node readiness:

                                                                                                                                                                                                                                                                                                                                                            If the Node Ready Status has an alternating behavior, as shown in the image, the node is flapping. Flapping indicates that the kubelet is not healthy. See specific conditions reported by Kubernetes that would help determine the causes for the Node not being ready. Such conditions include network issues and memory pressure.

                                                                                                                                                                                                                                                                                                                                                            Pods Ready vs Allocatable

                                                                                                                                                                                                                                                                                                                                                            The chart reports the latest value of sum(avg(kubernetes.pod.status.ready)) / avg(avg(kubernetes.node.allocatable.pods)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            It is the ratio between available and allocatable pods configured on the node, averaged across the selected time window.

                                                                                                                                                                                                                                                                                                                                                            The Clusters page includes a similar chart named Pods Available vs Desired. However, the meaning is different:

                                                                                                                                                                                                                                                                                                                                                            • The Pods Available vs Desired chart for Clusters highlights how many pods you expect and how many are actually available. See IsPodAvailable for a detailed definition.

                                                                                                                                                                                                                                                                                                                                                            • The Pods Ready vs Allocatable chart for Nodes indicates how many pods can be scheduled on each Node and how many are actually ready.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the number of pods you can allocate in the node. See node configuration.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below indicates that you can allocate 110 pods in the Node (default configuration), but only 11 pods are ready:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The ratio does not relate to resource utilization, but it measures the pod density on each node. The more pods you have on a single node, the more effort the kubelet has to put in order to manage the pods, the routing mechanism, and Kubernetes overall.

                                                                                                                                                                                                                                                                                                                                                            Given the allocatable is properly set, values lower than 80% indicate a healthy status.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            • Reviewing the default maximum pods configuration of the kubelet to allow more pods, especially if the CPU and memory utilization is healthy.

                                                                                                                                                                                                                                                                                                                                                            • Adding more nodes to allow for more pods to be scheduled.

                                                                                                                                                                                                                                                                                                                                                            • Reviewing kubelet process performance and Node resource utilization in general. A higher ratio indicates high pressure on the operating system and for Kubernetes itself.

                                                                                                                                                                                                                                                                                                                                                            CPU Requests vs Allocatable

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(kubernetes.pod.resourceRequests.cpuCores)) / sum(avg(kubernetes.node.allocatable.cpuCores)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart shows the ratio between the number of CPU cores requested by the pods scheduled on the Node and the number of cores available to pods. The upper bound shows the CPU cores available to pods, which corresponds to the user-defined configuration for allocatable CPU.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows that the Node has 16 CPU cores available, out of which, 84% are requested by the pods scheduled on the Node:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            Expect a value up to 80%.

                                                                                                                                                                                                                                                                                                                                                            Assuming all the nodes have the same amount of allocatable resources, a reasonable upper bound is the value of (node_count - 1) / node_count x 100. For example, 90% if you have 9 nodes. Having a high ratio protects your system against a Node becoming unavailable.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            • A low ratio indicates the Node is underutilized. Drill up to the corresponding cluster in the Clusters page to determine whether the number of pods currently running is lower, or if the pods cannot run for other reasons.

                                                                                                                                                                                                                                                                                                                                                            • A high ratio indicates a potential risk of being unable to schedule additional pods on the Node.

                                                                                                                                                                                                                                                                                                                                                              Drill down to the  Kubernetes Node Overview Dashboard to evaluate what Namespaces, Workloads, and pods are running. Additionally, drill up in the Clusters page to evaluate whether you are over-committing the CPU resource. You might not have enough resources to fulfill requests, and consequently, pods might not be able to run on the Node. Consider adding Nodes or replacing Nodes with additional CPU cores.

                                                                                                                                                                                                                                                                                                                                                            Can the Value Be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Kubernetes schedules pods on Nodes where sufficient allocatable resources are available to fulfill the pod request. This means Kubernetes does not allow having a total request higher than the allocatable. Consequently, the ratio cannot be higher than 100%.

                                                                                                                                                                                                                                                                                                                                                            Over-committing (pods requesting resources higher than the capacity) results in a high Requests vs Allocatable ratio and a low Pods Available vs Desired ratio at the Cluster level. What it indicates is that most of the available resources are being used, consequently, what’s available is not sufficient to schedule additional pods. Therefore, Pods Available vs Desired ratio will also decrease.

                                                                                                                                                                                                                                                                                                                                                            Memory Requests vs Allocatable

                                                                                                                                                                                                                                                                                                                                                            The chart highlights the latest value returned by sum(avg(kubernetes.pod.resourceRequests.memBytes)) / sum(avg(kubernetes.node.allocatable.memBytes)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The ratio between the number of bytes of memory is requested by the pods scheduled on the node and the number of bytes of memory available.The upper bound shows the memory available to pods, which corresponds to the user-defined allocatable memory configuration.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below indicates the node has 62.8 GiB of memory available, out of which, 37% is requested by the pods scheduled on the Node:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            A healthy ratio falls under 80%.

                                                                                                                                                                                                                                                                                                                                                            Assuming all the nodes have the same amount of allocatable resources, a reasonable upper bound is the value of (node_count - 1) / node_count x 100. For example, the ratio is 90% if you have 9 nodes. Having a high ratio protects your system against a node becoming unavailable.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            • A low ratio indicates that the Node is underutilized. Drill up to the corresponding cluster in the Clusters page to determine whether the number of pods running is low, or if pods cannot run for other reasons.

                                                                                                                                                                                                                                                                                                                                                            • A high ratio indicates a potential risk of being unable to schedule additional pods on the node.

                                                                                                                                                                                                                                                                                                                                                              • Drill down to the  Kubernetes Node Overview dashboard to evaluate what Namespaces, Workloads, and pods are running.

                                                                                                                                                                                                                                                                                                                                                              • Additionally, drill up in the Clusters page to evaluate whether you are over-committing the memory resource. Consequently, you don’t have enough resources to fulfill requests, and pods might not be able to run. Consider adding nodes or replacing nodes with more memory.

                                                                                                                                                                                                                                                                                                                                                            Can the Value be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Kubernetes schedules pods on nodes where sufficient allocatable resources are available to fulfill the pod request. This means Kubernetes does not allow having a total request higher than the allocatable. Consequently, the ratio cannot be higher than 100%.

                                                                                                                                                                                                                                                                                                                                                            Over-committing (pods requesting for more resources than that are available) results in a high Requests vs Allocatable ratio at the Nodes level and a low Pods Available vs Desired ratio at the Cluster level. What it indicates is that most of the resources are being used, consequently, what’s available is not sufficient to schedule additional pods. Therefore, Pods Available vs Desired ratio will also decrease.

                                                                                                                                                                                                                                                                                                                                                            Network I/O

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by avg(avg(net.bytes.total)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The sparkline shows the trend of network traffic (inbound and outbound) for a Node. The number indicates the most recent rate of restarts per second.

                                                                                                                                                                                                                                                                                                                                                            For reference, the sparklines show the following number of steps (sampling):

                                                                                                                                                                                                                                                                                                                                                            • Last hour: 6 steps, each for a 10-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last 6 hours: 12 steps, each for a 20-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last day: 12 steps, each for a 2-hour time slice

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The metric highly depends on what type of applications run on the Node. You should expect some network activity for Kubernetes related operations.

                                                                                                                                                                                                                                                                                                                                                            Drilling down to the Kubernetes Node Overview Dashboard in Explore will provide additional details, such as network activity across pods.

                                                                                                                                                                                                                                                                                                                                                            2.3 -

                                                                                                                                                                                                                                                                                                                                                            Namespaces Data

                                                                                                                                                                                                                                                                                                                                                            This topic discusses the Namespaces Overview page and helps you understand its gauge charts and the data displayed on them.

                                                                                                                                                                                                                                                                                                                                                            About Namespaces Overview

                                                                                                                                                                                                                                                                                                                                                            Namespaces are virtual clusters on a physical cluster. They provide logical separation between the teams and their environments. The Namespaces Overview page provides key metrics indicating the health, capacity, and performance of each Namespace in your cluster.

                                                                                                                                                                                                                                                                                                                                                            Interpret the Namespaces Data

                                                                                                                                                                                                                                                                                                                                                            This topic gives insight into the metrics displayed on the Namespaces Overview screen.

                                                                                                                                                                                                                                                                                                                                                            Pod Restarts

                                                                                                                                                                                                                                                                                                                                                            The chart highlights the latest value returned by avg(timeAvg(kubernetes.pod.restart.rate)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The sparkline shows the trend of pod restarts rate across all the pods in a selected Namespace. The number shows the most recent rate of restarts per second.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image shows a rate of 0.04 restarts per second for the last 2-hours, given the selected time window is one day. The trend also suggests a non-flat pattern (periodic crashes).

                                                                                                                                                                                                                                                                                                                                                            • Last hour: 6 steps, each for a 10-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last 6 hours: 12 steps, each for a 20-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last day: 12 steps, each for a 2-hour time slice

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            Expect 0 restarts for any pod.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            A few restarts across the last one hour or larger time windows might not indicate a serious problem. In the event restart loop, identify the root cause as follows:

                                                                                                                                                                                                                                                                                                                                                            • Drill down to the Workloads page in Overview to identify the Workloads that have been stuck at a restart loop.

                                                                                                                                                                                                                                                                                                                                                            • Drill down to the Kubernetes Namespace Overview to see a detailed trend broken down by pods:

                                                                                                                                                                                                                                                                                                                                                            Pods Available vs Desired

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(kubernetes.namespace.pod.available.count)) / sum(avg(kubernetes.namespace.pod.desired.count)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart displays the ratio between available and desired pods, averaged across the selected time window, in a given Namespace.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the number of desired pods in the namespace.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows 42 desired pods that are available:

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            Expect 100% on the chart.

                                                                                                                                                                                                                                                                                                                                                            If certain pods take a significant amount of time to become available due to delays (image pull time, pod initialization, readiness probe) you might temporarily see a ratio lower than 100%.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            • Identify one or more Workloads that have low availability by drilling down to the Workloads page.

                                                                                                                                                                                                                                                                                                                                                            • Once you identify the Workload, drill down to the related dashboard in Explore. For example, Kubernetes Deployment Overview to determine the trend and the state of the pods.

                                                                                                                                                                                                                                                                                                                                                              For instance, in the following image, the ratio is 98% (3.93 / 4 x 100). The decline is due to an update that caused pods to be terminated and consequently to be started with a newer version.

                                                                                                                                                                                                                                                                                                                                                            CPU Used vs Requests

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(cpu.cores.used)) / sum(avg(kubernetes.pod.resourceRequests.cpuCores)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart shows the ratio between the total CPU usage across all the pods in the Namespace and the total CPU requested by all the pods.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the total CPU requested by all the pods. The value is expressed as the number of CPU cores.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows the pods in a Namespace requests for 40 CPU cores, of which only 43% is being used (about 17 cores):

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The value you see depends on the type of Workloads running in the Namespace.

                                                                                                                                                                                                                                                                                                                                                            Typically, values that fall between 80% and 120% is considered healthy. Values higher than 100% is considered healthy relatively for a short amount of time.

                                                                                                                                                                                                                                                                                                                                                            For applications whose resource usage is constant (such as background processes), expect the ratio to be close to 100%.

                                                                                                                                                                                                                                                                                                                                                            For “bursty” applications, such as an API server, expect the ratio to be less than 100%. Note that this value is averaged for the selected time window, therefore, a usage spike would be compensated by an idle period.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            A low usage indicates that the application is not properly running (not executing the expected functions) or the Workload configuration is not accurate (requests are too high compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            A high usage indicates that the application is operating with a heavy load or the workload configuration is not accurate (requests are too low compared to what pods actually need).

                                                                                                                                                                                                                                                                                                                                                            In either case, drill down to the Workloads page to determine the workload that requires a deeper analysis.

                                                                                                                                                                                                                                                                                                                                                            Can the Value Be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Yes, it can.

                                                                                                                                                                                                                                                                                                                                                            • You can configure requests without limits, or requests lower than the limits. In either case, you are allowing the containers to use more resources than requested, typically to handle temporary overloads.

                                                                                                                                                                                                                                                                                                                                                            • Consider a Namespace with two Workloads with one pod each. Say, one Workload is configured to request for 1 CPU core and uses 1 CPU core (ratio of Used vs Request is 100%). The other Workload is configured without any request and uses 1 CPU core. In this example, 2 CPU cores used to 1 CPU core requested ratio at the Namespace level is 200%.

                                                                                                                                                                                                                                                                                                                                                            Memory Used vs Requests

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(memory.bytes.used)) / sum(avg(kubernetes.pod.resourceRequests.memBytes)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart shows the ratio between the total memory usage across all pods of the Namespace and the total memory requested by all pods.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the total memory requested by all the pods, expressed in a specified unit for bytes.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows that all the pods in the Namespace requests for 120 GiB, of which only 24% is being used (about 29 GiB):

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            It depends on the type of Workloads you run in the Namespace. Typically, values that fall between 80% and 120% are considered healthy.

                                                                                                                                                                                                                                                                                                                                                            Values that are higher than 100% considered normal for a relatively short amount of time.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            A low usage indicates the application is not properly running (not executing the expected functions) or the workload configuration is not accurate (high requests compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            A high usage indicates the application is operating with a high load or the Workload configuration is not accurate (Fewer requests compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            Given the configured limits for the Workloads and the memory pressure on the nodes, if the Workloads use more memory than what’s requested they are at risk of eviction. See Exceed a Container’s Limit for more information.

                                                                                                                                                                                                                                                                                                                                                            In both cases, you may want to drill down to the Workloads page to determine which Workload requires a deeper analysis.

                                                                                                                                                                                                                                                                                                                                                            Can the Value Be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Yes, it can.

                                                                                                                                                                                                                                                                                                                                                            • You can configure requests without limits, or requests lower than the limits. In either case, you are allowing the containers to use more resources than requested, typically to handle temporary overloads.

                                                                                                                                                                                                                                                                                                                                                            • Consider a Namespace with two Workloads with one pod each. Say, one Workload is configured to request for 1 GiB of memory and uses 1 GiB (Used vs Request ratio is 100%). The other Workload is configured without any request and uses 1 GiB. In this example, 2 GiB of Memory Used to1 GiB Requested ratio at the Namespace level is 200%.

                                                                                                                                                                                                                                                                                                                                                            Network I/O

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by avg(avg(net.bytes.total)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The sparkline shows the trend of network traffic (inbound and outbound) for all the pods in the Namespace. The number shows the most recent rate, expressed in restarts per second.

                                                                                                                                                                                                                                                                                                                                                            For reference, the sparklines show the following number of steps (sampling):

                                                                                                                                                                                                                                                                                                                                                            • Last hour: 6 steps, each for a 10-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last 6 hours: 12 steps, each for a 30-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last day: 12 steps, each for a 2-hour time slice

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The type of applications run in the Namespace determine the metrics. Drilling down to the Kubernetes Namespace Overview Dashboard in Explore provides additional details, such as network activity across pods.

                                                                                                                                                                                                                                                                                                                                                            2.4 -

                                                                                                                                                                                                                                                                                                                                                            Workloads Data

                                                                                                                                                                                                                                                                                                                                                            This topic discusses the Workloads Overview page and helps you understand its gauge charts and the data displayed on them.

                                                                                                                                                                                                                                                                                                                                                            About Workloads Overview

                                                                                                                                                                                                                                                                                                                                                            Workloads, in Kubernetes terminology, refers to your containerized applications. Workloads comprise of Deployments, Statefulsets, and Daemonsets within a Namespace.

                                                                                                                                                                                                                                                                                                                                                            In a Cluster, worker nodes run your application workloads, whereas the master node provides the core Kubernetes services and orchestration for application workloads. The Workloads Overview page provides the key metrics indicating health, capacity, and compliance.

                                                                                                                                                                                                                                                                                                                                                            Interpret the Workloads Data

                                                                                                                                                                                                                                                                                                                                                            This topic gives insight into the metrics displayed on the Workloads Overview page.

                                                                                                                                                                                                                                                                                                                                                            Pod Restarts

                                                                                                                                                                                                                                                                                                                                                            The chart displays the latest value returned by sum(timeAvg(kubernetes.pod.restart.rate)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The sparkline shows the trend of Pod Restarts rate across all the pods in a selected Workload. The number shows the most recent rate, expressed in Restarts per Second.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows the trend for the last hour. The number indicates that the rate of pod restarts is less than 0.01 for the last 10 minutes.

                                                                                                                                                                                                                                                                                                                                                            For reference, the sparklines show the following number of steps (sampling):

                                                                                                                                                                                                                                                                                                                                                            • Last hour: 6 steps, each for a 10-minute time slice.

                                                                                                                                                                                                                                                                                                                                                            • Last 6 hours: 12 steps, each for a 20-minute time slice.

                                                                                                                                                                                                                                                                                                                                                            • Last day: 12 steps, each for a 2-hour time slice.

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            A healthy pod will have 0 restarts at any given time.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            In most cases, fewer restarts in the last hour (or larger time windows) do not indicate a serious problem. Drill down to the Kubernetes Overview Dashboard related to the Workload in Explore. For example, Kubernetes StatefulSet Overview provides a detailed trend broken down by pods.

                                                                                                                                                                                                                                                                                                                                                            In this example, the number of restarts is constant (roughly every 5 minutes) and no pods are ready. This might indicate a crash loop back-off .

                                                                                                                                                                                                                                                                                                                                                            Pods Available vs Desired

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value of returned by sum(avg(kubernetes.deployment.replicas.available)) / sum(avg(kubernetes.deployment.replicas.desired)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart displays the ratio between available and desired pods, averaged across the selected time window, for all the pods in a given Workload.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the number of desired pods in the Workload.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image below shows all the 42 desired pods are available.

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            You should typically expect 100%.

                                                                                                                                                                                                                                                                                                                                                            If certain pods take a significant amount of time to become available (image pull time, pod initialization, readiness probe), then you may temporarily see a ratio lower than 100%.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            Determine the Workloads that have low availability by drilling down to the related Dashboard in Explore. For example, the Kubernetes Deployment Overview helps understand the trend and the state of the pods.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image above shows that the ratio is 98% (3.93 / 4 x 100). The slight decline is due to an update that caused pods to be terminated and consequently to be started with a newer version.

                                                                                                                                                                                                                                                                                                                                                            CPU Used vs Requests

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(cpu.cores.used)) / sum(avg(kubernetes.pod.resourceRequests.cpuCores)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart shows the ratio between the total CPU usage across all pods of a selected Workload and the total CPU requested by all the pods.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the total CPU requested by all the pods. The value denotes the number of CPU cores.

                                                                                                                                                                                                                                                                                                                                                            In this image, the pods in the Workload requests for 40 CPU cores, of which 43% is actually used (about 17 cores).

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            It depends on the type of workload.

                                                                                                                                                                                                                                                                                                                                                            For applications (background processes) whose resource usage is constant, expect the ratio to be around 100%.

                                                                                                                                                                                                                                                                                                                                                            For “bursty” applications, such as an API server, expect the ratio to be lower than 100%. Note that the value is averaged for the selected time window, therefore, a usage spike would be compensated by an idle period.

                                                                                                                                                                                                                                                                                                                                                            Generally, values between 80% and 120% are considered normal. Values that are higher than 100% deemed normal if it’s observed only for a relatively short time.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            • A low usage indicates that the application is not properly running (not executing the expected functions) or the Workload configuration is not accurate (requests are too high compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            • A high usage indicates that the load is high for applications or the Workload configuration is not accurate (low requests compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            In either case, drill down to the Kubernetes Overview Dashboard corresponding to the Workload in Explore. For example, the Kubernetes Deployment Overview Dashboard provides insight into resource usage and configuration.

                                                                                                                                                                                                                                                                                                                                                            Can the Value Be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Yes, it can.

                                                                                                                                                                                                                                                                                                                                                            • Configuring CPU requests without limits or requests lower than limits is permissible. In these cases, you are allowing the containers to use more resources than requested, typically to handle temporary overloads.

                                                                                                                                                                                                                                                                                                                                                            • Consider a Workload with two containers. Say, one container is configured to request for 1 CPU core and uses 1 CPU core (Used vs Request ratio is 100%). The other is configured without any request and uses 1 CPU core. In this example, the 2 CPU core Used to 1 CPU core Requested ratio is 200% at the Workload level.

                                                                                                                                                                                                                                                                                                                                                            What Does “No Data” Mean?

                                                                                                                                                                                                                                                                                                                                                            If the Workload is configured with no requests and limits, then the Usage vs Requests ratio cannot be computed. In this case, the chart will show “no data”. Drill down to the Dashboard in Explore to evaluate the actual usage.

                                                                                                                                                                                                                                                                                                                                                            You must always configure requests. Setting requests helps to detect Workloads that require reconfiguration.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes itself might expose Workloads with no requests or limits configured. For example, the kube-system Namespace can have Workloads without requests configured.

                                                                                                                                                                                                                                                                                                                                                            Memory Used vs Requests

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by sum(avg(memory.bytes.used)) / sum(avg(kubernetes.pod.resourceRequests.memBytes)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The chart shows the ratio between the total memory usage across all the pods in a Workload and the total memory requested by the Workload.

                                                                                                                                                                                                                                                                                                                                                            The upper bound shows the total memory requested by all the pods, expressed in the specified unit of bytes.

                                                                                                                                                                                                                                                                                                                                                            For instance, the image shows that the pods in the selected Workload requested for 120 GiB, of which 24% is actually used (about 29 GiB).

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The type of Workload determines the ratio. Values between 80% and 120% are considered normal. Values that are higher than 100% is deemed normal if it’s observed only for a relatively short time.

                                                                                                                                                                                                                                                                                                                                                            What to Do Otherwise?

                                                                                                                                                                                                                                                                                                                                                            A low memory usage indicates that the application is not properly running (not executing the expected functions) or the Workload configuration is not accurate (requests are too high compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            A high memory usage indicates that the load is higher for applications or the Workload configuration is not accurate (low requests compared to what the pods actually need).

                                                                                                                                                                                                                                                                                                                                                            Given the configured limits for the Workloads and the memory pressure on the nodes, if the Workloads use more memory than what’s requested they are at risk of eviction. For more information, see Container’s Memory Limit.

                                                                                                                                                                                                                                                                                                                                                            In either case, drill down to the Workloads page to determine the Workload that requires a deeper analysis.

                                                                                                                                                                                                                                                                                                                                                            Can the Value Be Higher than 100%?

                                                                                                                                                                                                                                                                                                                                                            Yes, it can.

                                                                                                                                                                                                                                                                                                                                                            • Configuring memory requests without limits or requests lower than limits is permissible. In these cases, you are allowing the containers to use more resources than requested, typically to handle temporary overloads.

                                                                                                                                                                                                                                                                                                                                                            • Consider a Workload with two containers. Say, one container is configured to request for 1 GiB of memory and uses 1 GiB (Used vs Request ratio is 100%), while the other is configured without any request and uses 1 GiB of memory. In this example, the 2 GiB of memory used to 1 GiB requested ratio is 200% at the Workload level.

                                                                                                                                                                                                                                                                                                                                                            What Does “No Data” Mean?

                                                                                                                                                                                                                                                                                                                                                            If the Workload is configured with no memory requests and limits, then the Usage vs Requests ratio cannot be computed. In this case, the chart will show “no data”. Drill down to the Dashboard in Explore to evaluate the actual usage.

                                                                                                                                                                                                                                                                                                                                                            You must configure requests. It helps to detect Workloads that require reconfiguration.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes itself might expose Workloads with no requests or limits configured. For example, the kube-system Namespace can have Workloads without requests configured.

                                                                                                                                                                                                                                                                                                                                                            Network I/O

                                                                                                                                                                                                                                                                                                                                                            The chart shows the latest value returned by avg(avg(net.bytes.total)).

                                                                                                                                                                                                                                                                                                                                                            What Is It?

                                                                                                                                                                                                                                                                                                                                                            The sparkline shows the trend of network traffic (inbound and outbound) for the Workload. The number shows the most recent rate, expressed in bytes per second in a specific unit.

                                                                                                                                                                                                                                                                                                                                                            For reference, the sparklines show the following number of steps (sampling):

                                                                                                                                                                                                                                                                                                                                                            • Last hour: 6 steps, each for a 10-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last 6 hours: 12 steps, each for a 30-minute time slice

                                                                                                                                                                                                                                                                                                                                                            • Last day: 12 steps, each for a 2-hour time slice

                                                                                                                                                                                                                                                                                                                                                            What to Expect?

                                                                                                                                                                                                                                                                                                                                                            The type of application runs in the Workload determines the metrics. Drill down to the Kubernetes Overview Dashboard corresponding to the Workload in Explore. For example, the Kubernetes Deployment Overview Dashboard provides additional details, such as network activity across pods.

                                                                                                                                                                                                                                                                                                                                                            3 -

                                                                                                                                                                                                                                                                                                                                                            Explore

                                                                                                                                                                                                                                                                                                                                                            This feature is available in the Enterprise tier of the Sysdig product. See https://sysdig.com/pricing for details, or contact sales@sysdig.com.

                                                                                                                                                                                                                                                                                                                                                            The Sysdig Monitor web interface centers around the Explore module, where you perform the majority of infrastructure monitoring operations. Explore provides you the ability to view and troubleshoot key metrics and entities of your infrastructure stack. Sysdig Monitor automatically discovers your stack and presents pre-built views in Explore. You can drill down to any layers of your infrastructure hierarchy and view granular-level data. Grouping controls how entities are organized in Explore. Grouping is fully customizable by logical layers, such as containers, Kubernetes clusters, or services.

                                                                                                                                                                                                                                                                                                                                                            In addition to the Explore interface, Sysdig provides a PromQL Query Explorer and PromQL Library. They help you understand metrics and corresponding labels and values clearly, to create queries faster, and to build Dashboard and Alerts easily.

                                                                                                                                                                                                                                                                                                                                                            Learn more about using Explore in the following sections:

                                                                                                                                                                                                                                                                                                                                                            3.1 -

                                                                                                                                                                                                                                                                                                                                                            Explore Interface

                                                                                                                                                                                                                                                                                                                                                            The sections below outline the key areas of the interface and detail basic navigation steps.

                                                                                                                                                                                                                                                                                                                                                            The C-Frame Structure

                                                                                                                                                                                                                                                                                                                                                            The image below provides a complete view of the Explore UI in the Sysdig Monitor c-frame style interface:

                                                                                                                                                                                                                                                                                                                                                            There are several key areas highlighted in the image above:

                                                                                                                                                                                                                                                                                                                                                            • Product Selector: This allows you to switch between Sysdig products.

                                                                                                                                                                                                                                                                                                                                                            • Grouping: Groupings are hierarchical organizations of tags, allowing users to organize their infrastructure views using the Grouping Wizard in a logical hierarchy. For more information on groupings, refer to Grouping, Scoping, and Segmenting Metrics.

                                                                                                                                                                                                                                                                                                                                                            • Modules: Quick links for each of the main Sysdig Monitor modules: Explore, Dashboards, Alerts, Events, and Captures.

                                                                                                                                                                                                                                                                                                                                                            • PromQL Query Explorer: Run PromQL queries to build your infrastructure views and get an in-depth insight into what’s going on. See PromQL Query Explorer.

                                                                                                                                                                                                                                                                                                                                                            • Management: Quick links for Sysdig Spotlight, help material, and the user profile configuration settings.

                                                                                                                                                                                                                                                                                                                                                            • Drill-Down: This allows you to explore deep down the infrastructure stack and retrieve all the components in a certain category in a single organized element.

                                                                                                                                                                                                                                                                                                                                                            • Search Metrics and Dashboards: Helps you select desired metrics or dashboards.

                                                                                                                                                                                                                                                                                                                                                            • Time Navigation: Helps you customize the time window used for displaying data

                                                                                                                                                                                                                                                                                                                                                            • Key Page Actions: Quick links to create events, alerts, and dashboards.

                                                                                                                                                                                                                                                                                                                                                            3.2 -

                                                                                                                                                                                                                                                                                                                                                            Navigate the Explore Interface

                                                                                                                                                                                                                                                                                                                                                            This section helps you navigate the Explore menu in the Sysdig Monitor UI.

                                                                                                                                                                                                                                                                                                                                                            Switch Groupings

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor detects and collects the metrics associated with your infrastructure once the agent is deployed in your environment. Use the Explore UI to search, group, and troubleshoot your infrastructure components.

                                                                                                                                                                                                                                                                                                                                                            To switch between available data sources:

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore tab, click the My Groupings drop-down menu:

                                                                                                                                                                                                                                                                                                                                                            2. Select the desired grouping from the drop-down list.

                                                                                                                                                                                                                                                                                                                                                            Groupings Editor

                                                                                                                                                                                                                                                                                                                                                            The Groupings Editor helps you create and manage your infrastructure groupings.

                                                                                                                                                                                                                                                                                                                                                            Use Drill-Down Menu

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor users can drill down into the infrastructure by using the numerous dashboards and metrics available for display in the Explore UI. These displays can be found by selecting an infrastructure object, and opening the drill-down menu.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor only displays the metrics and dashboards that are relevant to the selected infrastructure object.

                                                                                                                                                                                                                                                                                                                                                            Metrics

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor users can view specific metrics for an infrastructure object by navigating the drill-down menu:

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore tab, open the drill-down menu.

                                                                                                                                                                                                                                                                                                                                                            2. Navigate to Search Metrics and Dashboard.

                                                                                                                                                                                                                                                                                                                                                            3. Select the desired metrics.

                                                                                                                                                                                                                                                                                                                                                              The metric will now be presented on the Explore UI, until the user navigates away from it.

                                                                                                                                                                                                                                                                                                                                                              The scope of the metric, when viewed via the drill-down menu, is set to the infrastructure object that you have selected.

                                                                                                                                                                                                                                                                                                                                                            Troubleshooting Views

                                                                                                                                                                                                                                                                                                                                                            The drill-down menu displays all the default dashboard templates relevant to the selected infrastructure object. These Troubleshooting Views are broken into the following sections:

                                                                                                                                                                                                                                                                                                                                                            The scope of the Troubleshooting View, when viewed via the drill-down menu, is set to the infrastructure object that you have selected from the drill-down.

                                                                                                                                                                                                                                                                                                                                                            To navigate to the Troubleshooting Views:

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore tab, select an infrastructure object.

                                                                                                                                                                                                                                                                                                                                                            2. Open the drill-down menu and select the desired infrastructure element

                                                                                                                                                                                                                                                                                                                                                            3. Navigate to Search Metrics and Dashboard.

                                                                                                                                                                                                                                                                                                                                                            4. Select the desired troubleshooting view.

                                                                                                                                                                                                                                                                                                                                                              The selected dashboard will now be presented on the screen, until you navigate away from it.

                                                                                                                                                                                                                                                                                                                                                            Pin and Unpin the Drill-Down Menu

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore tab, select an infrastructure object.

                                                                                                                                                                                                                                                                                                                                                            2. Open the drill-down menu.

                                                                                                                                                                                                                                                                                                                                                            3. Click Pin Menu to pin the menu to the Explore tab.

                                                                                                                                                                                                                                                                                                                                                              To unpin the menu, click Unpin Menu at the bottom of the menu.

                                                                                                                                                                                                                                                                                                                                                            3.3 -

                                                                                                                                                                                                                                                                                                                                                            PromQL Query Explorer

                                                                                                                                                                                                                                                                                                                                                            Use the PromQL Query Explorer to run PromQL queries and build infrastructure views. It allows you

                                                                                                                                                                                                                                                                                                                                                            • Write PromQL queries faster by automatically identifying the common labels and labels among different metrics.

                                                                                                                                                                                                                                                                                                                                                              See Run PromQL Queries Faster with Extended Label Set.

                                                                                                                                                                                                                                                                                                                                                            • Query metrics by leveraging advanced functions, operators, and boolean logic.

                                                                                                                                                                                                                                                                                                                                                            • Interactively modify the PromQL results by using visual label filtering.

                                                                                                                                                                                                                                                                                                                                                            • Use label filtering to visualize the common labels between metrics, which is key when combining multiple metrics.

                                                                                                                                                                                                                                                                                                                                                            About the PromQL Explorer UI

                                                                                                                                                                                                                                                                                                                                                            The main components of the PromQL Query Explorer UI include widgets, time navigation, and dashboard and time series panel.

                                                                                                                                                                                                                                                                                                                                                            You’ll find PromQL Explore under the Explore tab on the Sysdig Monitor UI.

                                                                                                                                                                                                                                                                                                                                                            PromQL Query

                                                                                                                                                                                                                                                                                                                                                            The PromQL field supports manually building PromQL queries. You can manually enter simple or complex PromQL queries and build dashboards and create alerts. The PromQL Query Explorer allows running up to 5 queries simultaneously. With the query field, you can do the following:

                                                                                                                                                                                                                                                                                                                                                            • Explore metrics and labels available in your infrastructure.

                                                                                                                                                                                                                                                                                                                                                              For example, calculate the number of bytes received in a selected host:

                                                                                                                                                                                                                                                                                                                                                              sysdig_host_net_total_bytes{host_mac="0a:e2:e8:b4:6c:1a"}
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                              Calculate the number of bytes received in all the hosts except one:

                                                                                                                                                                                                                                                                                                                                                              sysdig_host_net_total_bytes{host_mac!="0a:a3:4b:3e:db:a2"}
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                              Compare current data with historical data:

                                                                                                                                                                                                                                                                                                                                                              sysdig_host_net_total_bytes offset 7d
                                                                                                                                                                                                                                                                                                                                                              
                                                                                                                                                                                                                                                                                                                                                            • Use arithmetic operators to perform calculations on one or more metrics or labels.

                                                                                                                                                                                                                                                                                                                                                              For example, calculate the rate of incoming bytes and convert it to bits:

                                                                                                                                                                                                                                                                                                                                                              rate(sysdig_host_net_total_bytes[5m]) * 8
                                                                                                                                                                                                                                                                                                                                                              
                                                                                                                                                                                                                                                                                                                                                            • Build complex PromQL queries.

                                                                                                                                                                                                                                                                                                                                                              For example, return summary ingress traffic across all the network interfaces grouped by instances

                                                                                                                                                                                                                                                                                                                                                              sum(rate(sysdig_host_net_total_bytes[5m])) by (container_id)
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                            Label Filtering

                                                                                                                                                                                                                                                                                                                                                            Label filtering to automatically identify common labels between queries for vector matching. In the given example, you can see that A and B metrics have only the host_mac label in common.

                                                                                                                                                                                                                                                                                                                                                            You can also filter by using the relational operators available in the time series table. Simply click the operator for it to be automatically applied to the queries. Run the queries again to visualize the metrics.

                                                                                                                                                                                                                                                                                                                                                            Filtering simultaneously applies to all the queries in the PromQL Query Explorer.

                                                                                                                                                                                                                                                                                                                                                            Widgets

                                                                                                                                                                                                                                                                                                                                                            PromQL Query Explorer supports only time series (Timechart). You can run advanced (PromQL) queries and build dashboard panels. PromQL Explorer does not support building form-based queries.

                                                                                                                                                                                                                                                                                                                                                            Time Navigation

                                                                                                                                                                                                                                                                                                                                                            PromQL Query Explorer is designed around time. After a query has been executed, Sysdig Monitor polls the infrastructure data every 10 seconds and refreshes the metrics on the Dashboard panel. You select how to view this gathered data by choosing a Preset interval and a time Range. For more information, see Time Navigation.

                                                                                                                                                                                                                                                                                                                                                            Legend

                                                                                                                                                                                                                                                                                                                                                            The legend is positioned on the upper right corner of the panel. Each query will have associated legends listed in the same execution order.

                                                                                                                                                                                                                                                                                                                                                            Build a Query

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore tab, click PromQL Query.

                                                                                                                                                                                                                                                                                                                                                            2. Enter a PromQL query manually.

                                                                                                                                                                                                                                                                                                                                                              sysdig_host_cpu_used_percent
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                              Click Add Query to run multiple queries. You can run up to 5 queries at once.

                                                                                                                                                                                                                                                                                                                                                              sysdig_container_cpu_used_percent
                                                                                                                                                                                                                                                                                                                                                              
                                                                                                                                                                                                                                                                                                                                                            3. Click Run Query or press command+Enter.

                                                                                                                                                                                                                                                                                                                                                              A dashboard will appear on the screen. You can either Copy to a Dashboard or Create an Alert.

                                                                                                                                                                                                                                                                                                                                                            Copy to a Dashboard

                                                                                                                                                                                                                                                                                                                                                            1. Run a PromQL query.

                                                                                                                                                                                                                                                                                                                                                            2. Click Create > Create a Dashboard Panel.

                                                                                                                                                                                                                                                                                                                                                            3. Either select an existing Dashboard or enter the Dashboard name to copy to a new Dashboard.

                                                                                                                                                                                                                                                                                                                                                            4. Click Copy and Open.

                                                                                                                                                                                                                                                                                                                                                              The new Dashboard panel with the given title will open to the Dashboard tab.

                                                                                                                                                                                                                                                                                                                                                              You might want to continue with the Dashboard operations as given in Dashboards.

                                                                                                                                                                                                                                                                                                                                                            Create an Alert

                                                                                                                                                                                                                                                                                                                                                            1. Run a PromQL query.

                                                                                                                                                                                                                                                                                                                                                            2. Click Create > Create Alert.

                                                                                                                                                                                                                                                                                                                                                            3. If you have multiple queries, select the query you want to create the alert for.

                                                                                                                                                                                                                                                                                                                                                              A new PromQL Alert page for the selected query appears on the screen.

                                                                                                                                                                                                                                                                                                                                                              Continue with PromQL Alerts.

                                                                                                                                                                                                                                                                                                                                                            Remove a Query

                                                                                                                                                                                                                                                                                                                                                            Click the three dots next to the query field to remove the query.

                                                                                                                                                                                                                                                                                                                                                            Toggle Query Results

                                                                                                                                                                                                                                                                                                                                                            Click the respective query buttons, for example, A or B, to show or hide query results.

                                                                                                                                                                                                                                                                                                                                                            3.4 -

                                                                                                                                                                                                                                                                                                                                                            PromQL Library

                                                                                                                                                                                                                                                                                                                                                            PromQL is a powerful language to query metrics, but it could be challenging for beginners. To ease the learning curve of PromQL, Sysdig provides a set of curated examples, called PromQL Library. It helps you perform complex queries against your metrics with one click and get insight into your infrastructure problems which was not previously possible with Sysdig querying. For example, identify containers > 90% limit and counting pods per namespace, and so on.

                                                                                                                                                                                                                                                                                                                                                            You have the following categories currently to experiment with PromQL:

                                                                                                                                                                                                                                                                                                                                                            • Kubernetes

                                                                                                                                                                                                                                                                                                                                                            • Infrastructure

                                                                                                                                                                                                                                                                                                                                                            • Troubleshooting

                                                                                                                                                                                                                                                                                                                                                            • PromQL 101

                                                                                                                                                                                                                                                                                                                                                            Access PromQL Library

                                                                                                                                                                                                                                                                                                                                                            1. Log in to Sysdig Monitor.

                                                                                                                                                                                                                                                                                                                                                            2. Click Explore from the left navigation pane.

                                                                                                                                                                                                                                                                                                                                                            3. On the Explore tab, click PromQL Library.

                                                                                                                                                                                                                                                                                                                                                              The tab opens to a list of PromQL examples.

                                                                                                                                                                                                                                                                                                                                                            Use PromQL Library

                                                                                                                                                                                                                                                                                                                                                            Click Try me to open PromQL Query Explore. A visualization corresponding to the query will be displayed. You can do the following with the query:

                                                                                                                                                                                                                                                                                                                                                            • Create a dashboard panel

                                                                                                                                                                                                                                                                                                                                                            • Create an alert

                                                                                                                                                                                                                                                                                                                                                            See PromQL Query Explorer for more information.

                                                                                                                                                                                                                                                                                                                                                            To copy a query, click the copy icon next to the query.

                                                                                                                                                                                                                                                                                                                                                            Filter PromQL Queries

                                                                                                                                                                                                                                                                                                                                                            Automatic tag filtering identifies common tags in the given examples. You can use the following to filter queries:

                                                                                                                                                                                                                                                                                                                                                            • Visual label filtering: Simply click the desired color-coded label to filter queries based on tags.

                                                                                                                                                                                                                                                                                                                                                            • Text search: Use the Text Search bar on the top-left navigation pane.

                                                                                                                                                                                                                                                                                                                                                            • Label search: Use the Label drop-down list on the top-left navigation pane.

                                                                                                                                                                                                                                                                                                                                                            • Filter using categories: Use the All Categories checkboxes.

                                                                                                                                                                                                                                                                                                                                                            3.5 -

                                                                                                                                                                                                                                                                                                                                                            Groupings Editor

                                                                                                                                                                                                                                                                                                                                                            Groupings are hierarchical organizations of labels, allowing you to organize your infrastructure views on the Explore UI in a logical hierarchy.

                                                                                                                                                                                                                                                                                                                                                            An example grouping is shown below:

                                                                                                                                                                                                                                                                                                                                                            The example above groups the infrastructure into four levels. This results in a tree view in the Groupings Editor with four levels, with rows for each infrastructure object applicable to each level.

                                                                                                                                                                                                                                                                                                                                                            As each label is selected, Sysdig Monitor automatically filters out labels for the next selection that no longer fit the hierarchy, to ensure that only logical groupings are created.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor automatically organizes all the configured groupings that are inapplicable to the current infrastructure under Inapplicable Groupings.

                                                                                                                                                                                                                                                                                                                                                            Manage Groupings

                                                                                                                                                                                                                                                                                                                                                            You can perform the following operations using the Groupings Editor:

                                                                                                                                                                                                                                                                                                                                                            • Search existing groupings

                                                                                                                                                                                                                                                                                                                                                            • Create a new grouping

                                                                                                                                                                                                                                                                                                                                                            • Edit an existing grouping

                                                                                                                                                                                                                                                                                                                                                            • Rename a groupings

                                                                                                                                                                                                                                                                                                                                                            • Share a grouping with the active team

                                                                                                                                                                                                                                                                                                                                                            Search for a Grouping

                                                                                                                                                                                                                                                                                                                                                            1. Do one of the following:

                                                                                                                                                                                                                                                                                                                                                              • From Explore, click the Groupings drop-down. Search for the desired grouping.

                                                                                                                                                                                                                                                                                                                                                                Either select the desired grouping, or search for it by scrolling down the list or by using the search bar, and then select it.

                                                                                                                                                                                                                                                                                                                                                              • Click Manage Groupings and open the Groupings Editor.

                                                                                                                                                                                                                                                                                                                                                                Either select the desired grouping, or search for it by scrolling down the list or by using the search bar, and then select it.

                                                                                                                                                                                                                                                                                                                                                            Create a New Grouping

                                                                                                                                                                                                                                                                                                                                                            1. In the Explore tab, click the Groupings drop-down, then click Manage Groupings.

                                                                                                                                                                                                                                                                                                                                                            2. Open the Groupings Editor.

                                                                                                                                                                                                                                                                                                                                                            3. Click Add.

                                                                                                                                                                                                                                                                                                                                                              The New Groupings page is displayed.

                                                                                                                                                                                                                                                                                                                                                            4. Enter the following information:

                                                                                                                                                                                                                                                                                                                                                              • Groupings Name: Set an appropriate name to identify the grouping that you are creating.

                                                                                                                                                                                                                                                                                                                                                              • Shared with Team: Select if you want to share the grouping with the active team that you are part of.

                                                                                                                                                                                                                                                                                                                                                              • Hierarchy: Determine the hierarchical representation of the grouping by choosing a top-level label and subsequent ones. Repeat adding the labels until there are no further layers available in the infrastructure label hierarchy.

                                                                                                                                                                                                                                                                                                                                                                You can search for the label by entering the first few characters in the Select label drop-down or scrolling down. As you add labels, the preview displays associated components in your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            5. Check the preview to ensure that the label selection is correct.

                                                                                                                                                                                                                                                                                                                                                            6. Click Save&Apply.

                                                                                                                                                                                                                                                                                                                                                            Rename a Grouping

                                                                                                                                                                                                                                                                                                                                                            Renaming is allowed only for groupings that are owned by you. To rename a shared grouping, create a copy of it and edit the name.

                                                                                                                                                                                                                                                                                                                                                            1. Do one of the following in Explore:

                                                                                                                                                                                                                                                                                                                                                              • Click the Groupings drill-down. Search for the desired grouping. Click the Edit button next to the grouping.

                                                                                                                                                                                                                                                                                                                                                              • Click the Groupings drill-down and click Manage Groupings.

                                                                                                                                                                                                                                                                                                                                                            2. Open the Groupings Editor.

                                                                                                                                                                                                                                                                                                                                                            3. Either select the desired grouping, or search for it by scrolling down the list or by using the search bar, and then select it.

                                                                                                                                                                                                                                                                                                                                                            4. Click Edit.

                                                                                                                                                                                                                                                                                                                                                              The edit window is displayed on the screen.

                                                                                                                                                                                                                                                                                                                                                            5. Specify the new grouping name, then click Save& Apply to save the changes.

                                                                                                                                                                                                                                                                                                                                                            Share a Grouping with Your Active Team

                                                                                                                                                                                                                                                                                                                                                            Custom groupings are owned by you, and therefore you can share them with all the members of your active team. To share a default grouping, create a custom grouping and use the Shared with Team option in the Grouping Editor.

                                                                                                                                                                                                                                                                                                                                                            1. Click the Groupings drill-down and click Manage Groupings.

                                                                                                                                                                                                                                                                                                                                                              The Grouping Editor screen appears.

                                                                                                                                                                                                                                                                                                                                                            2. Highlight the relevant grouping and click Edit.

                                                                                                                                                                                                                                                                                                                                                            3. Click Shared with Team.

                                                                                                                                                                                                                                                                                                                                                            4. Click Save &Apply to save the changes.

                                                                                                                                                                                                                                                                                                                                                            To share a default grouping, create a custom grouping and then use the Shared with Team option in the Grouping Editor.

                                                                                                                                                                                                                                                                                                                                                            3.6 -

                                                                                                                                                                                                                                                                                                                                                            Time Windows

                                                                                                                                                                                                                                                                                                                                                            By default, Sysdig Monitor displays information in Live mode. This means that dashboards, panels, and the Explore table will be automatically updated with new data as time passes, and will display the most recent data available for the configured time window.

                                                                                                                                                                                                                                                                                                                                                            By default, time navigation will enter Live mode with a two hour time window.

                                                                                                                                                                                                                                                                                                                                                            The time window navigation bar provides users with quick links to common time windows, as well as the ability to configure a custom time period in order to review historical data.

                                                                                                                                                                                                                                                                                                                                                            As shown in the image above, the navigation bar provides a number of pieces of information:

                                                                                                                                                                                                                                                                                                                                                            • The state of the data (Live or Past).

                                                                                                                                                                                                                                                                                                                                                            • The current time window.

                                                                                                                                                                                                                                                                                                                                                            • The configured timezone.

                                                                                                                                                                                                                                                                                                                                                            In addition, the navigation bar provides:

                                                                                                                                                                                                                                                                                                                                                            • Quick links for common time windows (one second, one minute, ten minutes, one hour, six hours, one day, and two weeks).

                                                                                                                                                                                                                                                                                                                                                            • A custom time window configuration option.

                                                                                                                                                                                                                                                                                                                                                            • A pause/play button to exit Live mode and freeze the data to a time window, and to return to Live mode.

                                                                                                                                                                                                                                                                                                                                                            • Step back/forward buttons to jump through a frozen time window to review historical data.

                                                                                                                                                                                                                                                                                                                                                            • Zoom in/out buttons to increase/decrease the time window.

                                                                                                                                                                                                                                                                                                                                                            Configure a Custom Time Period

                                                                                                                                                                                                                                                                                                                                                            The Time Navigation dropdown panel can be used to configure a specific time range. To configure a manual range:

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore tab or the Dashboards tab, click the Custom link in the time navigation bar.

                                                                                                                                                                                                                                                                                                                                                            2. Configure the start and end points, and click the Adjust Time button to save the changes.

                                                                                                                                                                                                                                                                                                                                                            Some limitations apply to custom time windows. Refer to the Time Window Limitations section for more information.

                                                                                                                                                                                                                                                                                                                                                            Time Window Limitations

                                                                                                                                                                                                                                                                                                                                                            Some time window configurations may not be available in certain situations. In these instances, a modification to the time window is automatically applied, and a warning notification will be displayed:

                                                                                                                                                                                                                                                                                                                                                            There are two main reasons for a time window being unavailable. Both relate to data granularity and specificity:

                                                                                                                                                                                                                                                                                                                                                            • The time window specifies the granularity of data that has expired and is no longer available. For example, a time window specifying a one-hour time range from six months ago would not be available, resulting in the time window being modified to a time range of at least one day.

                                                                                                                                                                                                                                                                                                                                                            • The time window specifies a granularity of data that is too high given the size of the window, as a graph can only handle a certain number of data points. For example, a multi-hour time range would contain too many datapoints at one-minute granularity, and would automatically be modified to 10-minute granularity.

                                                                                                                                                                                                                                                                                                                                                            3.7 -

                                                                                                                                                                                                                                                                                                                                                            Visualize Metrics Using the Topology View

                                                                                                                                                                                                                                                                                                                                                            Topology View provides an interactive, animated interface to visualize how different components in your system interact with each other in real-time. The interface by default renders a selected host’s top processes and their interaction with processes on remote hosts or host groups. Entities in any valid logical grouping can be visually represented with Topology View. The interaction is depicted as nodes and links. Links connect nodes. Nodes and links radially expand from the left.

                                                                                                                                                                                                                                                                                                                                                            • Nodes: The entities participating in network communication. A node could be a process, a container, a host, or any label identified by Sysdig Agent. For example: kubernetes.pod.name.

                                                                                                                                                                                                                                                                                                                                                              The limit imposed on the maximum number of nodes that can be visualized is as follows:

                                                                                                                                                                                                                                                                                                                                                              • host groups (hosts in a region): 20

                                                                                                                                                                                                                                                                                                                                                              • hosts: 20

                                                                                                                                                                                                                                                                                                                                                              • containers: 20

                                                                                                                                                                                                                                                                                                                                                              • processes: 10

                                                                                                                                                                                                                                                                                                                                                              This limit applies to the entities at any node level.

                                                                                                                                                                                                                                                                                                                                                            • Links: The network connection between nodes.

                                                                                                                                                                                                                                                                                                                                                            Hosts and their child processes (host.hostName > proc.name) serve as the default grouping for the Topology View. Scaling a Topology View is limited by the number of processes and connections. Sysdig Monitor creates the Topology View by identifying network endpoints (IP addresses) derived from system call data.

                                                                                                                                                                                                                                                                                                                                                            Topology View in the Explore tab provides pre-defined dashboards to represent CPU Usage, Network``Traffic, and Response Time metrics, and as such, they are not configurable. It serves as a template diagnostic tool that provides bottom-up inter-process connection metrics as graphs. For a detailed description on each default metric type supported by the Topology View, see Topology Dashboards.

                                                                                                                                                                                                                                                                                                                                                            About Topology View

                                                                                                                                                                                                                                                                                                                                                            The UI allows you to zoom in, zoom out, fit, pan, and reset the display, and expand and collapse the nodes.

                                                                                                                                                                                                                                                                                                                                                            Zoom into an entity by clicking the associated + icon and view the top processes within the selected entity. You can expand the hosts to see individual processes running inside and corresponding metric value. The links indicating network connections are rendered between entities in the selected hierarchy and, when zoomed, between processes and entities. Use the mouse scroll wheel to zoom the content, then left-click and drag to move the map components within the window.

                                                                                                                                                                                                                                                                                                                                                            Topology View uses legends and color schemes. This section explains what they mean.

                                                                                                                                                                                                                                                                                                                                                            • The line width (thickness) of the links expresses metric values relative to other connections in the system. Scaling line width is relative to the metric values associated with each link. Topology View dynamically adjusts the legend as the data recorded for each connection changes.

                                                                                                                                                                                                                                                                                                                                                              • The thin black line(—): Lower metric values.

                                                                                                                                                                                                                                                                                                                                                              • The thick black line(): Higher metric values.

                                                                                                                                                                                                                                                                                                                                                              • Dashed lines(—): A previously existed connection. For example, an active connection between node A and B, rendered as a solid line, could dissolve after a data update if that connection does not exist anymore in the new data set. Instead of removing it from the graph, Sysdig Monitor depicts the inactive connection as a dotted line until the next data update. If the connection returns in a subsequent data update, it is rendered again as a solid line. If the connection does not exist, the line will be removed.

                                                                                                                                                                                                                                                                                                                                                            • The color scheme is determined by color coding for the customizable threshold. The nodes are identified by five colors:

                                                                                                                                                                                                                                                                                                                                                              • Green indicates the node is healthy.

                                                                                                                                                                                                                                                                                                                                                              • Yellow indicates the node is in a warning state.

                                                                                                                                                                                                                                                                                                                                                              • Red indicates the node is in a critical state.

                                                                                                                                                                                                                                                                                                                                                              • Grey indicates the node state is unknown. An unknown node typically signifies a node where no Sysdig Agent installed.

                                                                                                                                                                                                                                                                                                                                                              • Light Blue indicates no threshold is defined for the metrics.

                                                                                                                                                                                                                                                                                                                                                            Configure Topology View

                                                                                                                                                                                                                                                                                                                                                            You can navigate to the Topology View from the Explore or Dashboard menus.

                                                                                                                                                                                                                                                                                                                                                            Access from Explore

                                                                                                                                                                                                                                                                                                                                                            The types of Topology View you can see on Explore are pre-defined dashboard templates that contain only a single panel, and as such, they can’t be edited.

                                                                                                                                                                                                                                                                                                                                                            1. Click Explore.

                                                                                                                                                                                                                                                                                                                                                            2. On the Explore tab, click the Switch Data Source drop-down menu.

                                                                                                                                                                                                                                                                                                                                                            3. Select Sysdig Agents.

                                                                                                                                                                                                                                                                                                                                                            4. In the My Groupings drop-down, select the desired data source for which you want a topology view.

                                                                                                                                                                                                                                                                                                                                                            5. From the Search Metrics and Dashboard drill-down, select Topology.

                                                                                                                                                                                                                                                                                                                                                            6. Select one of the three pre-defined Dashboards.

                                                                                                                                                                                                                                                                                                                                                              The Topology View for the selected metrics will appear on the screen.

                                                                                                                                                                                                                                                                                                                                                            3.8 -

                                                                                                                                                                                                                                                                                                                                                            Explore Workflows

                                                                                                                                                                                                                                                                                                                                                            While every user has unique needs from Sysdig Monitor, there are three main workflows that you can follow when building out the interface and monitoring your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            Workflow One

                                                                                                                                                                                                                                                                                                                                                            This workflow assumes that an alert has not been triggered yet.

                                                                                                                                                                                                                                                                                                                                                            Start with Explore , identify a problem area, then drill-down into the data. This workflow is the most basic approach, as it begins with a user monitoring the overall infrastructure, rather than with a specific alert notification. The workflow tends to follow the following steps:

                                                                                                                                                                                                                                                                                                                                                            1. Organize the infrastructure with groupings.

                                                                                                                                                                                                                                                                                                                                                            2. Define key signals with alerts and dashboards to detect a problem.

                                                                                                                                                                                                                                                                                                                                                            3. Identify a problem area, and drill down into the data using dashboards, metrics, and by adjusting groupings and scope as necessary.

                                                                                                                                                                                                                                                                                                                                                            Workflow Two

                                                                                                                                                                                                                                                                                                                                                            Start with an event notification, and begin troubleshooting. This workflow begins with an already configured alert and event being triggered. Unlike workflow one, this workflow assumes that pre-determined data boundaries have already been set:

                                                                                                                                                                                                                                                                                                                                                            1. Explore the event by adjusting time windows, scope, and segmentation.

                                                                                                                                                                                                                                                                                                                                                            2. Identify the exact area of concern within the infrastructure.

                                                                                                                                                                                                                                                                                                                                                            3. Drill down into the data to troubleshoot the issue.

                                                                                                                                                                                                                                                                                                                                                            Workflow Three

                                                                                                                                                                                                                                                                                                                                                            Customize default dashboard panels to troubleshoot a potential issue. This workflow assumes that an issue has been identified within one of the default dashboards, but alerts have not been set up for the problem area.

                                                                                                                                                                                                                                                                                                                                                            1. Copy the displayed panel to a new dashboard.

                                                                                                                                                                                                                                                                                                                                                            2. Create an alert based on the dashboard panel.

                                                                                                                                                                                                                                                                                                                                                            3. Configure a Sysdig Capture on demand.

                                                                                                                                                                                                                                                                                                                                                            4 -

                                                                                                                                                                                                                                                                                                                                                            Metrics

                                                                                                                                                                                                                                                                                                                                                            Metrics are quantitative values or measures that can be grouped/divided by labels. Sysdig Monitor metrics are divided into two groups: default metrics (out-of-the-box metrics concerning the system, orchestrator, and network infrastructure), and custom metrics(JMX, StatsD, and multiple other integrated application metrics).

                                                                                                                                                                                                                                                                                                                                                            Sysdig automatically collects all types of metrics, and auto-labels them. Custom metrics can also have custom (user-defined) labels.

                                                                                                                                                                                                                                                                                                                                                            Out-of-the box, when an agent has been deployed on a host, Sysdig Monitor automatically begins collecting and reporting on a wide array of metrics. The sections below describe how those metrics are conceptualized within the system.

                                                                                                                                                                                                                                                                                                                                                            Learn more about the metrics types and the data aggregation techniques supported by Sysdig Monitor in the following sections:

                                                                                                                                                                                                                                                                                                                                                            4.1 -

                                                                                                                                                                                                                                                                                                                                                            Grouping, Scoping, and Segmenting Metrics

                                                                                                                                                                                                                                                                                                                                                            Data aggregation and filtering in Sysdig Monitor are done through the use of assigned labels. The sections below explain how labels work, the ways they can be used, and how to work with groupings, scopes, and segments.

                                                                                                                                                                                                                                                                                                                                                            Labels

                                                                                                                                                                                                                                                                                                                                                            Labels are used to identify and differentiate characteristics of a metric, allowing them to be aggregated or filtered for Explore module views, dashboards, alerts, and captures. Labels can be used in different ways:

                                                                                                                                                                                                                                                                                                                                                            • To group infrastructure objects into logical hierarchies displayed on the Explore tab (called groupings). For more information, refer to Groupings .

                                                                                                                                                                                                                                                                                                                                                            • To split aggregated data into segments. For more information, refer to Segments.

                                                                                                                                                                                                                                                                                                                                                            There are two types of labels:

                                                                                                                                                                                                                                                                                                                                                            • Infrastructure labels

                                                                                                                                                                                                                                                                                                                                                            • Metric descriptor labels

                                                                                                                                                                                                                                                                                                                                                            Infrastructure Labels

                                                                                                                                                                                                                                                                                                                                                            Infrastructure labels are used to identify objects or entities within the infrastructure that a metric is associated with, including hosts, containers, and processes. An example label is shown below:

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.name
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The table below outlines what each part of the label represents:

                                                                                                                                                                                                                                                                                                                                                            Example Label ComponentDescription
                                                                                                                                                                                                                                                                                                                                                            kubernetesThe infrastructure type.
                                                                                                                                                                                                                                                                                                                                                            podThe object.
                                                                                                                                                                                                                                                                                                                                                            nameThe label key.

                                                                                                                                                                                                                                                                                                                                                            Infrastructure labels are obtained from the infrastructure (including from orchestrators, platforms, and the runtime processes), and Sysdig automatically builds a relationship model using the labels. This allows users to create logical hierarchical groupings to better aggregate the infrastructure objects in the Explore module.

                                                                                                                                                                                                                                                                                                                                                            For more information on groupings, refer to the Groupings.

                                                                                                                                                                                                                                                                                                                                                            Metric Descriptor Labels

                                                                                                                                                                                                                                                                                                                                                            Metric descriptor labels are custom descriptors or key-value pairs applied directly to metrics, obtained from integrations like StatsD, Prometheus, and JMX. Sysdig automatically collects custom metrics from these integrations, and parses the labels from them. Unlike infrastructure labels, these labels can be arbitrary, and do not necessarily map to any entity or object.

                                                                                                                                                                                                                                                                                                                                                            Metric descriptor labels can only be used for segmenting, not grouping or scoping.

                                                                                                                                                                                                                                                                                                                                                            An example metric descriptor label is shown below:

                                                                                                                                                                                                                                                                                                                                                            website_failedRequests:20|region=‘Asia’, customer_ID=‘abc’
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The table below outlines what each part of the label represents:

                                                                                                                                                                                                                                                                                                                                                            Example Label ComponentDescription
                                                                                                                                                                                                                                                                                                                                                            website_failedRequestsThe metric name.
                                                                                                                                                                                                                                                                                                                                                            20The metric value.
                                                                                                                                                                                                                                                                                                                                                            region=‘Asia’, customer_ID=‘abc’The metric descriptor labels. Multiple key-value pairs can be assigned using a comma separated list.

                                                                                                                                                                                                                                                                                                                                                            Sysdig recommends not using labels to store dimensions with high cardinalities (numerous different label values), such as user IDs, email addresses, URLs, or other unbounded sets of values. Each unique key-value label pair represents a new time series, which can dramatically increase the amount of data stored.

                                                                                                                                                                                                                                                                                                                                                            Groupings

                                                                                                                                                                                                                                                                                                                                                            Groupings are hierarchical organizations of labels, allowing users to organize their infrastructure views on the Explore tab in a logical hierarchy. An example grouping is shown below:

                                                                                                                                                                                                                                                                                                                                                            The example above groups the infrastructure into four levels. This results in a tree view in the Explore module with four levels, with rows for each infrastructure object applicable to each level.

                                                                                                                                                                                                                                                                                                                                                            As each label is selected, Sysdig Monitor automatically filters out labels for the next selection that no longer fit the hierarchy, to ensure that only logical groupings are created.

                                                                                                                                                                                                                                                                                                                                                            The example below shows the logical hierarchy structure for Kubernetes:

                                                                                                                                                                                                                                                                                                                                                            • Clusters: Cluster > Namespace > Replicaset > Pod

                                                                                                                                                                                                                                                                                                                                                            • Namespace: Cluster > Namespace > HorizontalPodAutoscaler > Deployment > Pod

                                                                                                                                                                                                                                                                                                                                                            • Daemonsets : Cluster > Namespace > Daemonsets > Pod

                                                                                                                                                                                                                                                                                                                                                            • Services: Cluster > Namespace > Service > StatefulSet > Pod

                                                                                                                                                                                                                                                                                                                                                            • Job: Cluster > Namespace > Job > Pod

                                                                                                                                                                                                                                                                                                                                                            • ReplicationController: Cluster > Namespace > ReplicationController > Pod

                                                                                                                                                                                                                                                                                                                                                            The default groupings are immutable: They cannot be modified or deleted. However, you can make a copy of them that you can modify.

                                                                                                                                                                                                                                                                                                                                                            Unified Workload Labels

                                                                                                                                                                                                                                                                                                                                                            Sysdig provides the following labels to help improve your infrastructure organization and troubleshooting easier.

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.workload.name: Displays all the Kubernetes workloads and indicates what type and name of workload resource (deployment, daemonSet, replicaSet, and so on) it is.

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.workload.type: Indicates what type of workload resource (deployment, daemonSet, replicaSet, and so on) it is.

                                                                                                                                                                                                                                                                                                                                                            The availability of these labels also simplifies Groupings. You do not need different groupings for each type of deployment, instead, you have a single grouping for workloads.

                                                                                                                                                                                                                                                                                                                                                            The labels allow you to segment metrics, such as cpu.used.percent , by kubernetes.workload.name to see CPU usage for all the workloads, instead of having a separate query for segmenting by kubernetes.deployment.name, kubernetes.replicaSet.name , and so on.

                                                                                                                                                                                                                                                                                                                                                            Learn More

                                                                                                                                                                                                                                                                                                                                                            Scopes

                                                                                                                                                                                                                                                                                                                                                            A scope is a collection of labels that are used to filter out or define the boundaries of a group of data points when creating dashboards, dashboard panels, alerts, and teams. An example scope is shown below:

                                                                                                                                                                                                                                                                                                                                                            In the example above, the scope is defined by two labels with operators and values defined. The table below defines each of the available operators.

                                                                                                                                                                                                                                                                                                                                                            OperatorDescription
                                                                                                                                                                                                                                                                                                                                                            isThe value matches the defined label value exactly.
                                                                                                                                                                                                                                                                                                                                                            is notThe value does not match the defined label value exactly.
                                                                                                                                                                                                                                                                                                                                                            inThe value is among the comma separated values entered.
                                                                                                                                                                                                                                                                                                                                                            not inThe value is not among the comma separated values entered.
                                                                                                                                                                                                                                                                                                                                                            containsThe label value contains the defined value.
                                                                                                                                                                                                                                                                                                                                                            does not containThe label value does not contain the defined value.

                                                                                                                                                                                                                                                                                                                                                            The scope editor provides dynamic filtering capabilities. It restricts the scope of the selection for subsequent filters by rendering valid values that are specific to the previously selected label. Expand the list to view unfiltered suggestions. At run time, users can also supply custom values to achieve more granular filtering. The custom values are preserved. Note that changing a label higher up in the hierarchy might render the subsequent labels incompatible. For example, changing the kubernetes.namespace.name > kubernetes.deployment.name hierarchy to swarm.service.name > kubernetes.deployment.name is invalid as these entities belong to different orchestrators and cannot be logically grouped.

                                                                                                                                                                                                                                                                                                                                                            Dashboards and Panels

                                                                                                                                                                                                                                                                                                                                                            Dashboard scopes define the criteria for what metric data will be included in the dashboard’s panels. The current dashboard’s scope can be seen at the top of the dashboard:

                                                                                                                                                                                                                                                                                                                                                            By default, all dashboard panels abide by the scope of the overall dashboard. However, an individual panel scope can be configured for a different scope than the rest of the dashboard.

                                                                                                                                                                                                                                                                                                                                                            For more information on Dashboards and Panels, refer to the Dashboards documentation.

                                                                                                                                                                                                                                                                                                                                                            Alerts

                                                                                                                                                                                                                                                                                                                                                            Alert scopes are defined during the creation process, and specify what areas within the infrastructure the alert is applicable for. In the example alerts below, the first alert has a scope defined, whereas the second alert does not have a custom scope defined. If no scope is defined, the alert is applicable to the entire infrastructure.

                                                                                                                                                                                                                                                                                                                                                            For more information on Alerts, refer to the Alerts documentation.

                                                                                                                                                                                                                                                                                                                                                            Teams

                                                                                                                                                                                                                                                                                                                                                            A team’s scope determines the highest level of data that team members have visibility for:

                                                                                                                                                                                                                                                                                                                                                            • If a team’s scope is set to Host, team members can see all host-level and container-level information.

                                                                                                                                                                                                                                                                                                                                                            • If a team’s scope is set to Container, team members can only see container-level information.

                                                                                                                                                                                                                                                                                                                                                            A team’s scope only applies to that team. Users that are members of multiple teams may have different visibility depending on which team is active.

                                                                                                                                                                                                                                                                                                                                                            For more information on teams and configuring team scope, refer to the Manage Teams and Roles documentation.

                                                                                                                                                                                                                                                                                                                                                            Segments

                                                                                                                                                                                                                                                                                                                                                            Aggregated data can be split into smaller sections by segmenting the data with labels. This allows for the creation of multi-series comparisons and multiple alerts. In the first image, the metric is not segmented:

                                                                                                                                                                                                                                                                                                                                                            In the second image, the same metric has been segmented by container.id:

                                                                                                                                                                                                                                                                                                                                                            Line and Area panels can display up to five different segments for any given metric. The example image below displays the net.byte.in metric segmented by both container.id and net.http.url:

                                                                                                                                                                                                                                                                                                                                                            For more information regarding segmentation in dashboard panels, refer to the Configure Panels documentation. For more information regarding configuring alerts, refer to the Alerts documentation.

                                                                                                                                                                                                                                                                                                                                                            The Meaning of n/a

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor imports data related to entities such as hosts, containers, processes, and so on, and reports them in tables or panels on the Explore and Dashboards UI, as well as in events, so across the UI you see varieties of data. The term n/a can appear anywhere on the UI where some form of data is displayed.

                                                                                                                                                                                                                                                                                                                                                            n/a is a term that indicates data that is not available or that it does not apply to a particular instance. In Sysdig parlance, the term signifies one or more entities defined by a particular label, such as hostname or Kubernetes service, for which the label is invalid. In other words, n/a collectively represent entities whose metadata is not relevant to aggregation and filtering techniques—Grouping, Scoping, and Segmenting. For instance, a list of Kubernetes services might display the list of all the services as well as n/a that includes all the containers without the metadata describing a Kubernetes service.

                                                                                                                                                                                                                                                                                                                                                            You might encounter n/a sporadically in Explore UI as well as in drill-down panels or dashboards, events, and likely elsewhere on the Sysdig Monitor UI when no relevant metadata is available for that particular display. How n/a should be treated depends on the nature of your deployment. The deployment will not be affected by the entities marked n/a.

                                                                                                                                                                                                                                                                                                                                                            The following are some of the cases that yield n/a on the UI:

                                                                                                                                                                                                                                                                                                                                                            • Labels are partially available or not available. For example, a host has entities that are not associated with a monitored Kubernetes deployment, or a monitored host has an unmonitored Kubernetes deployment running.

                                                                                                                                                                                                                                                                                                                                                            • Labels that do not apply to the grouping criteria or at the hierarchy level. For example:

                                                                                                                                                                                                                                                                                                                                                              • Containers that are not managed by Kubernetes. The containers managed by Kubernetes are identified with their  container.name labels.

                                                                                                                                                                                                                                                                                                                                                              • In certain groupings by DaemonSet, Deployments render N/A and vice versa. Not all containers belong to both DaemonSet and Deployment objects concurrently. Likewise, a Kubernetes ReplicaSet grouping with the  kubernetes.replicaset.name label will not show StatefulSets.

                                                                                                                                                                                                                                                                                                                                                              • In a kubernetes.cluster.name > kubernetes.namespace.name > kubernetes.deployment.name  grouping, the entities without the kubernetes.cluster.name label yield n/a.

                                                                                                                                                                                                                                                                                                                                                            • Entities are incorrectly labeled in the infrastructure.

                                                                                                                                                                                                                                                                                                                                                            • Kubernetes features that are yet to be in sync with Sysdig Monitoring.

                                                                                                                                                                                                                                                                                                                                                            • The format is not applicable to a particular record in the database.

                                                                                                                                                                                                                                                                                                                                                            4.2 -

                                                                                                                                                                                                                                                                                                                                                            Understanding Default, Custom, and Missing Metrics

                                                                                                                                                                                                                                                                                                                                                            Default Metrics

                                                                                                                                                                                                                                                                                                                                                            Default metrics include various kinds of metadata which Sysdig Monitor automatically knows how to label, segment, and display.

                                                                                                                                                                                                                                                                                                                                                            For example:

                                                                                                                                                                                                                                                                                                                                                            • System metrics for hosts, containers, and processes (CPU used, etc.)

                                                                                                                                                                                                                                                                                                                                                            • Orchestrator metrics (collected from Kubernetes, Mesos, etc.)

                                                                                                                                                                                                                                                                                                                                                            • Network metrics (e.g. network traffic)

                                                                                                                                                                                                                                                                                                                                                            • HTTP

                                                                                                                                                                                                                                                                                                                                                            • Platform metrics (in some cases)

                                                                                                                                                                                                                                                                                                                                                            Default metrics are collected mainly from two sources: syscalls and Kubernetes.

                                                                                                                                                                                                                                                                                                                                                            Custom Metrics

                                                                                                                                                                                                                                                                                                                                                            About Custom Metrics

                                                                                                                                                                                                                                                                                                                                                            Custom metrics generally refer to any metrics that the Sysdig Agent collects from some third-party integration. The type of infrastructure and applications integrated determine the custom metrics that the Agent collects and reports to Sysdig Monitor. The supported custom metrics are:

                                                                                                                                                                                                                                                                                                                                                            Each metric comes with a set of custom labels, and additional labels can be user-created. Sysdig Monitor simply collects and reports them with minimal or no internal processing. The limit currently enforced is 3000 metrics per host. Use the metrics_filter option in the dragent.yaml file to remove unwanted metrics or to choose the metrics to report when hosts exceed this limit. For more information on editing the dragent.yaml file, see Understanding the Agent Config Files.

                                                                                                                                                                                                                                                                                                                                                            Unit for Custom Metrics

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor detects the default unit of custom metrics automatically with the delimiter suffix in the metrics name. For example, custom_expvar_time_seconds results in a base unit set to seconds. The supported base units are byte, percent, and time. Custom metrics name should carry one of the following delimiter suffixes in order for Sysdig Monitor to identify and configure the accurate unit type.

                                                                                                                                                                                                                                                                                                                                                            • second

                                                                                                                                                                                                                                                                                                                                                            • seconds

                                                                                                                                                                                                                                                                                                                                                            • byte

                                                                                                                                                                                                                                                                                                                                                            • bytes

                                                                                                                                                                                                                                                                                                                                                            • total (represents accumulating count)

                                                                                                                                                                                                                                                                                                                                                            • percent

                                                                                                                                                                                                                                                                                                                                                            Custom metrics will not be auto-detected and the unit will be incorrect unless this naming convention is followed. For instance, custom_byte_expvar will not yield the correct unit, that is MiB.

                                                                                                                                                                                                                                                                                                                                                            Editing the Unit Scale

                                                                                                                                                                                                                                                                                                                                                            You have the flexibility to change the unit scale either by editing the panel on the Dashboard or in the Explore.

                                                                                                                                                                                                                                                                                                                                                            Explore

                                                                                                                                                                                                                                                                                                                                                            From the Search Metrics and Dashboard drop-down, select the custom metrics you want to edit the unit selection for, then click More Options. Select the desired unit scale from the Metric Format drop-down and click Save.

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Select the Dashboard Panel associated with the custom metrics you want to modify. Select the desired unit scale from the Metrics drop-down and click Save.

                                                                                                                                                                                                                                                                                                                                                            Display Missing Data

                                                                                                                                                                                                                                                                                                                                                            Data can be missing for a few different reasons:

                                                                                                                                                                                                                                                                                                                                                            • Problems such as faulty network connectivity in the communication channel between your infrastructure and Sysdig metrics store.

                                                                                                                                                                                                                                                                                                                                                            • Metrics or StatsD batch jobs are submitted sporadically.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor allows you to configure the behavior of missing data in Dashboards. Though metric type determines the default behavior, you can configure how to visualize missing data and define it at the per-query level. Use the No Data Display drop-down in the Options menu in the panel configuration. See Create a New Panel for more information.

                                                                                                                                                                                                                                                                                                                                                            Consider the following guidelines:

                                                                                                                                                                                                                                                                                                                                                            • The No Data Display drop-down has only two options for the Stacked Area timechart: gap and show as zero.

                                                                                                                                                                                                                                                                                                                                                            • For the Number panel, the No Data Display option allows entering a custom no data text.

                                                                                                                                                                                                                                                                                                                                                            • For form-based timechart panels, the default option for a metrics selection that does not contain a StatsD metric is gap.

                                                                                                                                                                                                                                                                                                                                                            • Adding a StatsD metric to a query in a form-based timechart panel will default the selected No Data Display type to the show as zero , which is the default option for form-based StatsD metrics. You can change this selection to any other type.

                                                                                                                                                                                                                                                                                                                                                            • The default display option is gap for PromQL Timechart panels.

                                                                                                                                                                                                                                                                                                                                                            The options for No Data Display are:

                                                                                                                                                                                                                                                                                                                                                            • gap: The default option for form-based timechart panel, where a query metrics selection does not contain a StatsD metric. gap is the best visualization type for most use cases because it is easy to spot indicating a problem.

                                                                                                                                                                                                                                                                                                                                                            • show as zero: The best option for StatsD metrics which are only submitted sporadically. For example, batch jobs and count of errors. This is the default display option for StatsD metrics in form-based panels.

                                                                                                                                                                                                                                                                                                                                                              We do not recommend this option as setting zero could be misleading. For example, this setting will report the value for free disk space as 0% when the disk or host disappears, but in reality, the value is unknown.

                                                                                                                                                                                                                                                                                                                                                              Prometheus best practices recommend avoiding missing metrics.

                                                                                                                                                                                                                                                                                                                                                            • connect - solid: Use for measuring the value of a metric, typically a gauge, where you want to visualize the missing samples flattened.

                                                                                                                                                                                                                                                                                                                                                              The leftmost and rightmost visible data points can be connected as Sysdig does not perform the interpolation.

                                                                                                                                                                                                                                                                                                                                                            • connect - dotted: Use it for measuring the value of a metric, typically a gauge, where you want to visualize the missing samples flattened.

                                                                                                                                                                                                                                                                                                                                                              The leftmost and rightmost visible data points can be connected as Sysdig does not perform the interpolation.

                                                                                                                                                                                                                                                                                                                                                            4.3 -

                                                                                                                                                                                                                                                                                                                                                            Prometheus Metrics Types

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor transforms Prometheus metrics into usable, actionable entries in two ways:

                                                                                                                                                                                                                                                                                                                                                            Calculated Metrics

                                                                                                                                                                                                                                                                                                                                                            The Prometheus metrics that are scraped by the Sysdig agent and transformed into the traditional StatsD model are called calculated metrics. In calculated metrics, the delta is stored with the previous value. This delta is what Sysdig uses on the classic backend for metrics analyzing and visualization. While generating the calculated metrics, the gauge metrics are kept as they are, but the counter metrics are transformed.

                                                                                                                                                                                                                                                                                                                                                            Prometheus calculated metrics cannot be used in PromQL.

                                                                                                                                                                                                                                                                                                                                                            The Histogram and Summary metrics are transformed into a different format called Prometheus histogram and summary metrics respectively. The transformations include:

                                                                                                                                                                                                                                                                                                                                                            • All of the quantiles are transformed into a different metric, with the quantile added as a suffix.

                                                                                                                                                                                                                                                                                                                                                            • The count and sum of these summary metrics are exposed as different metrics with names slightly changed. _ (underscore) in the name is replaced with a period .. For more information, see Mapping Between Classic Metrics and PromQL Metrics.

                                                                                                                                                                                                                                                                                                                                                            Prometheus calculated metrics (legacy metrics) are scheduled to be deprecated in the coming months.

                                                                                                                                                                                                                                                                                                                                                            Raw Metrics

                                                                                                                                                                                                                                                                                                                                                            In Sysdig parlance, the Prometheus metrics that are scraped (by the Sysdig agent), collected, sent, stored, visualized, and presented exactly as Prometheus exposes them are called raw metrics. Raw metrics are used with PromQL.

                                                                                                                                                                                                                                                                                                                                                            Sysdig counter is a StatsD type counter, where the difference in value is kept, but not the raw value of the counter, whereas Prometheus raw metrics are counters that are always monotonically increasing. A rate function needs to be applied on Prometheus raw metrics to make sense of it.

                                                                                                                                                                                                                                                                                                                                                            Time Aggregations Over Prometheus Metrics

                                                                                                                                                                                                                                                                                                                                                            The following time aggregations are supported for both the metric types:

                                                                                                                                                                                                                                                                                                                                                            • Average: Returns an average of a set of data points, keeping all the labels.

                                                                                                                                                                                                                                                                                                                                                            • Maximum and Minimum: Returns a maximal or minimal value, keeping all the labels.

                                                                                                                                                                                                                                                                                                                                                            • Sum: Returns a sum of the values of data points, keeping all the labels.

                                                                                                                                                                                                                                                                                                                                                            • Rate (timeAvg): Returns a sum of changes to the counter across data points in a given time period and divides by time, keeping all the labels as they are. For Prometheus raw metrics, timeAvg is calculated by taking the difference and dividing it by time.

                                                                                                                                                                                                                                                                                                                                                            Prometheus Calculated Metrics

                                                                                                                                                                                                                                                                                                                                                            Prometheus calculated metrics are treated as gauges by Sysdig, and there the following time aggregations are available:

                                                                                                                                                                                                                                                                                                                                                            • Average

                                                                                                                                                                                                                                                                                                                                                            • Sum

                                                                                                                                                                                                                                                                                                                                                            • Minimum

                                                                                                                                                                                                                                                                                                                                                            • Maximum

                                                                                                                                                                                                                                                                                                                                                            Rate (timeAvg) is not available because they are not applicable to gauge metrics.

                                                                                                                                                                                                                                                                                                                                                            Prometheus Raw Metrics

                                                                                                                                                                                                                                                                                                                                                            For the gauge type, the following types are available:

                                                                                                                                                                                                                                                                                                                                                            • Average

                                                                                                                                                                                                                                                                                                                                                            • Minimum

                                                                                                                                                                                                                                                                                                                                                            • Maximum

                                                                                                                                                                                                                                                                                                                                                            For the counter type, the following types are available:

                                                                                                                                                                                                                                                                                                                                                            • Rate: Calculates the first derivative of the counter (change over time).

                                                                                                                                                                                                                                                                                                                                                            • Sum: Calculates a complete change of the counter over a period of time.

                                                                                                                                                                                                                                                                                                                                                            4.4 -

                                                                                                                                                                                                                                                                                                                                                            Heuristic and Deprecated Metrics

                                                                                                                                                                                                                                                                                                                                                            Heuristic Metrics

                                                                                                                                                                                                                                                                                                                                                            Various network-related metrics reported by Sysdig, including response times, are calculated at the kernel level by measuring latency between systems calls. In an effort to ensure Sysdig remains the trusted source of infrastructure insights, moving forward we will be labeling some network related metrics as heuristic and are tagging with the symbol in the application.

                                                                                                                                                                                                                                                                                                                                                            Existing alerts using these metrics will not be modified or disabled. However, these alerts will not be able to be updated.

                                                                                                                                                                                                                                                                                                                                                            Additional heuristic metric details are listed below:

                                                                                                                                                                                                                                                                                                                                                            MetricSet New Alerts
                                                                                                                                                                                                                                                                                                                                                            net.http.request.timeYes
                                                                                                                                                                                                                                                                                                                                                            net.http.request.countYes
                                                                                                                                                                                                                                                                                                                                                            net.http.error.countYes
                                                                                                                                                                                                                                                                                                                                                            net.sql.request.timeYes
                                                                                                                                                                                                                                                                                                                                                            net.sql.request.countYes
                                                                                                                                                                                                                                                                                                                                                            net.sql.error.countYes
                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.timeYes
                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.countYes
                                                                                                                                                                                                                                                                                                                                                            net.mongodb.error.countYes
                                                                                                                                                                                                                                                                                                                                                            net.request.time.file.percentYes
                                                                                                                                                                                                                                                                                                                                                            net.request.time.local.percentYes
                                                                                                                                                                                                                                                                                                                                                            net.request.time.net.percentYes
                                                                                                                                                                                                                                                                                                                                                            net.request.time.nextTiers.percentYes
                                                                                                                                                                                                                                                                                                                                                            net.request.time.processing.percentYes
                                                                                                                                                                                                                                                                                                                                                            net.request.timeNo
                                                                                                                                                                                                                                                                                                                                                            net.request.time.inNo
                                                                                                                                                                                                                                                                                                                                                            net.request.time.outNo
                                                                                                                                                                                                                                                                                                                                                            net.request.time.worst.inNo
                                                                                                                                                                                                                                                                                                                                                            net.request.time.worst.outNo
                                                                                                                                                                                                                                                                                                                                                            net.request.countNo
                                                                                                                                                                                                                                                                                                                                                            net.request.count.inNo

                                                                                                                                                                                                                                                                                                                                                            Deprecated Metrics:

                                                                                                                                                                                                                                                                                                                                                            Based on low usage patterns, Sysdig has decided to deprecate the following metrics on August 1, 2018. Users will continue to have the ability to collect similar data using Prometheus, or another method of code instrumentation (i.e. StatsD or JMX for Java applications).

                                                                                                                                                                                                                                                                                                                                                            The table below shows the current metrics and options for similar functionality.

                                                                                                                                                                                                                                                                                                                                                            Current MetricAlternative Starting August 1, 2018
                                                                                                                                                                                                                                                                                                                                                            capacity.estimated.request.stolen.countCreate your application metrics using Prometheus, StatsD or JMX for Java applications.
                                                                                                                                                                                                                                                                                                                                                            capacity.estimated.request.total.count
                                                                                                                                                                                                                                                                                                                                                            capacity.stolen.percent
                                                                                                                                                                                                                                                                                                                                                            capacity.total.percent
                                                                                                                                                                                                                                                                                                                                                            capacity.used.percent
                                                                                                                                                                                                                                                                                                                                                            net.request.time.file
                                                                                                                                                                                                                                                                                                                                                            net.request.time.local
                                                                                                                                                                                                                                                                                                                                                            net.request.time.net
                                                                                                                                                                                                                                                                                                                                                            net.request.time.nextTiers
                                                                                                                                                                                                                                                                                                                                                            net.request.time.processing
                                                                                                                                                                                                                                                                                                                                                            net.sql.request.time.worstMax aggregation (net.sql.request.time)
                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.time.worstMax aggregation (net.mongodb.request.time)
                                                                                                                                                                                                                                                                                                                                                            net.http.request.time.worstMax aggregation (net.http.request.time)

                                                                                                                                                                                                                                                                                                                                                            4.5 -

                                                                                                                                                                                                                                                                                                                                                            Manage Metric Scale

                                                                                                                                                                                                                                                                                                                                                            Sysdig provides several knobs for managing metric scale.

                                                                                                                                                                                                                                                                                                                                                            There are three primary ways in which you could include/exclude metrics, should you encounter unwanted metrics limits.

                                                                                                                                                                                                                                                                                                                                                            1. Include/exclude custom metrics by name filters.

                                                                                                                                                                                                                                                                                                                                                              See Include/Exclude Custom Metrics.

                                                                                                                                                                                                                                                                                                                                                            2. Include/exclude metrics emitted by certain containers, Kubernetes annotations, or any other container label at collection time.

                                                                                                                                                                                                                                                                                                                                                              See Prioritize/Include/Exclude Designated Containers.

                                                                                                                                                                                                                                                                                                                                                            3. Exclude metrics from unwanted ports.

                                                                                                                                                                                                                                                                                                                                                              See Blacklist Ports.

                                                                                                                                                                                                                                                                                                                                                            4.6 -

                                                                                                                                                                                                                                                                                                                                                            Data Aggregation

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor allows users to adjust the aggregation settings when graphing or creating alerts for a metric, informing how Sysdig rolls up the available data samples in order to create the chart or evaluate the alert. There are two forms of aggregation used for metrics in Sysdig: time aggregation and group aggregation.

                                                                                                                                                                                                                                                                                                                                                            Time aggregation is always performed before group aggregation.

                                                                                                                                                                                                                                                                                                                                                            Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Time aggregation comes into effect in two overlapping situations:

                                                                                                                                                                                                                                                                                                                                                            • Charts can only render a limited number of data points. To look at a wide range of data, Sysdig Monitor may need to aggregate granular data into larger samples for visualization.

                                                                                                                                                                                                                                                                                                                                                            • Sysdig Monitor rolls up historical data over time.

                                                                                                                                                                                                                                                                                                                                                              Sysdig retains rollups based on each aggregation type, to allow users to choose which data points to utilize when evaluating older data.

                                                                                                                                                                                                                                                                                                                                                            Sysdig agents collect 1-second samples and report data at 10-second resolution. The data is stored and reported every 10-second with the available aggregations (average, rate, min, max, sum) to make them available via the Sysdig Monitor UI and the API. For time series charts covering five minutes or less, data points are drawn at this 10-second resolution, and any time aggregation selections will have no effect. When an amount of time greater than five minutes is displayed, data points are drawn as an aggregate for an appropriate time interval. For example, for a chart covering one hour, each data point would reflect a one minute interval.

                                                                                                                                                                                                                                                                                                                                                            At time intervals of one minute and above, charts can be configured to display different aggregates for the 10-second metrics used to calculate each datapoint.

                                                                                                                                                                                                                                                                                                                                                            Aggregation TypeDescription
                                                                                                                                                                                                                                                                                                                                                            averageThe average of the retrieved metric values across the time period.
                                                                                                                                                                                                                                                                                                                                                            rateThe average value of the metric across the time period evaluated.
                                                                                                                                                                                                                                                                                                                                                            maximumThe highest value during the time period evaluated.
                                                                                                                                                                                                                                                                                                                                                            minimumThe lowest value during the time period evaluated.
                                                                                                                                                                                                                                                                                                                                                            sumThe combined sum of the metric across the time period evaluated.

                                                                                                                                                                                                                                                                                                                                                            In the example images below, the kubernetes.deployment.replicas.available metrics first uses the average for time aggregation:

                                                                                                                                                                                                                                                                                                                                                            Then uses the sum for time aggregation:

                                                                                                                                                                                                                                                                                                                                                            • Rate and average are very similar and often provide the same result. However, the calculation of each is different.

                                                                                                                                                                                                                                                                                                                                                              • If time aggregation is set to one minute, the agent is supposed to retrieve six samples (one every 10 seconds).

                                                                                                                                                                                                                                                                                                                                                              • In some cases, samples may not be there, due to disconnections or other circumstances. For this example, four samples are available. If this was the case, the average would be calculated by dividing by four, while the rate would be calculated by dividing by six.

                                                                                                                                                                                                                                                                                                                                                            • Most metrics are sampled once for each time interval, resulting in average and rate returning the same value. However, there will be a distinction for any metrics not reported at every time interval. For example, some custom statsd metrics.

                                                                                                                                                                                                                                                                                                                                                            • Rate is currently referred to as timeAvg in the Sysdig Monitor API and advanced alerting language.

                                                                                                                                                                                                                                                                                                                                                            • By default, average is used when displaying data points for a time interval.

                                                                                                                                                                                                                                                                                                                                                            Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Metrics applied to a group of items (for example, several containers, hosts, or nodes) are averaged between the members of the group by default. For example, three hosts report different CPU usage for one sample interval. The three values will be averaged, and reported on the chart as a single datapoint for that metric.

                                                                                                                                                                                                                                                                                                                                                            There are several different types of group aggregation:

                                                                                                                                                                                                                                                                                                                                                            Aggregation TypeDescription
                                                                                                                                                                                                                                                                                                                                                            averageThe average value of the interval’s samples.
                                                                                                                                                                                                                                                                                                                                                            maximumThe maximum value of the interval’s samples.
                                                                                                                                                                                                                                                                                                                                                            minimumThe minimum value of the interval’s samples.
                                                                                                                                                                                                                                                                                                                                                            sumThe combined value of all of the interval’s samples.

                                                                                                                                                                                                                                                                                                                                                            If a chart or alert is segmented, the group aggregation settings will be utilized for both aggregations across the whole group, and aggregation within each individual segmentation.

                                                                                                                                                                                                                                                                                                                                                            For example, the image below shows a chart for CPU% across the infrastructure:

                                                                                                                                                                                                                                                                                                                                                            When segmented by proc.name, the chart shows one CPU% line for each process:

                                                                                                                                                                                                                                                                                                                                                            Each line provides the average value for every process with the same name. To see the difference, change the group aggregation type to sum:

                                                                                                                                                                                                                                                                                                                                                            The metric aggregation value showed beside the metric name is for the time aggregation. While the screenshot shows AVG, the group aggregation is set to SUM.

                                                                                                                                                                                                                                                                                                                                                            Aggregation Examples

                                                                                                                                                                                                                                                                                                                                                            The tables below provide an example of how each type of aggregation works. The first table provides the metric data, while the second displays the resulting value for each type of aggregation.

                                                                                                                                                                                                                                                                                                                                                            In the example below, the CPU% metric is applied to a group of servers called webserver. The first chart shows metrics using average aggregation for both time and group. The second chart shows the metrics using maximum aggregation for both time and group.

                                                                                                                                                                                                                                                                                                                                                            For each one minute interval, the second chart renders the highest CPU usage value found from the servers in the webserver group and from all of the samples reported during the one minute interval. This view can be useful when searching for transient spikes in metrics over long periods of time, that would otherwise be missed with average aggregation.

                                                                                                                                                                                                                                                                                                                                                            The group aggregation type is dependent on the segmentation. For a view showing metrics for a group of items, the current group aggregation setting will revert to the default setting, if the Segment By selection is changed.

                                                                                                                                                                                                                                                                                                                                                            4.7 -

                                                                                                                                                                                                                                                                                                                                                            Metric Limits

                                                                                                                                                                                                                                                                                                                                                            Sysdig ensures that you see the most relevant metric information relevant to your monitored environment. To achieve this, limits are enforced on the number of metrics that the datastore can store. Different limits apply to different metric types and agent versions.

                                                                                                                                                                                                                                                                                                                                                            Enterprise

                                                                                                                                                                                                                                                                                                                                                            The metric limits are automatically set by the Sysdig backend components based on your plan, agent version, and backend configuration. The default limits are provided below:

                                                                                                                                                                                                                                                                                                                                                            Metrics TypesMetrics LimitDescription
                                                                                                                                                                                                                                                                                                                                                            Prometheus8000Set other custom metric limits to zero to increase the Prometheus metrics limit to 10,000.
                                                                                                                                                                                                                                                                                                                                                            StatsD1000
                                                                                                                                                                                                                                                                                                                                                            JMX500
                                                                                                                                                                                                                                                                                                                                                            AppChecks500
                                                                                                                                                                                                                                                                                                                                                            Total10,000The total number of custom metrics across all metric types should not exceed 10,000.

                                                                                                                                                                                                                                                                                                                                                            The custom metrics limit of 10,000 does not include the agent metrics that are provided out-of-the-box, such as host, container, and Kube State Metrics.

                                                                                                                                                                                                                                                                                                                                                            View Metric Limits

                                                                                                                                                                                                                                                                                                                                                            Use the Sysdig Agent Health & Status dashboard under Host Infrastructure templates to view current usage per host for each metric type.

                                                                                                                                                                                                                                                                                                                                                            The metric limits are exposed to the UI through the following agent metrics.

                                                                                                                                                                                                                                                                                                                                                            MetricsDescription
                                                                                                                                                                                                                                                                                                                                                            dragent.metricCount.limit.appCheckThe maximum number of unique appCheck timeseries that are allowed in an individual sample from the agent per node.
                                                                                                                                                                                                                                                                                                                                                            dragent.metricCount.limit.statsdThe maximum number of unique statsd timeseries that are allowed in an individual sample from the agent per node.
                                                                                                                                                                                                                                                                                                                                                            dragent.metricCount.limit.jmxThe maximum number of unique JMX timeseries that are allowed in an individual sample from the agent per node.
                                                                                                                                                                                                                                                                                                                                                            dragent.metricCount.limit.prometheusThe maximum number of unique Prometheus timeseries that are allowed in an individual sample from the agent per node.

                                                                                                                                                                                                                                                                                                                                                            Learn More

                                                                                                                                                                                                                                                                                                                                                            5 -

                                                                                                                                                                                                                                                                                                                                                            Metrics Dictionary

                                                                                                                                                                                                                                                                                                                                                            The Sysdig metrics dictionary lists all current default metrics supported by the Sysdig product suite, as well as kube state and cloud provider metrics. The Metrics Dictionary is a living document and is updated as new metrics are added to the product.

                                                                                                                                                                                                                                                                                                                                                            Overview

                                                                                                                                                                                                                                                                                                                                                            Each metric in the dictionary has several pieces of metadata listed to provide greater context for how the metric can be used within Sysdig products. An example layout is displayed below:

                                                                                                                                                                                                                                                                                                                                                            Metric Name

                                                                                                                                                                                                                                                                                                                                                            Metric definition. For some metrics, the equation for how the value is determined is provided.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Definition

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Metric type determines whether the metric value is a counter metric or a gauge metric. Sysdig Monitor offers two Metric types:

                                                                                                                                                                                                                                                                                                                                                            Counter: The metric whose value keeps on increasing and is reliant on previous values. It helps you record how many times something has happened, for example, a user login.

                                                                                                                                                                                                                                                                                                                                                            Gauge: Represents a single numerical value that can arbitrarily fluctuate over time. Each value returns an instantaneous measurement, for example, CPU usage.

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            The type of value the metric can have. The possible values are:

                                                                                                                                                                                                                                                                                                                                                            • Percent (%)

                                                                                                                                                                                                                                                                                                                                                            • Byte

                                                                                                                                                                                                                                                                                                                                                            • Date

                                                                                                                                                                                                                                                                                                                                                            • Double

                                                                                                                                                                                                                                                                                                                                                            • Integer (int)

                                                                                                                                                                                                                                                                                                                                                            • relativeTime

                                                                                                                                                                                                                                                                                                                                                            • String

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            The levels within the infrastructure that the metric can be segmented at:

                                                                                                                                                                                                                                                                                                                                                            • Host

                                                                                                                                                                                                                                                                                                                                                            • Container

                                                                                                                                                                                                                                                                                                                                                            • Process

                                                                                                                                                                                                                                                                                                                                                            • Kubernetes

                                                                                                                                                                                                                                                                                                                                                            • Mesos

                                                                                                                                                                                                                                                                                                                                                            • Swarm

                                                                                                                                                                                                                                                                                                                                                            • CloudProvider

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            The default time aggregation format for the metric.

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            The time aggregation formats the metric can be aggregated by:

                                                                                                                                                                                                                                                                                                                                                            • Average (Avg)

                                                                                                                                                                                                                                                                                                                                                            • Rate

                                                                                                                                                                                                                                                                                                                                                            • Sum

                                                                                                                                                                                                                                                                                                                                                            • Minimum (Min)

                                                                                                                                                                                                                                                                                                                                                            • Maximum (Max)

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            The default group aggregation format for the metric.

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            The group aggregation formats the metric can be aggregated by:

                                                                                                                                                                                                                                                                                                                                                            • Average (Avg)

                                                                                                                                                                                                                                                                                                                                                            • Sum

                                                                                                                                                                                                                                                                                                                                                            • Minimum (Min)

                                                                                                                                                                                                                                                                                                                                                            • Maximum (Max)

                                                                                                                                                                                                                                                                                                                                                            5.1 -

                                                                                                                                                                                                                                                                                                                                                            Agent

                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer

                                                                                                                                                                                                                                                                                                                                                            dragent is the main process in the agent that collects and collates data from multiple sources, including syscall events from the kernel in order to generate metrics. The analyzer module that runs in the dragent process does much of the work involved in generating metrics. These internal metrics are used to troubleshoot the health of the analyzer component.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor provides the following analyzer metrics:

                                                                                                                                                                                                                                                                                                                                                            MetricsTypeMinimum Agent VersionDescription
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.processesgauge0.80.0 or aboveThe number of processes found by the analyzer.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.threadsThe number of threads found by the analyzer.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.threads.droppedcounterThe number of threads not reported due to thread limits.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.containersgaugeThe number of containers found by the analyzer.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.javaprocsThe number of java processes found by the analyzer.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.appchecksThe number of application checks reporting to the analyzer.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.mesos.autodetectIf the agent is configured to autodetect a Mesos environment, value is 1, otherwise is 0.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.mesos.detectedIf the agent actually found a Mesos environment, value is 1, otherwise, value is 0
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.fp.pct100The analyzer flush CPU % (0-100)
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.fl.msThe analyzer flush duration (milliseconds)
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.srThe current sampling ratio (1=all events, 2= half of events analyzed, 4=one fourth of events analyzed, and so on.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.n_evtsThe number of events processed
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.n_dropsThe number of events dropped
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.n_drops_bufferThe number of events dropped due to the buffer being full.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.n_preemptionsThe number of driver preemptions.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.n_command_linesThe number of command lines collected and sent to the collector.
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.command_line_cats.n_none
                                                                                                                                                                                                                                                                                                                                                            dragent.analyzer.n_container_healthcheck_command_lines0.80.1 or aboveThe number of command lines identified as container health checks. This metric does not change even if healthcheck command lines are not sent to the collector.

                                                                                                                                                                                                                                                                                                                                                            5.2 -

                                                                                                                                                                                                                                                                                                                                                            Metrics and Label Mapping

                                                                                                                                                                                                                                                                                                                                                            This topic outlines the metrics and label mapping between the Sysdig classic store and the new Sysdig datastore.

                                                                                                                                                                                                                                                                                                                                                            5.2.1 -

                                                                                                                                                                                                                                                                                                                                                            Mapping Between Classic Metrics and PromQL Metrics

                                                                                                                                                                                                                                                                                                                                                            Starting SaaS v 3.2.6, Sysdig classic metrics and labels have been renamed to be aligned with Prometheus naming convention. For example, Sysdig classic metrics have a dot-oriented hierarchy, whereas Prometheus has label-based metric organization. The table below helps you identify the Prometheus metrics and labels and the corresponding ones in the Sysdig classic system.

                                                                                                                                                                                                                                                                                                                                                            Entity

                                                                                                                                                                                                                                                                                                                                                            Type

                                                                                                                                                                                                                                                                                                                                                            PromQL Metric Name

                                                                                                                                                                                                                                                                                                                                                            Classic Metric Name

                                                                                                                                                                                                                                                                                                                                                            Label

                                                                                                                                                                                                                                                                                                                                                            Classic Label

                                                                                                                                                                                                                                                                                                                                                            host

                                                                                                                                                                                                                                                                                                                                                            info

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_info

                                                                                                                                                                                                                                                                                                                                                            Not exposed

                                                                                                                                                                                                                                                                                                                                                            • host_mac

                                                                                                                                                                                                                                                                                                                                                            • host

                                                                                                                                                                                                                                                                                                                                                            • instance_id

                                                                                                                                                                                                                                                                                                                                                            • agent_tag_{*}

                                                                                                                                                                                                                                                                                                                                                            • host.mac

                                                                                                                                                                                                                                                                                                                                                            • host.hostName

                                                                                                                                                                                                                                                                                                                                                            • host.instanceId

                                                                                                                                                                                                                                                                                                                                                            • agent.tag.{*}

                                                                                                                                                                                                                                                                                                                                                            sysdig_cloud_provider_info

                                                                                                                                                                                                                                                                                                                                                            • host_mac

                                                                                                                                                                                                                                                                                                                                                            • provider_id

                                                                                                                                                                                                                                                                                                                                                            • account_id

                                                                                                                                                                                                                                                                                                                                                            • region

                                                                                                                                                                                                                                                                                                                                                            • availability_zone

                                                                                                                                                                                                                                                                                                                                                            • instance_type

                                                                                                                                                                                                                                                                                                                                                            • tag_{*}

                                                                                                                                                                                                                                                                                                                                                            • security_groups

                                                                                                                                                                                                                                                                                                                                                            • host_ip_public

                                                                                                                                                                                                                                                                                                                                                            • host_ip_private

                                                                                                                                                                                                                                                                                                                                                            • host_name

                                                                                                                                                                                                                                                                                                                                                            • name

                                                                                                                                                                                                                                                                                                                                                            • host.mac

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.id

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.instance.type

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.tag.{*}

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.securityGroups

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.host.ip.public

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.host.ip.private

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.host.name

                                                                                                                                                                                                                                                                                                                                                            • cloudProvider.name

                                                                                                                                                                                                                                                                                                                                                            data

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_used_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.used.percent

                                                                                                                                                                                                                                                                                                                                                            • host_mac

                                                                                                                                                                                                                                                                                                                                                            • host

                                                                                                                                                                                                                                                                                                                                                            • host.mac

                                                                                                                                                                                                                                                                                                                                                            • host.hostname

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_cores_used

                                                                                                                                                                                                                                                                                                                                                            cpu.cores.used

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_user_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.user.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_idle_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.idle.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_iowait_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.iowait.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_nice_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.nice.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_stolen_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.stolen.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_system_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.system.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_fd_used_percent

                                                                                                                                                                                                                                                                                                                                                            fd.used.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_error_open_count

                                                                                                                                                                                                                                                                                                                                                            file.error.open.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_error_total_count

                                                                                                                                                                                                                                                                                                                                                            file.error.total.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_in_bytes

                                                                                                                                                                                                                                                                                                                                                            file.bytes.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_in_iops

                                                                                                                                                                                                                                                                                                                                                            file.iops.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_in_time

                                                                                                                                                                                                                                                                                                                                                            file.time.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_open_count

                                                                                                                                                                                                                                                                                                                                                            file.open.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_out_bytes

                                                                                                                                                                                                                                                                                                                                                            file.bytes.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_out_iops

                                                                                                                                                                                                                                                                                                                                                            file.iops.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_file_out_time

                                                                                                                                                                                                                                                                                                                                                            file.time.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_load_average_15m

                                                                                                                                                                                                                                                                                                                                                            load.average.15m

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_load_average_1m

                                                                                                                                                                                                                                                                                                                                                            load.average.1m

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_load_average_5m

                                                                                                                                                                                                                                                                                                                                                            load.average.5m

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_available_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.available

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_total_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.total

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_used_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.used

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_swap_available_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.available

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_swap_total_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.total

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_swap_used_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.used

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_memory_virtual_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.virtual

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_net_connection_in_count

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_net_connection_out_count

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_net_error_count

                                                                                                                                                                                                                                                                                                                                                            net.error.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_net_in_bytes

                                                                                                                                                                                                                                                                                                                                                            net.bytes.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_net_out_bytes

                                                                                                                                                                                                                                                                                                                                                            net.bytes.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_net_tcp_queue_len

                                                                                                                                                                                                                                                                                                                                                            net.tcp.queue.len

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_proc_count

                                                                                                                                                                                                                                                                                                                                                            proc.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_system_uptime

                                                                                                                                                                                                                                                                                                                                                            system.uptime

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_thread_count

                                                                                                                                                                                                                                                                                                                                                            thread.count

                                                                                                                                                                                                                                                                                                                                                            container

                                                                                                                                                                                                                                                                                                                                                            info

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_info

                                                                                                                                                                                                                                                                                                                                                            Not exposed

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container_full_id

                                                                                                                                                                                                                                                                                                                                                            none

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            image

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            image_id

                                                                                                                                                                                                                                                                                                                                                            container.image.id

                                                                                                                                                                                                                                                                                                                                                            mesos_task_id

                                                                                                                                                                                                                                                                                                                                                            container.mesosTaskId

                                                                                                                                                                                                                                                                                                                                                            Only available in Mesos orchestrator.

                                                                                                                                                                                                                                                                                                                                                            cluster

                                                                                                                                                                                                                                                                                                                                                            kubernetes.cluster.name

                                                                                                                                                                                                                                                                                                                                                            Present only if the container is part of Kubernetes.

                                                                                                                                                                                                                                                                                                                                                            pod

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.name

                                                                                                                                                                                                                                                                                                                                                            Present only if the container is part of Kubernetes

                                                                                                                                                                                                                                                                                                                                                            namespace

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.name

                                                                                                                                                                                                                                                                                                                                                            Present only if the container is part of Kubernetes.

                                                                                                                                                                                                                                                                                                                                                            data

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_used_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.used.percent

                                                                                                                                                                                                                                                                                                                                                            • host_mac

                                                                                                                                                                                                                                                                                                                                                            • container_id

                                                                                                                                                                                                                                                                                                                                                            • container_type

                                                                                                                                                                                                                                                                                                                                                            • container

                                                                                                                                                                                                                                                                                                                                                            • host.mac

                                                                                                                                                                                                                                                                                                                                                            • container.id

                                                                                                                                                                                                                                                                                                                                                            • container.type

                                                                                                                                                                                                                                                                                                                                                            • container.name

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_cores_used

                                                                                                                                                                                                                                                                                                                                                            cpu.cores.used

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_cores_used_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.cores.used.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_quota_used_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.quota.used.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_shares

                                                                                                                                                                                                                                                                                                                                                            cpu.shares.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_shares_used_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.shares.used.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_fd_used_percent

                                                                                                                                                                                                                                                                                                                                                            fd.used.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_error_open_count

                                                                                                                                                                                                                                                                                                                                                            file.error.open.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_error_total_count

                                                                                                                                                                                                                                                                                                                                                            file.error.total.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_in_bytes

                                                                                                                                                                                                                                                                                                                                                            file.bytes.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_in_iops

                                                                                                                                                                                                                                                                                                                                                            file.iops.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_in_time

                                                                                                                                                                                                                                                                                                                                                            file.time.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_open_count

                                                                                                                                                                                                                                                                                                                                                            file.open.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_out_bytes

                                                                                                                                                                                                                                                                                                                                                            file.bytes.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_out_iops

                                                                                                                                                                                                                                                                                                                                                            file.iops.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_file_out_time

                                                                                                                                                                                                                                                                                                                                                            file.time.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_limit_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.limit.bytes

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_limit_used_percent

                                                                                                                                                                                                                                                                                                                                                            memory.limit.used.percent

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_swap_available_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.available

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_swap_total_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.total

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_swap_used_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.used

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_used_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.used

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_memory_virtual_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.virtual

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_net_connection_in_count

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_net_connection_out_count

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_net_error_count

                                                                                                                                                                                                                                                                                                                                                            net.error.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_net_in_bytes

                                                                                                                                                                                                                                                                                                                                                            net.bytes.in

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_net_out_bytes

                                                                                                                                                                                                                                                                                                                                                            net.bytes.out

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_net_tcp_queue_len

                                                                                                                                                                                                                                                                                                                                                            net.tcp.queue.len

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_proc_count

                                                                                                                                                                                                                                                                                                                                                            proc.count

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_swap_limit_bytes

                                                                                                                                                                                                                                                                                                                                                            swap.limit.bytes

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_thread_count

                                                                                                                                                                                                                                                                                                                                                            thread.count

                                                                                                                                                                                                                                                                                                                                                            Process/ Program

                                                                                                                                                                                                                                                                                                                                                            Info

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_info

                                                                                                                                                                                                                                                                                                                                                            not exposed

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            data

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_cpu_used_percent

                                                                                                                                                                                                                                                                                                                                                            cpu.used.percent

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_memory_used_bytes

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.used

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_net_in_bytes

                                                                                                                                                                                                                                                                                                                                                            net.bytes.in

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_net_out_bytes

                                                                                                                                                                                                                                                                                                                                                            net.bytes.out

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_proc_count

                                                                                                                                                                                                                                                                                                                                                            proc.count

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            sysdig_program_thread_count

                                                                                                                                                                                                                                                                                                                                                            thread.count

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            program

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            cmd_line

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            fs

                                                                                                                                                                                                                                                                                                                                                            info

                                                                                                                                                                                                                                                                                                                                                            sysdig_fs_info

                                                                                                                                                                                                                                                                                                                                                            not exposed

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            device

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            mount_dir

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            type

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            data

                                                                                                                                                                                                                                                                                                                                                            sysdig_fs_free_bytes

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.free

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            device

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            sysdig_fs_inodes_total_count

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.total.count

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            device

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            sysdig_fs_inodes_used_count

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.used.count

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            device

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            sysdig_fs_total_bytes

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.total

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            device

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.used

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            devide

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            5.2.2 -

                                                                                                                                                                                                                                                                                                                                                            Mapping Legacy Sysdig Kubernetes Metrics with Prometheus Metrics

                                                                                                                                                                                                                                                                                                                                                            Prometheus metrics, in Kubernetes parlance, are nothing but Kube State Metrics. These metrics are available in Sysdig PromQL and can be mapped to existing Sysdig Kubernetes metrics.

                                                                                                                                                                                                                                                                                                                                                            For descriptions on Kubernetes State Metrics, see Kubernetes State Metrics.

                                                                                                                                                                                                                                                                                                                                                            Resource

                                                                                                                                                                                                                                                                                                                                                            Sysdig Metrics

                                                                                                                                                                                                                                                                                                                                                            Kubernetes State Metrics

                                                                                                                                                                                                                                                                                                                                                            Label

                                                                                                                                                                                                                                                                                                                                                            Example / More Information

                                                                                                                                                                                                                                                                                                                                                            Pod

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.containers.waiting

                                                                                                                                                                                                                                                                                                                                                            kube_pod_container_status_waiting

                                                                                                                                                                                                                                                                                                                                                            • container=<container-name>

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceLimits.cpuCores

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceLimits.memBytes

                                                                                                                                                                                                                                                                                                                                                            kube_pod_container_resource_limits

                                                                                                                                                                                                                                                                                                                                                            • resource=<resource-name>

                                                                                                                                                                                                                                                                                                                                                            • unit=<resource-unit>

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • node=< node-name>

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",container="pod1_con1",resource="cpu",unit="core"}

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",container="pod1_con1",resource="memory",unit="byte"}

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceRequests.cpuCores

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceRequests.memBytes

                                                                                                                                                                                                                                                                                                                                                            kube_pod_container_resource_requests

                                                                                                                                                                                                                                                                                                                                                            • resource=<resource-name>

                                                                                                                                                                                                                                                                                                                                                            • unit=<resource-unit>

                                                                                                                                                                                                                                                                                                                                                            • container=<container-name>

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • node=< node-name>

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",container="pod1_con1",resource="cpu",unit="core"}

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",container="pod1_con1",resource="memory",unit="byte"}

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.status.ready

                                                                                                                                                                                                                                                                                                                                                            kube_pod_status_ready

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • condition=<true|false|unknown>

                                                                                                                                                                                                                                                                                                                                                            kube_pod_info

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • host_ip=<host-ip>

                                                                                                                                                                                                                                                                                                                                                            • pod_ip=<pod-ip>

                                                                                                                                                                                                                                                                                                                                                            • node=<node-name>

                                                                                                                                                                                                                                                                                                                                                            • uid=<pod-uid>

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",host_ip="1.1.1.1",pod_ip="1.2.3.4",uid="abc-0",node="node1",created_by_kind="<none>",created_by_name="<none>",priority_class=""}

                                                                                                                                                                                                                                                                                                                                                            kube_pod_owner

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • owner_kind=<owner kind>

                                                                                                                                                                                                                                                                                                                                                            • owner_name=<owner name>

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",owner_kind="<none>",owner_name="<none>;",owner_is_controller="<none>"}

                                                                                                                                                                                                                                                                                                                                                            kube_pod_labels

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • label_POD_LABEL=<POD_LABEL>

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0", label_app="myApp"}

                                                                                                                                                                                                                                                                                                                                                            kube_pod_container_info

                                                                                                                                                                                                                                                                                                                                                            • pod=<pod-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<pod-namespace>

                                                                                                                                                                                                                                                                                                                                                            • container_id=<containerid>

                                                                                                                                                                                                                                                                                                                                                            {namespace="default",pod="pod0",container="container2",image="k8s.gcr.io/hyperkube2",image_id="docker://sha256:bbb",container_id="docker://cd456"}

                                                                                                                                                                                                                                                                                                                                                            node

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.allocatable.cpuCores

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_allocatable_cpu_cores

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            • resource=<resource-name>

                                                                                                                                                                                                                                                                                                                                                            • unit=<resource-unit>

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            resource/unit have one of the values: (cpu, core); (memory, byte); (pods, integer). Sysdig currently supports only CPU, pods, and memory resources for kube_node_status_capacity metrics.

                                                                                                                                                                                                                                                                                                                                                            "# HELP kube_node_status_capacity The capacity for different resources of a node.
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-master"",resource=""hugepages_1Gi"",unit=""byte""} 0
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-master"",resource=""hugepages_2Mi"",unit=""byte""} 0
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-master"",resource=""memory"",unit=""byte""} 4.16342016e+09
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-master"",resource=""pods"",unit=""integer""} 110
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node1"",resource=""pods"",unit=""integer""} 110
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node1"",resource=""cpu"",unit=""core""} 2
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node1"",resource=""hugepages_1Gi"",unit=""byte""} 0
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node1"",resource=""hugepages_2Mi"",unit=""byte""} 0
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node1"",resource=""memory"",unit=""byte""} 6.274154496e+09
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node2"",resource=""hugepages_1Gi"",unit=""byte""} 0
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node2"",resource=""hugepages_2Mi"",unit=""byte""} 0
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node2"",resource=""memory"",unit=""byte""} 6.274154496e+09
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node2"",resource=""pods"",unit=""integer""} 110
                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity{node=""k8s-node2"",resource=""cpu"",unit=""core""} 2

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.allocatable.memBytes

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_allocatable_memory_bytes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.allocatable.pods

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_allocatable_pods

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.capacity.cpuCores

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity_cpu_cores

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            • resource=<resource-name>

                                                                                                                                                                                                                                                                                                                                                            • unit=<resource-unit>

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.capacity.memBytes

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity_memory_bytes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.capacity.pod

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_capacity_pods

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.diskPressure

                                                                                                                                                                                                                                                                                                                                                            kube_node_status_condition

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address

                                                                                                                                                                                                                                                                                                                                                            • condition=<node-condition>

                                                                                                                                                                                                                                                                                                                                                            • status=<true|false|unknown>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.memoryPressure

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.networkUnavailable

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.outOfDisk

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.ready

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.unschedulable

                                                                                                                                                                                                                                                                                                                                                            kube_node_spec_unschedulable

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            kube_node_info

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            kube_node_labels

                                                                                                                                                                                                                                                                                                                                                            • node=<node-address>

                                                                                                                                                                                                                                                                                                                                                            • label_NODE_LABEL=<NODE_LABEL>

                                                                                                                                                                                                                                                                                                                                                            Deployment

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.available

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_status_replicas_available

                                                                                                                                                                                                                                                                                                                                                            • deployment=<deployment-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<deployment-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_spec_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.paused

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_spec_paused

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.running

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_status_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.unavailable

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_status_replicas_unavailable

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.updated

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_status_replicas_updated

                                                                                                                                                                                                                                                                                                                                                            kube_deployment_labels

                                                                                                                                                                                                                                                                                                                                                            job

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.completions

                                                                                                                                                                                                                                                                                                                                                            kube_job_spec_completions

                                                                                                                                                                                                                                                                                                                                                            • job_name=<job-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<job-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.numFailed

                                                                                                                                                                                                                                                                                                                                                            kube_job_failed

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.numSucceeded

                                                                                                                                                                                                                                                                                                                                                            kube_job_complete

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.parallelism

                                                                                                                                                                                                                                                                                                                                                            kube_job_spec_parallelism

                                                                                                                                                                                                                                                                                                                                                            kube_job_status_active

                                                                                                                                                                                                                                                                                                                                                            kube_job_info

                                                                                                                                                                                                                                                                                                                                                            kube_job_owner

                                                                                                                                                                                                                                                                                                                                                            • job_name=<job-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<job-namespace>

                                                                                                                                                                                                                                                                                                                                                            • owner_kind=<owner kind>

                                                                                                                                                                                                                                                                                                                                                            • owner_name=<owner name>

                                                                                                                                                                                                                                                                                                                                                            kube_job_labels

                                                                                                                                                                                                                                                                                                                                                            • job_name=<job-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<job-namespace>

                                                                                                                                                                                                                                                                                                                                                            • label_job_label=<job_label>

                                                                                                                                                                                                                                                                                                                                                            daemonSet

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.desired

                                                                                                                                                                                                                                                                                                                                                            kube_daemonset_status_desired_number_scheduled

                                                                                                                                                                                                                                                                                                                                                            • daemonset=<daemonset-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<daemonset-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.misscheduled

                                                                                                                                                                                                                                                                                                                                                            kube_daemonset_status_number_misscheduled

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.ready

                                                                                                                                                                                                                                                                                                                                                            kube_daemonset_status_number_ready

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.scheduled

                                                                                                                                                                                                                                                                                                                                                            kube_daemonset_status_current_number_scheduled

                                                                                                                                                                                                                                                                                                                                                            kube_daemonset_labels

                                                                                                                                                                                                                                                                                                                                                            • daemonset=<daemonset-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<daemonset-namespace>

                                                                                                                                                                                                                                                                                                                                                            • label_daemonset_label=<daemonset_label>

                                                                                                                                                                                                                                                                                                                                                            replicaSet

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.fullyLabeled

                                                                                                                                                                                                                                                                                                                                                            kube_replicaset_status_fully_labeled_replicas

                                                                                                                                                                                                                                                                                                                                                            • replicaset=<replicaset-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<replicaset-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.ready

                                                                                                                                                                                                                                                                                                                                                            kube_replicaset_status_ready_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.running

                                                                                                                                                                                                                                                                                                                                                            kube_replicaset_status_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            kube_replicaset_spec_replicas

                                                                                                                                                                                                                                                                                                                                                            kube_replicaset_owner

                                                                                                                                                                                                                                                                                                                                                            • replicaset=<replicaset-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<replicaset-namespace>

                                                                                                                                                                                                                                                                                                                                                            • owner_kind=<owner kind>

                                                                                                                                                                                                                                                                                                                                                            • owner_name=<owner name>

                                                                                                                                                                                                                                                                                                                                                            kube_replicaset_labels

                                                                                                                                                                                                                                                                                                                                                            • label_replicaset_label=<replicaset_label>

                                                                                                                                                                                                                                                                                                                                                            • replicaset=<replicaset-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<replicaset-namespace>

                                                                                                                                                                                                                                                                                                                                                            statefulset

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.replicas

                                                                                                                                                                                                                                                                                                                                                            kube_statefulset_replicas

                                                                                                                                                                                                                                                                                                                                                            • statefulset=<statefulset-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<statefulset-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas

                                                                                                                                                                                                                                                                                                                                                            kube_statefulset_status_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas.current

                                                                                                                                                                                                                                                                                                                                                            kube_statefulset_status_replicas_current

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas.ready

                                                                                                                                                                                                                                                                                                                                                            kube_statefulset_status_replicas_ready

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas.updated

                                                                                                                                                                                                                                                                                                                                                            kube_statefulset_status_replicas_updated

                                                                                                                                                                                                                                                                                                                                                            kube_statefulset_labels

                                                                                                                                                                                                                                                                                                                                                            hpa

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.min

                                                                                                                                                                                                                                                                                                                                                            kube_horizontalpodautoscaler_spec_min_replicas

                                                                                                                                                                                                                                                                                                                                                            • hpa=<hpa-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<hpa-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.max

                                                                                                                                                                                                                                                                                                                                                            kube_horizontalpodautoscaler_spec_max_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.current

                                                                                                                                                                                                                                                                                                                                                            kube_horizontalpodautoscaler_status_current_replicas

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            kube_horizontalpodautoscaler_status_desired_replicas

                                                                                                                                                                                                                                                                                                                                                            kube_horizontalpodautoscaler_labels

                                                                                                                                                                                                                                                                                                                                                            resourcequota

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.configmaps.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.configmaps.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.cpu.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.cpu.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.memory.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.memory.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.persistentvolumeclaims.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.persistentvolumeclaims.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.cpu.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.memory.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.pods.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.pods.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.replicationcontrollers.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.replicationcontrollers.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.cpu.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.cpu.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.memory.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.memory.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.storage.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.storage.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.resourcequotas.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.resourcequotas.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.secrets.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.secrets.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.loadbalancers.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.loadbalancers.used

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.nodeports.hard

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.nodeports.used

                                                                                                                                                                                                                                                                                                                                                            kube_resourcequota

                                                                                                                                                                                                                                                                                                                                                            • resourcequota=<quota-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<namespace>

                                                                                                                                                                                                                                                                                                                                                            • resource=<ResourceName>

                                                                                                                                                                                                                                                                                                                                                            • type=<quota-type>

                                                                                                                                                                                                                                                                                                                                                            namespace

                                                                                                                                                                                                                                                                                                                                                            kube_namespace_labels

                                                                                                                                                                                                                                                                                                                                                            • namespace=<namespace-name>

                                                                                                                                                                                                                                                                                                                                                            • label_ns_label=<ns_label>

                                                                                                                                                                                                                                                                                                                                                            replicationcontroller

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicationcontroller.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_spec_replicase

                                                                                                                                                                                                                                                                                                                                                            • replicationcontroller=<replicationcontroller-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<replicationcontroller-namespace>

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicationcontroller.replicas.running

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_status_replicas

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_status_fully_labeled_replicas

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_status_ready_replicas

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_status_available_replicas

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_status_observed_generation

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_metadata_generation

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_created

                                                                                                                                                                                                                                                                                                                                                            kube_replicationcontroller_owner

                                                                                                                                                                                                                                                                                                                                                            • replicationcontroller=<replicationcontroller-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<replicationcontroller-namespace>

                                                                                                                                                                                                                                                                                                                                                            • owner_kind=<owner kind>

                                                                                                                                                                                                                                                                                                                                                            • owner_name=<owner name>

                                                                                                                                                                                                                                                                                                                                                            service

                                                                                                                                                                                                                                                                                                                                                            kube_service_info

                                                                                                                                                                                                                                                                                                                                                            • service=<service-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<service-namespace>

                                                                                                                                                                                                                                                                                                                                                            • cluster_ip=<service cluster ip>

                                                                                                                                                                                                                                                                                                                                                            • external_name=<service external name>

                                                                                                                                                                                                                                                                                                                                                            • load_balancer_ip=<service load balancer ip>

                                                                                                                                                                                                                                                                                                                                                            kube_service_labels

                                                                                                                                                                                                                                                                                                                                                            • service=<service-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<service-namespace>

                                                                                                                                                                                                                                                                                                                                                            • label_service_label=<service_label>

                                                                                                                                                                                                                                                                                                                                                            persistentvolume

                                                                                                                                                                                                                                                                                                                                                            kubernetes.persistentvolume.storage

                                                                                                                                                                                                                                                                                                                                                            kube_persistentvolume_capacity_bytes

                                                                                                                                                                                                                                                                                                                                                            • persistentvolume=<pv-name>

                                                                                                                                                                                                                                                                                                                                                            kube_persistentvolume_info

                                                                                                                                                                                                                                                                                                                                                            • persistentvolume=<pv-name>

                                                                                                                                                                                                                                                                                                                                                            kube_persistentvolume_labels

                                                                                                                                                                                                                                                                                                                                                            • persistentvolume=<pv-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<label_persistentvolume_label=<persistentvolume_label>

                                                                                                                                                                                                                                                                                                                                                            persistentvolumeclaim

                                                                                                                                                                                                                                                                                                                                                            kubernetes.persistentvolumeclaim.requests.storage

                                                                                                                                                                                                                                                                                                                                                            kube_persistentvolumeclaim_resource_requests_storage_bytes

                                                                                                                                                                                                                                                                                                                                                            • namespace=<persistentvolumeclaim-namespace>

                                                                                                                                                                                                                                                                                                                                                            • persistentvolumeclaim=<persistentvolumeclaim-name>

                                                                                                                                                                                                                                                                                                                                                            kube_persistentvolumeclaim_info

                                                                                                                                                                                                                                                                                                                                                            kube_persistentvolumeclaim_labels

                                                                                                                                                                                                                                                                                                                                                            • persistentvolumeclaim=<persistentvolumeclaim-name>

                                                                                                                                                                                                                                                                                                                                                            • namespace=<persistentvolumeclaim-namespace>

                                                                                                                                                                                                                                                                                                                                                            • label_persistentvolumeclaim_label=<persistentvolumeclaim_label>

                                                                                                                                                                                                                                                                                                                                                            5.2.3 -

                                                                                                                                                                                                                                                                                                                                                            Run PromQL Queries Faster with Extended Label Set

                                                                                                                                                                                                                                                                                                                                                            Sysdig allows you to run PromQL queries smoother and faster with the extended label set. The extended label set is created by augmenting the incoming data with the rich metadata associated with your infrastructure and making it available in PromQL.

                                                                                                                                                                                                                                                                                                                                                            With this, you can troubleshoot a problem or building Dashboards and Alerts without the need to write complex queries. Sysdig automatically enriches your metrics with Kubernetes and application context without the need to instrument additional labels in your environment. This reduces operational complexity and cost—the enrichment takes place in Sysdig metric ingestion pipeline after time series have been sent to the backend.

                                                                                                                                                                                                                                                                                                                                                            Calculate Memory Usage by Deployment in a Cluster

                                                                                                                                                                                                                                                                                                                                                            Using the vector matching operation, you could run the following query and calculate the memory usage by deployment in a cluster:

                                                                                                                                                                                                                                                                                                                                                            sum by(cluster,namespace,owner_name) ((sysdig_container_memory_used_bytes * on(container_id) group_left(pod,namespace,cluster) kube_pod_container_info) * on(pod,namespace,cluster) group_left(owner_name) kube_pod_owner{owner_kind="Deployment",owner_name=~".+",cluster=~".+",namespace=~".+"})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            To get the result, you need to write a query to perform a join (vector match) of various metrics, usually in the following order:

                                                                                                                                                                                                                                                                                                                                                            • Grab a metric you need that is defined on a container level. For example, a Prometheus metric or some of the Sysdig provided metrics, such as sysdig_container_memory_used_byte.

                                                                                                                                                                                                                                                                                                                                                            • Perform a vector match on container ID with the metric kube_pod_container_info to get the pod metadata.

                                                                                                                                                                                                                                                                                                                                                            • Perform a vector match on the pod, namespace, and cluster with the kube_pod_owner metric.

                                                                                                                                                                                                                                                                                                                                                            In the case of Sysdig’s extended label set for PromQL, all the metrics inherit the metadata, so that necessary container, host, and Kubernetes metadata are set on all the metrics. This simplifies the query so you can build and run it quickly.

                                                                                                                                                                                                                                                                                                                                                            Likewise, the above query can be simplified as follows:

                                                                                                                                                                                                                                                                                                                                                            sum by (kube_cluster_name,kube_namespace_name,kube_deployment_name) (sysdig_container_memory_used_bytes{kube_cluster_name!="",kube_namespace_name!="",kube_deployment_name!=""})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The advantages of using a simplified query are:

                                                                                                                                                                                                                                                                                                                                                            • Complex vector matching operations (the group_left and group_right operators) are no longer required. All the labels are already available on each of the metrics, and therefore, any filtering can be performed directly on the metric itself.

                                                                                                                                                                                                                                                                                                                                                            • The metrics now will have a huge amount of labels. You can use PromQL Explorer to deal with this rich metadata.

                                                                                                                                                                                                                                                                                                                                                            • The metadata is distinguishable from user-defined labels. For example, Kubernetes metadata labels start with kube_. For instance, cluster is replaced with kube_cluster_name.

                                                                                                                                                                                                                                                                                                                                                            • Create a dashboard panel or an alert from the PromQL query you run in the PromQL Query Explore.

                                                                                                                                                                                                                                                                                                                                                            • Filter data by applying the comparison operators on the label values given in the table.

                                                                                                                                                                                                                                                                                                                                                            Examples for Simplifying Queries

                                                                                                                                                                                                                                                                                                                                                            Given below are some of the examples of using the extended label set to simplify complex query operations.

                                                                                                                                                                                                                                                                                                                                                            Memory Usage in a Kubernetes Cluster

                                                                                                                                                                                                                                                                                                                                                            Query with core label set:

                                                                                                                                                                                                                                                                                                                                                            avg by (agent_tag_cluster) ((sysdig_host_memory_used_bytes/sysdig_host_memory_total_bytes) * on(host,agent_tag_cluster) sysdig_host_info{agent_tag_cluster=~".+"}) * 100
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Query with the extended label set:

                                                                                                                                                                                                                                                                                                                                                            avg by (agent_tag_cluster) (sysdig_host_memory_used_bytes/sysdig_host_memory_total_bytes) * 100
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            CPU Usage in Containers

                                                                                                                                                                                                                                                                                                                                                            Query with the core label set:

                                                                                                                                                                                                                                                                                                                                                            sum by (cluster,namespace)(sysdig_container_cpu_cores_used * on (container_id) group_left(cluster,pod,namespace) kube_pod_container_info{cluster=~".+"})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Simplified query with the extended label set:

                                                                                                                                                                                                                                                                                                                                                            sum by (kube_cluster_name,kube_namespace_name)(sysdig_container_cpu_cores_used{kube_cluster_name=~".+"})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Memory Usage in Daemonset

                                                                                                                                                                                                                                                                                                                                                            Query with the core label set:

                                                                                                                                                                                                                                                                                                                                                            sum by(cluster,namespace,owner_name) (sum by(pod) (label_replace(sysdig_container_memory_used_bytes * on(container_id,host_mac) group_left(label_io_kubernetes_pod_namespace,label_io_kubernetes_pod_name,label_io_kubernetes_container_name) sysdig_container_info{label_io_kubernetes_pod_namespace=~".*",cluster=~".*"},"pod","$1","label_io_kubernetes_pod_name","(.*)"))  * on(pod) group_right sum by(cluster,namespace,owner_name,pod) (kube_pod_owner{owner_kind=~"DaemonSet",owner_name=~".*",cluster=~".*",namespace=~".*"}))
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Simplified query with the extended label set:

                                                                                                                                                                                                                                                                                                                                                            sum by(kube_cluster_name,kube_namespace_name,kube_daemonset_name) (sysdig_container_memory_used_bytes{kube_daemonset_name=~".*",kube_cluster_name=~".*",kube_namespace_name=~".*"})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Pod Restarts in a Kubernetes Cluster

                                                                                                                                                                                                                                                                                                                                                            Query with the core label set:

                                                                                                                                                                                                                                                                                                                                                            sum by(cluster,namespace,owner_name)(changes(kube_pod_status_ready{condition="true",cluster=~$cluster,namespace=~$namespace}[$__interval]) * on(cluster,namespace,pod) group_left(owner_name) kube_pod_owner{owner_kind="Deployment",owner_name=~".+",cluster=~$cluster,namespace=~$namespace})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Simplified query with the extended label set:

                                                                                                                                                                                                                                                                                                                                                            sum by (kube_cluster_name,kube_namespace_name,kube_deployment_name)(changes(kube_pod_status_ready{condition="true",kube_cluster_name=~$cluster,kube_namespace_name=~$namespace,kube_deployment_name=~".+"}[$__interval]))
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Containers per Image

                                                                                                                                                                                                                                                                                                                                                            Query with the core label set:

                                                                                                                                                                                                                                                                                                                                                            count by (owner_name,image,cluster,namespace)((sysdig_container_info{cluster=~$cluster,namespace=~$namespace})  * on(pod,namespace,cluster) group_left(owner_name) max by (pod,namespace,cluster,owner_name)(kube_pod_owner{owner_kind="Deployment",owner_name=~".+"}))
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Simplified query with the extended label set:

                                                                                                                                                                                                                                                                                                                                                            count by (kube_deployment_name,image,kube_cluster_name,kube_namespace_name)(sysdig_container_info{kube_deployment_name=~".+",kube_cluster_name=~$cluster,kube_namespace_name=~$namespace})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Average TCP Queue per Node

                                                                                                                                                                                                                                                                                                                                                            Query with the core label set:

                                                                                                                                                                                                                                                                                                                                                            avg by (agent_tag_cluster,host)( sysdig_host_net_tcp_queue_len * on (host_mac) group_left(agent_tag_cluster,host) sysdig_host_info{agent_tag_cluster=~$cluster,host=~".+"})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Simplified query with the extended label set:

                                                                                                                                                                                                                                                                                                                                                            avg by (agent_tag_cluster,host_hostname) (sysdig_host_net_tcp_queue_len{agent_tag_cluster =~ $cluster})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            5.3.1 -

                                                                                                                                                                                                                                                                                                                                                            Apache Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Apache integration information.

                                                                                                                                                                                                                                                                                                                                                            apache.conns_async_closing

                                                                                                                                                                                                                                                                                                                                                            The number of asynchronous closing connections.

                                                                                                                                                                                                                                                                                                                                                            apache.conns_async_keep_alive

                                                                                                                                                                                                                                                                                                                                                            The number of asynchronous keep-alive connections.

                                                                                                                                                                                                                                                                                                                                                            apache.conns_async_writing

                                                                                                                                                                                                                                                                                                                                                            The number of asynchronous write connections.

                                                                                                                                                                                                                                                                                                                                                            apache.conns_total

                                                                                                                                                                                                                                                                                                                                                            The total number of connections handled.

                                                                                                                                                                                                                                                                                                                                                            apache.net.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes served.

                                                                                                                                                                                                                                                                                                                                                            apache.net.bytes_per_s

                                                                                                                                                                                                                                                                                                                                                            The number of bytes served per second.

                                                                                                                                                                                                                                                                                                                                                            apache.net.hits

                                                                                                                                                                                                                                                                                                                                                            The total number of requests performed.

                                                                                                                                                                                                                                                                                                                                                            apache.net.request_per_s

                                                                                                                                                                                                                                                                                                                                                            The number of requests performed per second.

                                                                                                                                                                                                                                                                                                                                                            apache.performance.busy_workers

                                                                                                                                                                                                                                                                                                                                                            The number of workers currently serving requests.

                                                                                                                                                                                                                                                                                                                                                            apache.performance.cpu_load

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU used.

                                                                                                                                                                                                                                                                                                                                                            apache.performance.idle_workers

                                                                                                                                                                                                                                                                                                                                                            The number of idle workers in the instance.

                                                                                                                                                                                                                                                                                                                                                            apache.performance.uptime

                                                                                                                                                                                                                                                                                                                                                            The amount of time the server has been running in seconds.

                                                                                                                                                                                                                                                                                                                                                            5.3.2.1 -

                                                                                                                                                                                                                                                                                                                                                            Apache Kafka Consumer Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Apache Kafka integration information.

                                                                                                                                                                                                                                                                                                                                                            kafka.broker_offset

                                                                                                                                                                                                                                                                                                                                                            The current message offset value on the broker.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer_lag

                                                                                                                                                                                                                                                                                                                                                            The lag in messages between the consumer and the broker.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer_offset

                                                                                                                                                                                                                                                                                                                                                            The current message offset value on the consumer.

                                                                                                                                                                                                                                                                                                                                                            5.3.2.2 -

                                                                                                                                                                                                                                                                                                                                                            Apache Kafka JMX Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Apache Kafka integration information.

                                                                                                                                                                                                                                                                                                                                                            The kafka.consumer.* and kafka.producer.* metrics are only available with JMX customization as documented in Integrate JMX Metrics from Java Virtual Machines.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.bytes_consumed

                                                                                                                                                                                                                                                                                                                                                            The average number of bytes consumed for a specific topic per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.bytes_in

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes coming in to the consumer.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.delayed_requests

                                                                                                                                                                                                                                                                                                                                                            The number of delayed consumer requests.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.expires_per_second

                                                                                                                                                                                                                                                                                                                                                            The rate of delayed consumer request expiration.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.fetch_rate

                                                                                                                                                                                                                                                                                                                                                            The minimum rate at which the consumer sends fetch requests to a broker.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.fetch_size_avg

                                                                                                                                                                                                                                                                                                                                                            The average number of bytes fetched for a specific topic per request.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.fetch_size_max

                                                                                                                                                                                                                                                                                                                                                            The maximum number of bytes fetched for a specific topic per request.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.kafka_commits

                                                                                                                                                                                                                                                                                                                                                            The rate of offset commits to Kafka.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.max_lag

                                                                                                                                                                                                                                                                                                                                                            The maximum consumer lag.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.messages_in

                                                                                                                                                                                                                                                                                                                                                            The rate of consumer message consumption.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.records_consumed

                                                                                                                                                                                                                                                                                                                                                            The average number of records consumed per second for a specific topic.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.records_per_request_avg

                                                                                                                                                                                                                                                                                                                                                            The average number of records in each request for a specific topic.

                                                                                                                                                                                                                                                                                                                                                            kafka.consumer.zookeeper_commits

                                                                                                                                                                                                                                                                                                                                                            The rate of offset commits to ZooKeeper.

                                                                                                                                                                                                                                                                                                                                                            kafka.expires_sec

                                                                                                                                                                                                                                                                                                                                                            The rate of delayed producer request expiration.

                                                                                                                                                                                                                                                                                                                                                            kafka.follower.expires_per_second

                                                                                                                                                                                                                                                                                                                                                            The rate of request expiration on followers.

                                                                                                                                                                                                                                                                                                                                                            kafka.log.flush_rate

                                                                                                                                                                                                                                                                                                                                                            The log flush rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.messages_in

                                                                                                                                                                                                                                                                                                                                                            The incoming message rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.net.bytes_in

                                                                                                                                                                                                                                                                                                                                                            The incoming byte rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.net.bytes_out

                                                                                                                                                                                                                                                                                                                                                            The outgoing byte rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.net.bytes_rejected

                                                                                                                                                                                                                                                                                                                                                            The rejected byte rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.available_buffer_bytes

                                                                                                                                                                                                                                                                                                                                                            The total amount of buffer memory, including unallocated buffer memory and memory in the free list, that is not being used.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.batch_size_avg

                                                                                                                                                                                                                                                                                                                                                            The average number of bytes sent per partition per-request.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.batch_size_max

                                                                                                                                                                                                                                                                                                                                                            The maximum number of bytes sent per partition per-request.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.buffer_bytes_total

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of buffer memory the client can use.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.bufferpool_wait_time

                                                                                                                                                                                                                                                                                                                                                            The fraction of time an appender waits for space allocation.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.bytes_out

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes going out for the producer.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.compression_rate

                                                                                                                                                                                                                                                                                                                                                            The average compression rate of record batches for a topic.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.compression_rate_avg

                                                                                                                                                                                                                                                                                                                                                            The average compression rate of record batches.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.delayed_requests

                                                                                                                                                                                                                                                                                                                                                            The number of producer requests delayed.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.expires_per_seconds

                                                                                                                                                                                                                                                                                                                                                            The rate of producer request expiration.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.io_wait

                                                                                                                                                                                                                                                                                                                                                            The producer I/O wait time.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.message_rate

                                                                                                                                                                                                                                                                                                                                                            The producer message rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.metadata_age

                                                                                                                                                                                                                                                                                                                                                            The age of the current producer metadata being used, in seconds.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_error_rate

                                                                                                                                                                                                                                                                                                                                                            The average number of retried record sends for a topic per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_queue_time_avg

                                                                                                                                                                                                                                                                                                                                                            The average time that record batches spent in the record accumulator, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_queue_time_max

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of time record batches can spend in the record accumulator, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_retry_rate

                                                                                                                                                                                                                                                                                                                                                            The average number of retried record sends for a topic per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_send_rate

                                                                                                                                                                                                                                                                                                                                                            The average number of records sent per second for a topic.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_size_avg

                                                                                                                                                                                                                                                                                                                                                            The average record size.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.record_size_max

                                                                                                                                                                                                                                                                                                                                                            The maximum record size.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.records_per_request

                                                                                                                                                                                                                                                                                                                                                            The average number of records sent per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.request_latency_avg

                                                                                                                                                                                                                                                                                                                                                            The average request latency of the producer.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.request_latency_max

                                                                                                                                                                                                                                                                                                                                                            The maximum request latency in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.request_rate

                                                                                                                                                                                                                                                                                                                                                            The number of producer requests per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.requests_in_flight

                                                                                                                                                                                                                                                                                                                                                            The current number of in-flight requests awaiting a response

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.response_rate

                                                                                                                                                                                                                                                                                                                                                            The number of producer responses per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.throttle_time_avg

                                                                                                                                                                                                                                                                                                                                                            The average time in a request was throttled by a broker, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.throttle_time_max

                                                                                                                                                                                                                                                                                                                                                            The maximum time in a request was throttled by a broker, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            kafka.producer.waiting_threads

                                                                                                                                                                                                                                                                                                                                                            The number of user threads blocked waiting for buffer memory to enqueue their records.

                                                                                                                                                                                                                                                                                                                                                            kafka.replication.isr_expands

                                                                                                                                                                                                                                                                                                                                                            The rate of replicas joining the ISR pool.

                                                                                                                                                                                                                                                                                                                                                            kafka.replication.isr_shrinks

                                                                                                                                                                                                                                                                                                                                                            The rate of replicas leaving the ISR pool.

                                                                                                                                                                                                                                                                                                                                                            kafka.replication.leader_elections

                                                                                                                                                                                                                                                                                                                                                            The leader election rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.replication.unclean_leader_elections

                                                                                                                                                                                                                                                                                                                                                            The unclean leader election rate.

                                                                                                                                                                                                                                                                                                                                                            kafka.replication.under_replicated_partitions

                                                                                                                                                                                                                                                                                                                                                            The number of unreplicated partitions.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.fetch.failed

                                                                                                                                                                                                                                                                                                                                                            The number of client fetch request failures.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.fetch.failed_per_second

                                                                                                                                                                                                                                                                                                                                                            The rate of client fetch request failures per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.fetch.time.99percentile

                                                                                                                                                                                                                                                                                                                                                            The time for fetch requests for the 99th percentile.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.fetch.time.avg

                                                                                                                                                                                                                                                                                                                                                            The average time per fetch request.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.handler.avg.idle.pct

                                                                                                                                                                                                                                                                                                                                                            The average fraction of time the request handler threads are idle.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.metadata.time.99percentile

                                                                                                                                                                                                                                                                                                                                                            The time for metadata requests for 99th percentile.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.metadata.time.avg

                                                                                                                                                                                                                                                                                                                                                            The average time for a metadata request.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.offsets.time.99percentile

                                                                                                                                                                                                                                                                                                                                                            The time for offset requests for the 99th percentile.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.offsets.time.avg

                                                                                                                                                                                                                                                                                                                                                            The average time for an offset request.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.produce.failed

                                                                                                                                                                                                                                                                                                                                                            The number of failed produce requests.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.produce.failed_per_second

                                                                                                                                                                                                                                                                                                                                                            The rate of failed produce requests per second.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.produce.time.99percentile

                                                                                                                                                                                                                                                                                                                                                            The time for produce requests for the 99th percentile.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.produce.time.avg

                                                                                                                                                                                                                                                                                                                                                            The average time for a produce request.

                                                                                                                                                                                                                                                                                                                                                            kafka.request.update_metadata.time.99percentile

                                                                                                                                                                                                                                                                                                                                                            The time for update metadata requests for the 99th percentile

                                                                                                                                                                                                                                                                                                                                                            kafka.request.update_metadata.time.avg

                                                                                                                                                                                                                                                                                                                                                            The average time for a request to update metadata.

                                                                                                                                                                                                                                                                                                                                                            5.3.3 -

                                                                                                                                                                                                                                                                                                                                                            Consul Metrics

                                                                                                                                                                                                                                                                                                                                                            Contents

                                                                                                                                                                                                                                                                                                                                                            5.3.3.1 -

                                                                                                                                                                                                                                                                                                                                                            Base Consul Metrics

                                                                                                                                                                                                                                                                                                                                                            For related information, see Consul integration.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.nodes_critical

                                                                                                                                                                                                                                                                                                                                                            Number of nodes with service status `critical` from those registered.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.nodes_passing

                                                                                                                                                                                                                                                                                                                                                            Number of nodes with service status `passing` from those registered.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.nodes_up

                                                                                                                                                                                                                                                                                                                                                            Number of nodes.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.nodes_warning

                                                                                                                                                                                                                                                                                                                                                            Number of nodes with service status `warning` from those registered.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.services_critical

                                                                                                                                                                                                                                                                                                                                                            Total critical services on nodes.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.services_passing

                                                                                                                                                                                                                                                                                                                                                            Total passing services on nodes.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.services_up

                                                                                                                                                                                                                                                                                                                                                            Total services registered on nodes.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.services_warning

                                                                                                                                                                                                                                                                                                                                                            Total warning services on nodes.

                                                                                                                                                                                                                                                                                                                                                            consul.catalog.total_nodes

                                                                                                                                                                                                                                                                                                                                                            Number of nodes registered in the consul cluster.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.max

                                                                                                                                                                                                                                                                                                                                                            Maximum latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.median

                                                                                                                                                                                                                                                                                                                                                            Median latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.min

                                                                                                                                                                                                                                                                                                                                                            Minimum latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.p25

                                                                                                                                                                                                                                                                                                                                                            p25 latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.p75

                                                                                                                                                                                                                                                                                                                                                            p75 latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.p90

                                                                                                                                                                                                                                                                                                                                                            p90 latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.p95

                                                                                                                                                                                                                                                                                                                                                            p95 latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.net.node.latency.p99

                                                                                                                                                                                                                                                                                                                                                            p99 latency from this node to all others.

                                                                                                                                                                                                                                                                                                                                                            consul.peers

                                                                                                                                                                                                                                                                                                                                                            Number of peers in the peer set.

                                                                                                                                                                                                                                                                                                                                                            5.3.3.2 -

                                                                                                                                                                                                                                                                                                                                                            Consul StatsD Metrics

                                                                                                                                                                                                                                                                                                                                                            For related information, see Consul integration.

                                                                                                                                                                                                                                                                                                                                                            consul.memberlist.msg.suspect

                                                                                                                                                                                                                                                                                                                                                            Number of times an agent suspects another as failed while probing during gossip protocol.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.apply

                                                                                                                                                                                                                                                                                                                                                            Number of raft transactions occurring.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.commitTime.95percentile

                                                                                                                                                                                                                                                                                                                                                            The p95 time it takes to commit a new entry to the raft log on the leader.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.commitTime.avg

                                                                                                                                                                                                                                                                                                                                                            The average time it takes to commit a new entry to the raft log on the leader.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.commitTime.count

                                                                                                                                                                                                                                                                                                                                                            The number of samples of raft.commitTime

                                                                                                                                                                                                                                                                                                                                                            consul.raft.commitTime.max

                                                                                                                                                                                                                                                                                                                                                            The max time it takes to commit a new entry to the raft log on the leader.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.commitTime.median

                                                                                                                                                                                                                                                                                                                                                            The median time it takes to commit a new entry to the raft log on the leader.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.dispatchLog.95percentile

                                                                                                                                                                                                                                                                                                                                                            The p95 time it takes for the leader to write log entries to disk.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.dispatchLog.avg

                                                                                                                                                                                                                                                                                                                                                            The average time it takes for the leader to write log entries to disk.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.dispatchLog.count

                                                                                                                                                                                                                                                                                                                                                            The number of samples of raft.leader.dispatchLog.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.dispatchLog.max

                                                                                                                                                                                                                                                                                                                                                            The max time it takes for the leader to write log entries to disk.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.dispatchLog.median

                                                                                                                                                                                                                                                                                                                                                            The median time it takes for the leader to write log entries to disk.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.lastContact.95percentile

                                                                                                                                                                                                                                                                                                                                                            P95 time elapsed since the leader was last able to check its lease with followers.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.lastContact.avg

                                                                                                                                                                                                                                                                                                                                                            Average time elapsed since the leader was last able to check its lease with followers.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.lastContact.count

                                                                                                                                                                                                                                                                                                                                                            The number of samples of raft.leader.lastContact.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.lastContact.max

                                                                                                                                                                                                                                                                                                                                                            Max time elapsed since the leader was last able to check its lease with followers.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.leader.lastContact.median

                                                                                                                                                                                                                                                                                                                                                            Median time elapsed since the leader was last able to check its lease with followers.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.state.candidate

                                                                                                                                                                                                                                                                                                                                                            The number of initiated leader elections.

                                                                                                                                                                                                                                                                                                                                                            consul.raft.state.leader

                                                                                                                                                                                                                                                                                                                                                            Number of completed leader elections.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.alloc_bytes

                                                                                                                                                                                                                                                                                                                                                            Current bytes allocated by the Consul process.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.free_count

                                                                                                                                                                                                                                                                                                                                                            Cumulative count of heap objects freed.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.heap_objects

                                                                                                                                                                                                                                                                                                                                                            Number of objects allocated on the heap.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.malloc_count

                                                                                                                                                                                                                                                                                                                                                            Cumulative count of heap objects allocated.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.num_goroutines

                                                                                                                                                                                                                                                                                                                                                            Number of running goroutines.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.sys_bytes

                                                                                                                                                                                                                                                                                                                                                            Total size of the virtual address space reserved by the Go runtime.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.total_gc_pause_ns

                                                                                                                                                                                                                                                                                                                                                            Cumulative nanoseconds in GC stop-the-world pauses since Consul started.

                                                                                                                                                                                                                                                                                                                                                            consul.runtime.total_gc_runs

                                                                                                                                                                                                                                                                                                                                                            Number of completed GC cycles.

                                                                                                                                                                                                                                                                                                                                                            consul.serf.events

                                                                                                                                                                                                                                                                                                                                                            Incremented when an agent processes a serf event.

                                                                                                                                                                                                                                                                                                                                                            consul.serf.member.flap

                                                                                                                                                                                                                                                                                                                                                            Number of times an agent is marked dead and then quickly recovers.

                                                                                                                                                                                                                                                                                                                                                            consul.serf.member.join

                                                                                                                                                                                                                                                                                                                                                            Incremented when an agent processes a join event.

                                                                                                                                                                                                                                                                                                                                                            5.3.4 -

                                                                                                                                                                                                                                                                                                                                                            Couchbase Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Couchbase integration information.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.avg_bg_wait_time

                                                                                                                                                                                                                                                                                                                                                            The average background wait time.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.avg_disk_commit_time

                                                                                                                                                                                                                                                                                                                                                            The average disk commit time.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.avg_disk_update_time

                                                                                                                                                                                                                                                                                                                                                            The average disk update time.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.bg_wait_total

                                                                                                                                                                                                                                                                                                                                                            The total background wait time.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.bytes_read

                                                                                                                                                                                                                                                                                                                                                            The number of bytes read.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.bytes_written

                                                                                                                                                                                                                                                                                                                                                            The number of bytes written.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cas_badval

                                                                                                                                                                                                                                                                                                                                                            The number of compare and swap bad values.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cas_hits

                                                                                                                                                                                                                                                                                                                                                            The number of compare and swap hits.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cas_misses

                                                                                                                                                                                                                                                                                                                                                            The number of compare and swap misses.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cmd_get

                                                                                                                                                                                                                                                                                                                                                            The number of compare and swap gets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cmd_set

                                                                                                                                                                                                                                                                                                                                                            The number of compare and swap sets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_docs_actual_disk_size

                                                                                                                                                                                                                                                                                                                                                            The size of the couchbase docs on disk.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_docs_data_size

                                                                                                                                                                                                                                                                                                                                                            The data size of the couchbase docs.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_docs_disk_size

                                                                                                                                                                                                                                                                                                                                                            Couch docs total size in bytes.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_docs_fragmentation

                                                                                                                                                                                                                                                                                                                                                            The percentage of couchbase docs fragmentation.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_spatial_data_size

                                                                                                                                                                                                                                                                                                                                                            The size of object data for spatial views.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_spatial_disk_size

                                                                                                                                                                                                                                                                                                                                                            The amount of disk space occupied by spatial views.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_spatial_ops

                                                                                                                                                                                                                                                                                                                                                            Spatial operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_total_disk_size

                                                                                                                                                                                                                                                                                                                                                            The total disk size for couchbase.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_views_data_size

                                                                                                                                                                                                                                                                                                                                                            The size of object data for views.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_views_disk_size

                                                                                                                                                                                                                                                                                                                                                            The amount of disk space occupied by views.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_views_fragmentation

                                                                                                                                                                                                                                                                                                                                                            The view fragmentation.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.couch_views_ops

                                                                                                                                                                                                                                                                                                                                                            View operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cpu_idle_ms

                                                                                                                                                                                                                                                                                                                                                            CPU idle milliseconds.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.cpu_utilization_rate

                                                                                                                                                                                                                                                                                                                                                            CPU utilization percentage.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.curr_connections

                                                                                                                                                                                                                                                                                                                                                            Current bucket connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.curr_items

                                                                                                                                                                                                                                                                                                                                                            Number of active items in memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.curr_items_tot

                                                                                                                                                                                                                                                                                                                                                            Total number of items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.decr_hits

                                                                                                                                                                                                                                                                                                                                                            Decrement hits.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.decr_misses

                                                                                                                                                                                                                                                                                                                                                            Decrement misses.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.delete_hits

                                                                                                                                                                                                                                                                                                                                                            Delete hits.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.delete_misses

                                                                                                                                                                                                                                                                                                                                                            Delete misses.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.disk_commit_count

                                                                                                                                                                                                                                                                                                                                                            Disk commits.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.disk_update_count

                                                                                                                                                                                                                                                                                                                                                            Disk updates.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.disk_write_queue

                                                                                                                                                                                                                                                                                                                                                            Disk write queue depth.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_bg_fetched

                                                                                                                                                                                                                                                                                                                                                            Disk reads per second.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_cache_miss_rate

                                                                                                                                                                                                                                                                                                                                                            Cache miss rate.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_cache_miss_ratio

                                                                                                                                                                                                                                                                                                                                                            Cache miss ratio.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_2i_backoff

                                                                                                                                                                                                                                                                                                                                                            Number of backoffs for indexes DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_2i_count

                                                                                                                                                                                                                                                                                                                                                            Number of indexes DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_2i_items_remaining

                                                                                                                                                                                                                                                                                                                                                            Number of indexes items remaining to be sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_2i_items_sent

                                                                                                                                                                                                                                                                                                                                                            Number of indexes items sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_2i_producer_count

                                                                                                                                                                                                                                                                                                                                                            Number of indexes producers

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_2i_total_bytes

                                                                                                                                                                                                                                                                                                                                                            Number bytes per second being sent for indexes DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_fts_backoff

                                                                                                                                                                                                                                                                                                                                                            Number of backoffs for fts DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_fts_count

                                                                                                                                                                                                                                                                                                                                                            Number of fts DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_fts_items_remaining

                                                                                                                                                                                                                                                                                                                                                            Number of fts items remaining to be sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_fts_items_sent

                                                                                                                                                                                                                                                                                                                                                            Number of fts items sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_fts_producer_count

                                                                                                                                                                                                                                                                                                                                                            Number of fts producers.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_fts_total_bytes

                                                                                                                                                                                                                                                                                                                                                            Number bytes per second being sent for fts DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_other_backoff

                                                                                                                                                                                                                                                                                                                                                            Number of backoffs for other DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_other_count

                                                                                                                                                                                                                                                                                                                                                            Number of other DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_other_items_remaining

                                                                                                                                                                                                                                                                                                                                                            Number of other items remaining to be sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_other_items_sent

                                                                                                                                                                                                                                                                                                                                                            Number of other items sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_other_producer_count

                                                                                                                                                                                                                                                                                                                                                            Number of other producers.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_other_total_bytes

                                                                                                                                                                                                                                                                                                                                                            Number bytes per second being sent for other DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_replica_backoff

                                                                                                                                                                                                                                                                                                                                                            Number of backoffs for replica DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_replica_count

                                                                                                                                                                                                                                                                                                                                                            Number of replica DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_replica_items_remaining

                                                                                                                                                                                                                                                                                                                                                            Number of replica items remaining to be sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_replica_items_sent

                                                                                                                                                                                                                                                                                                                                                            Number of replica items sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_replica_producer_count

                                                                                                                                                                                                                                                                                                                                                            Number of replica producers.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_replica_total_bytes

                                                                                                                                                                                                                                                                                                                                                            Number bytes per second being sent for replica DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_views_backoff

                                                                                                                                                                                                                                                                                                                                                            Number of backoffs for views DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_views_count

                                                                                                                                                                                                                                                                                                                                                            Number of views DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_views_items_remaining

                                                                                                                                                                                                                                                                                                                                                            Number of views items remaining to be sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_views_items_sent

                                                                                                                                                                                                                                                                                                                                                            Number of views items sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_views_producer_count

                                                                                                                                                                                                                                                                                                                                                            Number of views producers.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_views_total_bytes

                                                                                                                                                                                                                                                                                                                                                            Number bytes per second being sent for views DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_xdcr_backoff

                                                                                                                                                                                                                                                                                                                                                            Number of backoffs for xdcr DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_xdcr_count

                                                                                                                                                                                                                                                                                                                                                            Number of xdcr DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_xdcr_items_remaining

                                                                                                                                                                                                                                                                                                                                                            Number of xdcr items remaining to be sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_xdcr_items_sent

                                                                                                                                                                                                                                                                                                                                                            Number of xdcr items sent.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_xdcr_producer_count

                                                                                                                                                                                                                                                                                                                                                            Number of xdcr producers.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_dcp_xdcr_total_bytes

                                                                                                                                                                                                                                                                                                                                                            Number bytes per second being sent for xdcr DCP connections.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_diskqueue_drain

                                                                                                                                                                                                                                                                                                                                                            Total Drained items on disk queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_diskqueue_fill

                                                                                                                                                                                                                                                                                                                                                            Total enqueued items on disk queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_diskqueue_items

                                                                                                                                                                                                                                                                                                                                                            Total number of items waiting to be written to disk.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_flusher_todo

                                                                                                                                                                                                                                                                                                                                                            Number of items currently being written.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_item_commit_failed

                                                                                                                                                                                                                                                                                                                                                            Number of times a transaction failed to commit due to storage errors.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_kv_size

                                                                                                                                                                                                                                                                                                                                                            Total amount of user data cached in RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_max_size

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of memory this bucket can use.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_mem_high_wat

                                                                                                                                                                                                                                                                                                                                                            Memory usage high water mark for auto-evictions.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_mem_low_wat

                                                                                                                                                                                                                                                                                                                                                            Memory usage low water mark for auto-evictions.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_meta_data_memory

                                                                                                                                                                                                                                                                                                                                                            Total amount of item metadata consuming RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_non_resident

                                                                                                                                                                                                                                                                                                                                                            Number of non-resident items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_ops_del_meta

                                                                                                                                                                                                                                                                                                                                                            Number of delete operations per second for this bucket as the target for XDCR.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_ops_del_ret_meta

                                                                                                                                                                                                                                                                                                                                                            Number of delRetMeta operations per second for this bucket as the target for XDCR.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_ops_get_meta

                                                                                                                                                                                                                                                                                                                                                            Number of read operations per second for this bucket as the target for XDCR.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_ops_set_meta

                                                                                                                                                                                                                                                                                                                                                            Number of set operations per second for this bucket as the target for XDCR.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_ops_set_ret_meta

                                                                                                                                                                                                                                                                                                                                                            Number of setRetMeta operations per second for this bucket as the target for XDCR.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_num_value_ejects

                                                                                                                                                                                                                                                                                                                                                            Number of times item values got ejected from memory to disk.\

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_oom_errors

                                                                                                                                                                                                                                                                                                                                                            Number of times unrecoverable OOMs happened while processing operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_ops_create

                                                                                                                                                                                                                                                                                                                                                            Create operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_ops_update

                                                                                                                                                                                                                                                                                                                                                            Update operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_overhead

                                                                                                                                                                                                                                                                                                                                                            Extra memory used by transient data like persistence queues or checkpoints.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_queue_size

                                                                                                                                                                                                                                                                                                                                                            Number of items queued for storage.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_resident_items_rate

                                                                                                                                                                                                                                                                                                                                                            Number of resident items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_tap_replica_queue_drain

                                                                                                                                                                                                                                                                                                                                                            Total drained items in the replica queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_tap_total_queue_drain

                                                                                                                                                                                                                                                                                                                                                            Total drained items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_tap_total_queue_fill

                                                                                                                                                                                                                                                                                                                                                            Total enqueued items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_tap_total_total_backlog_size

                                                                                                                                                                                                                                                                                                                                                            Number of remaining items for replication.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_tmp_oom_errors

                                                                                                                                                                                                                                                                                                                                                            Number of times recoverable OOMs happened while processing operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ep_vb_total

                                                                                                                                                                                                                                                                                                                                                            Total number of vBuckets for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.evictions

                                                                                                                                                                                                                                                                                                                                                            Number of evictions

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.get_hits

                                                                                                                                                                                                                                                                                                                                                            Number of get hits

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.get_misses

                                                                                                                                                                                                                                                                                                                                                            Number of get misses.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.hibernated_requests

                                                                                                                                                                                                                                                                                                                                                            Number of streaming requests now idle.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.hibernated_waked

                                                                                                                                                                                                                                                                                                                                                            Rate of streaming request wakeups.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.hit_ratio

                                                                                                                                                                                                                                                                                                                                                            Hit ratio.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.incr_hits

                                                                                                                                                                                                                                                                                                                                                            Number of increment hits.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.incr_misses

                                                                                                                                                                                                                                                                                                                                                            Number of increment misses.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.mem_actual_free

                                                                                                                                                                                                                                                                                                                                                            Free memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.mem_actual_used

                                                                                                                                                                                                                                                                                                                                                            Used memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.mem_free

                                                                                                                                                                                                                                                                                                                                                            Free memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.mem_total

                                                                                                                                                                                                                                                                                                                                                            Total available memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.mem_used (deprecated)

                                                                                                                                                                                                                                                                                                                                                            Engine’s total memory usage.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.mem_used_sys

                                                                                                                                                                                                                                                                                                                                                            System memory usage.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.misses

                                                                                                                                                                                                                                                                                                                                                            Total number of misses.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.ops

                                                                                                                                                                                                                                                                                                                                                            Total number of operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.page_faults

                                                                                                                                                                                                                                                                                                                                                            Number of page faults.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.replication_docs_rep_queue

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.replication_meta_latency_aggr

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.rest_requests

                                                                                                                                                                                                                                                                                                                                                            Number of HTTP requests.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.swap_total

                                                                                                                                                                                                                                                                                                                                                            Total amount of swap available.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.swap_used

                                                                                                                                                                                                                                                                                                                                                            Amount of swap used.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_eject

                                                                                                                                                                                                                                                                                                                                                            Number of items per second being ejected to disk from active vBuckets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_itm_memory

                                                                                                                                                                                                                                                                                                                                                            Amount of active user data cached in RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_meta_data_memory

                                                                                                                                                                                                                                                                                                                                                            Amount of active item metadata consuming RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_num

                                                                                                                                                                                                                                                                                                                                                            Number of active items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_num_non_resident

                                                                                                                                                                                                                                                                                                                                                            Number of non resident vBuckets in the active state for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_ops_create

                                                                                                                                                                                                                                                                                                                                                            New items per second being inserted into active vBuckets in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_ops_update

                                                                                                                                                                                                                                                                                                                                                            Number of items updated on active vBucket per second for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_queue_age

                                                                                                                                                                                                                                                                                                                                                            Sum of disk queue item age in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_queue_drain

                                                                                                                                                                                                                                                                                                                                                            Total drained items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_queue_fill

                                                                                                                                                                                                                                                                                                                                                            Number of active items per second being put on the active item disk queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_queue_size

                                                                                                                                                                                                                                                                                                                                                            Number of active items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_active_resident_items_ratio

                                                                                                                                                                                                                                                                                                                                                            Number of resident items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_avg_active_queue_age

                                                                                                                                                                                                                                                                                                                                                            Average age in seconds of active items in the active item queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_avg_pending_queue_age

                                                                                                                                                                                                                                                                                                                                                            Average age in seconds of pending items in the pending item queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_avg_replica_queue_age

                                                                                                                                                                                                                                                                                                                                                            Average age in seconds of replica items in the replica item queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_avg_total_queue_age

                                                                                                                                                                                                                                                                                                                                                            Average age of items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_curr_items

                                                                                                                                                                                                                                                                                                                                                            Number of items in pending vBuckets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_eject

                                                                                                                                                                                                                                                                                                                                                            Number of items per second being ejected to disk from pending vBuckets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_itm_memory

                                                                                                                                                                                                                                                                                                                                                            Amount of pending user data cached in RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_meta_data_memory

                                                                                                                                                                                                                                                                                                                                                            Amount of pending item metadata consuming RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_num

                                                                                                                                                                                                                                                                                                                                                            Number of pending items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_num_non_resident

                                                                                                                                                                                                                                                                                                                                                            Number of non resident vBuckets in the pending state for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_ops_create

                                                                                                                                                                                                                                                                                                                                                            Number of pending create operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_ops_update

                                                                                                                                                                                                                                                                                                                                                            Number of items updated on pending vBucket per second for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_queue_age

                                                                                                                                                                                                                                                                                                                                                            Sum of disk pending queue item age in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_queue_drain

                                                                                                                                                                                                                                                                                                                                                            Total drained pending items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_queue_fill

                                                                                                                                                                                                                                                                                                                                                            Total enqueued pending items on disk queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_queue_size

                                                                                                                                                                                                                                                                                                                                                            Number of pending items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_pending_resident_items_ratio

                                                                                                                                                                                                                                                                                                                                                            Number of resident pending items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_curr_items

                                                                                                                                                                                                                                                                                                                                                            Number of in memory items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_eject

                                                                                                                                                                                                                                                                                                                                                            Number of items per second being ejected to disk from replica vBuckets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_itm_memory

                                                                                                                                                                                                                                                                                                                                                            Amount of replica user data cached in RAM in this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_meta_data_memory

                                                                                                                                                                                                                                                                                                                                                            Total metadata memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_num

                                                                                                                                                                                                                                                                                                                                                            Number of replica vBuckets.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_num_non_resident

                                                                                                                                                                                                                                                                                                                                                            Number of non resident vBuckets in the replica state for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_ops_create

                                                                                                                                                                                                                                                                                                                                                            Number of replica create operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_ops_update

                                                                                                                                                                                                                                                                                                                                                            Number of items updated on replica vBucket per second for this bucket.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_queue_age

                                                                                                                                                                                                                                                                                                                                                            Sum of disk replica queue item age in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_queue_drain

                                                                                                                                                                                                                                                                                                                                                            Total drained replica items in the queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_queue_fill

                                                                                                                                                                                                                                                                                                                                                            Total enqueued replica items on disk queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_queue_size

                                                                                                                                                                                                                                                                                                                                                            Replica items in disk queue.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_replica_resident_items_ratio

                                                                                                                                                                                                                                                                                                                                                            Number of resident replica items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.vb_total_queue_age

                                                                                                                                                                                                                                                                                                                                                            Sum of disk queue item age in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_bucket.xdc_ops

                                                                                                                                                                                                                                                                                                                                                            Number of cross-datacenter replication operations.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.couch_docs_actual_disk_size

                                                                                                                                                                                                                                                                                                                                                            Couch docs total size on disk in bytes.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.couch_docs_data_size

                                                                                                                                                                                                                                                                                                                                                            Couch docs data size in bytes.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.couch_views_actual_disk_size

                                                                                                                                                                                                                                                                                                                                                            Couch views total size on disk in bytes.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.couch_views_data_size

                                                                                                                                                                                                                                                                                                                                                            Couch views data size on disk in bytes.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.curr_items

                                                                                                                                                                                                                                                                                                                                                            Number of active items in memory.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.curr_items_tot

                                                                                                                                                                                                                                                                                                                                                            Total number of items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.by_node.vb_replica_curr_items

                                                                                                                                                                                                                                                                                                                                                            Number of in memory items.

                                                                                                                                                                                                                                                                                                                                                            couchbase.hdd.free

                                                                                                                                                                                                                                                                                                                                                            Free hard disk space.

                                                                                                                                                                                                                                                                                                                                                            couchbase.hdd.quota_total

                                                                                                                                                                                                                                                                                                                                                            Hard disk quota.

                                                                                                                                                                                                                                                                                                                                                            couchbase.hdd.total

                                                                                                                                                                                                                                                                                                                                                            Total hard disk space.

                                                                                                                                                                                                                                                                                                                                                            couchbase.hdd.used

                                                                                                                                                                                                                                                                                                                                                            Used hard disk space.

                                                                                                                                                                                                                                                                                                                                                            couchbase.hdd.used_by_data

                                                                                                                                                                                                                                                                                                                                                            Hard disk used for data.

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.cores

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.cpu_sys_percent

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.cpu_user_percent

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.gc_num

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.gc_pause_percent

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.gc_pause_time

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.memory_system

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.memory_total

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.memory_usage

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_active_count

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_completed_count

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_per_sec_15min

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_per_sec_1min

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_per_sec_5min

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_prepared_percent

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_time_80percentile

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_time_95percentile

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_time_99percentile

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_time_mean

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.request_time_median

                                                                                                                                                                                                                                                                                                                                                            couchbase.query.total_threads

                                                                                                                                                                                                                                                                                                                                                            couchbase.ram.quota_total

                                                                                                                                                                                                                                                                                                                                                            RAM quota.

                                                                                                                                                                                                                                                                                                                                                            couchbase.ram.total

                                                                                                                                                                                                                                                                                                                                                            The total RAM available.

                                                                                                                                                                                                                                                                                                                                                            couchbase.ram.used

                                                                                                                                                                                                                                                                                                                                                            The amount of RAM in use.

                                                                                                                                                                                                                                                                                                                                                            couchbase.ram.used_by_data

                                                                                                                                                                                                                                                                                                                                                            The amount of RAM used for data.

                                                                                                                                                                                                                                                                                                                                                            5.3.5 -

                                                                                                                                                                                                                                                                                                                                                            Elasticsearch Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Elasticsearch integration information.

                                                                                                                                                                                                                                                                                                                                                            All Elasticsearch metrics have the type gauge.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.active_primary_shards

                                                                                                                                                                                                                                                                                                                                                            The number of active primary shards in the cluster.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.active_shards

                                                                                                                                                                                                                                                                                                                                                            The number of active shards in the cluster.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.fielddata.estimated_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The estimated size in bytes of the field data circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.fielddata.overhead

                                                                                                                                                                                                                                                                                                                                                            The constant multiplier for byte estimations of the field data circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.fielddata.tripped

                                                                                                                                                                                                                                                                                                                                                            The number of times the field data circuit breaker has tripped.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.parent.estimated_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The estimated size in bytes of the parent circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.parent.overhead

                                                                                                                                                                                                                                                                                                                                                            The constant multiplier for byte estimations of the parent circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.parent.tripped

                                                                                                                                                                                                                                                                                                                                                            The number of times the parent circuit breaker has tripped.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.request.estimated_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The estimated size in bytes of the request circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.request.overhead

                                                                                                                                                                                                                                                                                                                                                            The constant multiplier for byte estimations of the request circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.request.tripped

                                                                                                                                                                                                                                                                                                                                                            The number of times the request circuit breaker has tripped.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.inflight_requests.tripped

                                                                                                                                                                                                                                                                                                                                                            The number of times the inflight circuit breaker has tripped.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.inflight_requests.overhead

                                                                                                                                                                                                                                                                                                                                                            The constant multiplier for byte estimations of the inflight circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.breakers.inflight_requests.estimated_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The estimated size in bytes of the inflight circuit breaker.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.cache.field.evictions

                                                                                                                                                                                                                                                                                                                                                            The total number of evictions from the field data cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.cache.field.size

                                                                                                                                                                                                                                                                                                                                                            The size of the field cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.cache.filter.count

                                                                                                                                                                                                                                                                                                                                                            The number of items in the filter cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.cache.filter.evictions

                                                                                                                                                                                                                                                                                                                                                            The total number of evictions from the filter cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.cache.filter.size

                                                                                                                                                                                                                                                                                                                                                            The size of the filter cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.cluster_status

                                                                                                                                                                                                                                                                                                                                                            The elasticsearch cluster health as a number: red = 0, yellow = 1, green = 2

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.docs.count

                                                                                                                                                                                                                                                                                                                                                            The total number of documents in the cluster across all shards.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.docs.deleted

                                                                                                                                                                                                                                                                                                                                                            The total number of documents deleted from the cluster across all shards.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fielddata.evictions

                                                                                                                                                                                                                                                                                                                                                            The total number of evictions from the fielddata cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fielddata.size

                                                                                                                                                                                                                                                                                                                                                            The size of the fielddata cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.flush.total

                                                                                                                                                                                                                                                                                                                                                            The total number of index flushes to disk since start.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.flush.total.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent flushing the index to disk.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.available_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes available to this Java virtual machine on this file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.disk_io_op

                                                                                                                                                                                                                                                                                                                                                            The total I/O operations on the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.disk_io_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            Total bytes used for all I/O operations on the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.disk_read_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The total bytes read from the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.disk_reads

                                                                                                                                                                                                                                                                                                                                                            The total number of reads from the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.disk_write_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The total bytes written to the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.disk_writes

                                                                                                                                                                                                                                                                                                                                                            The total number of writes to the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.free_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of unallocated bytes in the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.fs.total.total_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The total size in bytes of the file store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.current

                                                                                                                                                                                                                                                                                                                                                            The number of get requests currently running.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.exists.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on get requests where the document existed.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.exists.total

                                                                                                                                                                                                                                                                                                                                                            The total number of get requests where the document existed.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.missing.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on get requests where the document was missing.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.missing.total

                                                                                                                                                                                                                                                                                                                                                            The total number of get requests where the document was missing.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on get requests.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.get.total

                                                                                                                                                                                                                                                                                                                                                            The total number of get requests.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.http.current_open

                                                                                                                                                                                                                                                                                                                                                            The number of current open HTTP connections.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.http.total_opened

                                                                                                                                                                                                                                                                                                                                                            The total number of opened HTTP connections.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.id_cache.size

                                                                                                                                                                                                                                                                                                                                                            The size of the id cache

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indexing.delete.current

                                                                                                                                                                                                                                                                                                                                                            The number of documents currently being deleted from an index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indexing.delete.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent deleting documents from an index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indexing.delete.total

                                                                                                                                                                                                                                                                                                                                                            The total number of documents deleted from an index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indexing.index.current

                                                                                                                                                                                                                                                                                                                                                            The number of documents currently being indexed to an index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indexing.index.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent indexing documents to an index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indexing.index.total

                                                                                                                                                                                                                                                                                                                                                            The total number of documents indexed to an index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.count

                                                                                                                                                                                                                                                                                                                                                            The number of indices in the cluster.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.indexing.index_failed

                                                                                                                                                                                                                                                                                                                                                            The number of failed indexing operations.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.indexing.throttle_time

                                                                                                                                                                                                                                                                                                                                                            The total time indexing waited due to throttling.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.query_cache.evictions

                                                                                                                                                                                                                                                                                                                                                            The number of query cache evictions.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.query_cache.hit_count

                                                                                                                                                                                                                                                                                                                                                            The number of query cache hits.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.query_cache.memory_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by the query cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.query_cache.miss_count

                                                                                                                                                                                                                                                                                                                                                            The number of query cache misses.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.recovery.current_as_source

                                                                                                                                                                                                                                                                                                                                                            The number of ongoing recoveries for which a shard serves as a source.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.recovery.current_as_target

                                                                                                                                                                                                                                                                                                                                                            The number of ongoing recoveries for which a shard serves as a target.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.recovery.throttle_time

                                                                                                                                                                                                                                                                                                                                                            The total time recoveries waited due to throttling.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.request_cache.evictions

                                                                                                                                                                                                                                                                                                                                                            The number of request cache evictions.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.request_cache.hit_count

                                                                                                                                                                                                                                                                                                                                                            The number of request cache hits.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.request_cache.memory_size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by the request cache.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.request_cache.miss_count

                                                                                                                                                                                                                                                                                                                                                            The number of request cache misses.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.count

                                                                                                                                                                                                                                                                                                                                                            The number of segments in an index shard.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.doc_values_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by doc values.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.fixed_bit_set_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by fixed bit set.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.index_writer_max_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The maximum memory used by the index writer.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.index_writer_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by the index writer.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by index segments.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.norms_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by norms.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.stored_fields_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by stored fields.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.term_vectors_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by term vectors.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.terms_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by terms.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.segments.version_map_memory_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The memory used by the segment version map.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.translog.operations

                                                                                                                                                                                                                                                                                                                                                            The number of operations in the transaction log.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.indices.translog.size_in_bytes

                                                                                                                                                                                                                                                                                                                                                            The size of the transaction log.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.initializing_shards

                                                                                                                                                                                                                                                                                                                                                            The number of shards that are currently initializing.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.current

                                                                                                                                                                                                                                                                                                                                                            The number of currently active segment merges.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.current.docs

                                                                                                                                                                                                                                                                                                                                                            The number of documents across segments currently being merged.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.current.size

                                                                                                                                                                                                                                                                                                                                                            The size of the segments currently being merged.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.total

                                                                                                                                                                                                                                                                                                                                                            The total number of segment merges.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.total.docs

                                                                                                                                                                                                                                                                                                                                                            The total number of documents across all merged segments.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.total.size

                                                                                                                                                                                                                                                                                                                                                            The total size of all merged segments.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.merges.total.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on segment merging.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.number_of_data_nodes

                                                                                                                                                                                                                                                                                                                                                            The number of data nodes in the cluster.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.number_of_nodes

                                                                                                                                                                                                                                                                                                                                                            The total number of nodes in the cluster.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.pending_tasks_priority_high

                                                                                                                                                                                                                                                                                                                                                            The number of high priority pending tasks.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.pending_tasks_priority_urgent

                                                                                                                                                                                                                                                                                                                                                            The number of urgent priority pending tasks.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.pending_tasks_time_in_queue

                                                                                                                                                                                                                                                                                                                                                            The average time spent by tasks in the queue.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.pending_tasks_total

                                                                                                                                                                                                                                                                                                                                                            The total number of pending tasks.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.process.open_fd

                                                                                                                                                                                                                                                                                                                                                            The number of opened file descriptors associated with the current process, or -1 if not supported.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.refresh.total

                                                                                                                                                                                                                                                                                                                                                            The total number of index refreshes.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.refresh.total.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on index refreshes.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.relocating_shards

                                                                                                                                                                                                                                                                                                                                                            The number of shards that are relocating from one node to another.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.fetch.current

                                                                                                                                                                                                                                                                                                                                                            The number of search fetches currently running.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.fetch.open_contexts

                                                                                                                                                                                                                                                                                                                                                            The number of active searches.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.fetch.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on the search fetch.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.fetch.total

                                                                                                                                                                                                                                                                                                                                                            The total number of search fetches.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.query.current

                                                                                                                                                                                                                                                                                                                                                            The number of currently active queries.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.query.time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on queries.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.search.query.total

                                                                                                                                                                                                                                                                                                                                                            The total number of queries.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.store.size

                                                                                                                                                                                                                                                                                                                                                            The total size in bytes of the store.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.bulk.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the bulk pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.bulk.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the bulk pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.bulk.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the bulk pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.bulk.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the bulk pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_started.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the fetch shard started pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_started.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the fetch shard started pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_started.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the fetch shard started pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_started.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the fetch shard started pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_store.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the fetch shard store pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_store.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the fetch shard store pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_store.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the fetch shard store pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.fetch_shard_store.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the fetch shard store pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.flush.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the flush queue.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.flush.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the flush pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.flush.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the flush pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.flush.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the flush pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.force_merge.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads for force merge operations.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.force_merge.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads for force merge operations.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.force_merge.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads for force merge operations.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.force_merge.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads for force merge operations.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.generic.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the generic pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.generic.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the generic pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.generic.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the generic pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.generic.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the generic pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.get.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the get pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.get.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the get pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.get.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the get pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.get.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the get pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.index.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the index pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.index.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the index pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.index.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the index pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.index.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the index pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.listener.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the listener pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.listener.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the listener pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.listener.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the listener pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.listener.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the listener pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.management.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the management pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.management.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the management pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.management.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the management pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.management.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the management pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.merge.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the merge pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.merge.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the merge pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.merge.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the merge pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.merge.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the merge pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.percolate.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the percolate pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.percolate.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the percolate pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.percolate.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the percolate pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.percolate.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the percolate pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.refresh.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the refresh pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.refresh.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the refresh pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.refresh.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the refresh pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.refresh.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the refresh pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.search.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the search pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.search.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the search pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.search.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the search pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.search.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the search pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.snapshot.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the snapshot pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.snapshot.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the snapshot pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.snapshot.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the snapshot pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.snapshot.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the snapshot pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.write.active

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the write pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.write.queue

                                                                                                                                                                                                                                                                                                                                                            The number of queued threads in the write pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.write.threads

                                                                                                                                                                                                                                                                                                                                                            The total number of threads in the write pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.thread_pool.write.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected threads in the write pool.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.transport.rx_count

                                                                                                                                                                                                                                                                                                                                                            The total number of packets received in cluster communication.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.transport.rx_size

                                                                                                                                                                                                                                                                                                                                                            The total size of data received in cluster communication.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.transport.server_open

                                                                                                                                                                                                                                                                                                                                                            The number of connections opened for cluster communication.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.transport.tx_count

                                                                                                                                                                                                                                                                                                                                                            The total number of packets sent in cluster communication.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.transport.tx_size

                                                                                                                                                                                                                                                                                                                                                            The total size of data sent in cluster communication.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.unassigned_shards

                                                                                                                                                                                                                                                                                                                                                            The number of shards that are unassigned to a node.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.delayed_unassigned_shards

                                                                                                                                                                                                                                                                                                                                                            The number of shards whose allocation has been delayed.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.collection_count

                                                                                                                                                                                                                                                                                                                                                            The total number of garbage collections run by the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.collection_time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on garbage collection in the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.collectors.old.collection_time

                                                                                                                                                                                                                                                                                                                                                            The total time spent in major GCs in the JVM that collect old generation objects.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.collectors.old.count

                                                                                                                                                                                                                                                                                                                                                            The total count of major GCs in the JVM that collect old generation objects.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.collectors.young.collection_time

                                                                                                                                                                                                                                                                                                                                                            The total time spent in minor GCs in the JVM that collects young generation objects.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.collectors.young.count

                                                                                                                                                                                                                                                                                                                                                            The total count of minor GCs in the JVM that collects young generation objects.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.concurrent_mark_sweep.collection_time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on “concurrent mark & sweep” GCs in the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.concurrent_mark_sweep.count

                                                                                                                                                                                                                                                                                                                                                            The total count of “concurrent mark & sweep” GCs in the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.par_new.collection_time

                                                                                                                                                                                                                                                                                                                                                            The total time spent on “parallel new” GCs in the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.par_new.count

                                                                                                                                                                                                                                                                                                                                                            The total count of “parallel new” GCs in the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.heap_committed

                                                                                                                                                                                                                                                                                                                                                            The amount of memory guaranteed to be available to the JVM heap.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.heap_in_use

                                                                                                                                                                                                                                                                                                                                                            The amount of memory currently used by the JVM heap as a value between 0 and 1.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.heap_max

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of memory that can be used by the JVM heap.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.heap_used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory in bytes currently used by the JVM heap.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.non_heap_committed

                                                                                                                                                                                                                                                                                                                                                            The amount of memory guaranteed to be available to JVM non-heap.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.non_heap_used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory in bytes currently used by the JVM non-heap.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.pools.young.used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory in bytes currently used by the Young Generation heap region.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.pools.young.max

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of memory that can be used by the Young Generation heap region.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.pools.old.used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory in bytes currently used by the Old Generation heap region.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.pools.old.max

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of memory that can be used by the Old Generation heap region.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.pools.survivor.used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory in bytes currently used by the Survivor Space.

                                                                                                                                                                                                                                                                                                                                                            jvm.mem.pools.survivor.max

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of memory that can be used by the Survivor Space.

                                                                                                                                                                                                                                                                                                                                                            jvm.threads.count

                                                                                                                                                                                                                                                                                                                                                            The number of active threads in the JVM.

                                                                                                                                                                                                                                                                                                                                                            jvm.threads.peak_count

                                                                                                                                                                                                                                                                                                                                                            The peak number of threads used by the JVM.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.health

                                                                                                                                                                                                                                                                                                                                                            The status of the index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.docs.count

                                                                                                                                                                                                                                                                                                                                                            The number of documents in the index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.docs.deleted

                                                                                                                                                                                                                                                                                                                                                            The number of deleted documents in the index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.primary_shards

                                                                                                                                                                                                                                                                                                                                                            The number of primary shards in the index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.replica_shards

                                                                                                                                                                                                                                                                                                                                                            The number of replica shards in the index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.primary_store_size

                                                                                                                                                                                                                                                                                                                                                            The store size of primary shards in the index.

                                                                                                                                                                                                                                                                                                                                                            elasticsearch.index.store_size

                                                                                                                                                                                                                                                                                                                                                            The store size of primary and replica shards in the index.

                                                                                                                                                                                                                                                                                                                                                            5.3.6 -

                                                                                                                                                                                                                                                                                                                                                            etcd Metrics

                                                                                                                                                                                                                                                                                                                                                            For related information, see etcd integration.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.counts.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of failed Raft RPC requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.counts.success

                                                                                                                                                                                                                                                                                                                                                            Rate of successful Raft RPC requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.latency.avg

                                                                                                                                                                                                                                                                                                                                                            Average latency to each peer in the cluster.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.latency.current

                                                                                                                                                                                                                                                                                                                                                            Current latency to each peer in the cluster.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.latency.max

                                                                                                                                                                                                                                                                                                                                                            Maximum latency to each peer in the cluster.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.latency.min

                                                                                                                                                                                                                                                                                                                                                            Minimum latency to each peer in the cluster.

                                                                                                                                                                                                                                                                                                                                                            etcd.leader.latency.stddev

                                                                                                                                                                                                                                                                                                                                                            Standard deviation latency to each peer in the cluster.

                                                                                                                                                                                                                                                                                                                                                            etcd.self.recv.appendrequest.count

                                                                                                                                                                                                                                                                                                                                                            Rate of append requests this node has processed.

                                                                                                                                                                                                                                                                                                                                                            etcd.self.recv.bandwidthrate

                                                                                                                                                                                                                                                                                                                                                            Rate of bytes received.

                                                                                                                                                                                                                                                                                                                                                            etcd.self.recv.pkgrate

                                                                                                                                                                                                                                                                                                                                                            Rate of packets received.

                                                                                                                                                                                                                                                                                                                                                            etcd.self.send.appendrequest.count

                                                                                                                                                                                                                                                                                                                                                            Rate of append requests this node has sent.

                                                                                                                                                                                                                                                                                                                                                            etcd.self.send.bandwidthrate

                                                                                                                                                                                                                                                                                                                                                            Rate of bytes sent.

                                                                                                                                                                                                                                                                                                                                                            etcd.self.send.pkgrate

                                                                                                                                                                                                                                                                                                                                                            Rate of packets sent.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.compareanddelete.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of compare and delete requests failure.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.compareanddelete.success

                                                                                                                                                                                                                                                                                                                                                            Rate of compare and delete requests success.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.compareandswap.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of compare and swap requests failure.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.compareandswap.success

                                                                                                                                                                                                                                                                                                                                                            Rate of compare and swap requests success.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.create.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of failed create requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.create.success

                                                                                                                                                                                                                                                                                                                                                            Rate of successful create requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.delete.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of failed delete requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.delete.success

                                                                                                                                                                                                                                                                                                                                                            Rate of successful delete requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.expire.count

                                                                                                                                                                                                                                                                                                                                                            Rate of expired keys.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.gets.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of failed get requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.gets.success

                                                                                                                                                                                                                                                                                                                                                            Rate of successful get requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.sets.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of failed set requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.sets.success

                                                                                                                                                                                                                                                                                                                                                            Rate of successful set requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.update.fail

                                                                                                                                                                                                                                                                                                                                                            Rate of failed update requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.update.success

                                                                                                                                                                                                                                                                                                                                                            Rate of successful update requests.

                                                                                                                                                                                                                                                                                                                                                            etcd.store.watchers

                                                                                                                                                                                                                                                                                                                                                            Rate of watchers.

                                                                                                                                                                                                                                                                                                                                                            5.3.7 -

                                                                                                                                                                                                                                                                                                                                                            fluentd Metrics

                                                                                                                                                                                                                                                                                                                                                            For related information, see fluentd integration.

                                                                                                                                                                                                                                                                                                                                                            fluentd.buffer_queue_length

                                                                                                                                                                                                                                                                                                                                                            The length of the plugin buffer queue for this plugin.

                                                                                                                                                                                                                                                                                                                                                            fluentd.buffer_total_queued_size

                                                                                                                                                                                                                                                                                                                                                            The size of the buffer queue for this plugin.

                                                                                                                                                                                                                                                                                                                                                            fluentd.retry_count

                                                                                                                                                                                                                                                                                                                                                            The number of retries for this plugin.

                                                                                                                                                                                                                                                                                                                                                            5.3.8 -

                                                                                                                                                                                                                                                                                                                                                            Go Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Go integration information.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.alloc

                                                                                                                                                                                                                                                                                                                                                            The number of bytes allocated and not yet freed.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.frees

                                                                                                                                                                                                                                                                                                                                                            The number of free bytes.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.heap_alloc

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.heap_idle

                                                                                                                                                                                                                                                                                                                                                            The number of bytes in idle spans.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.heap_inuse

                                                                                                                                                                                                                                                                                                                                                            The number of bytes in non-idle spans.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.heap_objects

                                                                                                                                                                                                                                                                                                                                                            The total number of allocated objects.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.heap_released

                                                                                                                                                                                                                                                                                                                                                            The number of bytes released to the OS.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.heap_sys

                                                                                                                                                                                                                                                                                                                                                            The number of bytes obtained from the system.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.lookups

                                                                                                                                                                                                                                                                                                                                                            The number of pointer lookups.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.mallocs

                                                                                                                                                                                                                                                                                                                                                            The number of mallocs.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.num_gc

                                                                                                                                                                                                                                                                                                                                                            The number of garbage collections.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.pause_ns.avg

                                                                                                                                                                                                                                                                                                                                                            The average of recent GC pause durations.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.pause_ns.count

                                                                                                                                                                                                                                                                                                                                                            The number of submitted GC pause durations.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.pause_ns.max

                                                                                                                                                                                                                                                                                                                                                            The max GC pause duration.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.pause_ns.median

                                                                                                                                                                                                                                                                                                                                                            The median GC pause duration.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.pause_total_ns

                                                                                                                                                                                                                                                                                                                                                            The total GC pause duration over the lifetime of process.

                                                                                                                                                                                                                                                                                                                                                            go_expvar.memstats.total_alloc

                                                                                                                                                                                                                                                                                                                                                            The bytes allocated (even if freed).

                                                                                                                                                                                                                                                                                                                                                            5.3.9 -

                                                                                                                                                                                                                                                                                                                                                            HTTP Metrics

                                                                                                                                                                                                                                                                                                                                                            See HTTP integration.

                                                                                                                                                                                                                                                                                                                                                            http.ssl.days_left

                                                                                                                                                                                                                                                                                                                                                            The number of days until the SSL certificate expires.

                                                                                                                                                                                                                                                                                                                                                            network.http.response_time

                                                                                                                                                                                                                                                                                                                                                            The response time of a HTTP request to a specified URL.

                                                                                                                                                                                                                                                                                                                                                            5.3.10 -

                                                                                                                                                                                                                                                                                                                                                            HAProxy Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: HAProxy integration information.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend_hosts

                                                                                                                                                                                                                                                                                                                                                            The number of backend hosts.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.bytes.in_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes in on backend hosts.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.bytes.out_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes out on backend hosts.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.connect.time

                                                                                                                                                                                                                                                                                                                                                            The average connect time over the last 1024 requests.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.denied.req_rate

                                                                                                                                                                                                                                                                                                                                                            The number of requests denied due to security concerns.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.denied.resp_rate

                                                                                                                                                                                                                                                                                                                                                            The number of responses denied due to security concerns.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.errors.con_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of requests that encountered an error trying to connect to a backend server.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.errors.resp_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of responses aborted due to error.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.queue.current

                                                                                                                                                                                                                                                                                                                                                            The number of requests without an assigned backend.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.queue.time

                                                                                                                                                                                                                                                                                                                                                            The average queue time over the last 1024 requests.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.1xx

                                                                                                                                                                                                                                                                                                                                                            The backend HTTP responses with 1xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.2xx

                                                                                                                                                                                                                                                                                                                                                            The backend HTTP responses with 2xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.3xx

                                                                                                                                                                                                                                                                                                                                                            The backend HTTP responses with 3xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.4xx

                                                                                                                                                                                                                                                                                                                                                            The backend HTTP responses with 4xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.5xx

                                                                                                                                                                                                                                                                                                                                                            The backend HTTP responses with 5xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.other

                                                                                                                                                                                                                                                                                                                                                            The backend HTTP responses with another code (protocol error).

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.response.time

                                                                                                                                                                                                                                                                                                                                                            The average response time over the last 1024 requests (0 for TCP).

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.session.current

                                                                                                                                                                                                                                                                                                                                                            The number of active backend sessions.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.session.limit

                                                                                                                                                                                                                                                                                                                                                            The configured backend session limit.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.session.pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of sessions in use. The formula used for this metric is backend.session.current / backend.session.limit * 100.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.session.rate

                                                                                                                                                                                                                                                                                                                                                            The number of backend sessions created per second.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.session.time

                                                                                                                                                                                                                                                                                                                                                            The average total session time over the last 1024 requests.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.uptime

                                                                                                                                                                                                                                                                                                                                                            The number of seconds since the last UP<->DOWN transition.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.warnings.redis_rate

                                                                                                                                                                                                                                                                                                                                                            The number of times a request was redispatched to another server.

                                                                                                                                                                                                                                                                                                                                                            haproxy.backend.warnings.retr_rate

                                                                                                                                                                                                                                                                                                                                                            The number of times a connection to a server was retried.

                                                                                                                                                                                                                                                                                                                                                            haproxy.count_per_status

                                                                                                                                                                                                                                                                                                                                                            The number of hosts by status (UP/DOWN/NOLB/MAINT).

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.bytes.in_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes in on frontend hosts.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.bytes.out_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes out on frontend hosts.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.denied.req_rate

                                                                                                                                                                                                                                                                                                                                                            The number of requests denied due to security concerns.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.denied.resp_rate

                                                                                                                                                                                                                                                                                                                                                            The number of responses denied due to security concerns.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.errors.req_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of request errors.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.requests.rate

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP requests per second.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.response.1xx

                                                                                                                                                                                                                                                                                                                                                            The frontend HTTP responses with 1xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.response.2xx

                                                                                                                                                                                                                                                                                                                                                            The frontend HTTP responses with 2xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.response.3xx

                                                                                                                                                                                                                                                                                                                                                            The frontend HTTP responses with 3xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.response.4xx

                                                                                                                                                                                                                                                                                                                                                            The frontend HTTP responses with 4xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.response.5xx

                                                                                                                                                                                                                                                                                                                                                            The frontend HTTP responses with 5xx code.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.response.other

                                                                                                                                                                                                                                                                                                                                                            The frontend HTTP responses with another code (protocol error).

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.session.current

                                                                                                                                                                                                                                                                                                                                                            The number of active frontend sessions.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.session.limit

                                                                                                                                                                                                                                                                                                                                                            The configured backend session limit.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.session.pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of sessions in use. The formula used for this metric is frontend.session.current / frontend.session.limit * 100.

                                                                                                                                                                                                                                                                                                                                                            haproxy.frontend.session.rate

                                                                                                                                                                                                                                                                                                                                                            The number of frontend sessions created per second.

                                                                                                                                                                                                                                                                                                                                                            Agent 9.6.0 Additional HAProxy Metrics

                                                                                                                                                                                                                                                                                                                                                            • haproxy.backend.requests.tot_rate

                                                                                                                                                                                                                                                                                                                                                              Rate of total number of HTTP requests

                                                                                                                                                                                                                                                                                                                                                            • haproxy.frontend.connections.rate

                                                                                                                                                                                                                                                                                                                                                              Number of connections per second

                                                                                                                                                                                                                                                                                                                                                            • haproxy.frontend.connections.tot_rate

                                                                                                                                                                                                                                                                                                                                                              Rate of total number of connections

                                                                                                                                                                                                                                                                                                                                                            • haproxy.frontend.requests.intercepted

                                                                                                                                                                                                                                                                                                                                                              Number of intercepted requests per second

                                                                                                                                                                                                                                                                                                                                                            • haproxy.frontend.requests.tot_rate

                                                                                                                                                                                                                                                                                                                                                              Rate of total number of HTTP requests

                                                                                                                                                                                                                                                                                                                                                            5.3.11 -

                                                                                                                                                                                                                                                                                                                                                            Jenkins Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Jenkins integration information.

                                                                                                                                                                                                                                                                                                                                                            jenkins.job.duration

                                                                                                                                                                                                                                                                                                                                                            The duration of a job, measured in seconds.

                                                                                                                                                                                                                                                                                                                                                            jenkins.job.success

                                                                                                                                                                                                                                                                                                                                                            The status of a successful job.

                                                                                                                                                                                                                                                                                                                                                            jenkins.job.failure

                                                                                                                                                                                                                                                                                                                                                            The status of a failed job.

                                                                                                                                                                                                                                                                                                                                                            5.3.12 -

                                                                                                                                                                                                                                                                                                                                                            Lighttpd Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Lighttpd integration information.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.net.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes sent and received.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.net.bytes_per_s

                                                                                                                                                                                                                                                                                                                                                            The number of bytes sent and received per second.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.net.hits

                                                                                                                                                                                                                                                                                                                                                            The total number of hits since the start.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.net.request_per_s

                                                                                                                                                                                                                                                                                                                                                            The number of requests per second.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.performance.busy_servers

                                                                                                                                                                                                                                                                                                                                                            The number of active connections.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.performance.idle_server

                                                                                                                                                                                                                                                                                                                                                            The number of idle connections.

                                                                                                                                                                                                                                                                                                                                                            lighttpd.performance.uptime

                                                                                                                                                                                                                                                                                                                                                            The amount of time the server has been up and running.

                                                                                                                                                                                                                                                                                                                                                            5.3.13 -

                                                                                                                                                                                                                                                                                                                                                            Memcached Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Memcached integration information.

                                                                                                                                                                                                                                                                                                                                                            memcache.avg_item_size

                                                                                                                                                                                                                                                                                                                                                            The average size of an item.

                                                                                                                                                                                                                                                                                                                                                            memcache.bytes

                                                                                                                                                                                                                                                                                                                                                            The current number of bytes used by this server to store items.

                                                                                                                                                                                                                                                                                                                                                            memcache.bytes_read_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes read from the network by this server.

                                                                                                                                                                                                                                                                                                                                                            memcache.bytes_written_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of bytes written to the network by this server.

                                                                                                                                                                                                                                                                                                                                                            memcache.cas_badval_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which keys are compared and swapped where the comparison (original) value did not match the supplied value.

                                                                                                                                                                                                                                                                                                                                                            memcache.cas_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which keys are compared and swapped and found present.

                                                                                                                                                                                                                                                                                                                                                            memcache.cas_misses_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which keys are compared and swapped and not found present.

                                                                                                                                                                                                                                                                                                                                                            memcache.cmd_flush_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of flush_all commands.

                                                                                                                                                                                                                                                                                                                                                            memcache.cmd_get_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of get commands.

                                                                                                                                                                                                                                                                                                                                                            memcache.cmd_set_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of set commands.

                                                                                                                                                                                                                                                                                                                                                            memcache.connection_structures

                                                                                                                                                                                                                                                                                                                                                            The number of connection structures allocated by the server.

                                                                                                                                                                                                                                                                                                                                                            memcache.curr_connections

                                                                                                                                                                                                                                                                                                                                                            The number of open connections to this server.

                                                                                                                                                                                                                                                                                                                                                            memcache.curr_items

                                                                                                                                                                                                                                                                                                                                                            The current number of items stored by the server.

                                                                                                                                                                                                                                                                                                                                                            memcache.delete_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which delete commands result in items being removed.

                                                                                                                                                                                                                                                                                                                                                            memcache.delete_misses_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which delete commands result in no items being removed.

                                                                                                                                                                                                                                                                                                                                                            memcache.evictions_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which valid items are removed from cache to free memory for new items.

                                                                                                                                                                                                                                                                                                                                                            memcache.fill_percent

                                                                                                                                                                                                                                                                                                                                                            The amount of memory being used by the server for storing items as a percentage of the max allowed.

                                                                                                                                                                                                                                                                                                                                                            memcache.get_hit_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of requested keys that are found present since the start of the Memcached server.

                                                                                                                                                                                                                                                                                                                                                            memcache.get_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which keys are requested and found present.

                                                                                                                                                                                                                                                                                                                                                            memcache.get_misses_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which keys are requested and not found.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.age

                                                                                                                                                                                                                                                                                                                                                            The age of the oldest item in the LRU.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.crawler_reclaimed_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which items freed by the LRU Crawler.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.direct_reclaims_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which worker threads had to directly pull LRU tails to find memory for a new item.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.evicted_nonzero_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which nonzero items which had an explicit expire time set had to be evicted from the LRU before expiring.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.evicted_rate

                                                                                                                                                                                                                                                                                                                                                            The rate st which items had to be evicted from the LRU before expiring.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.evicted_time

                                                                                                                                                                                                                                                                                                                                                            The number of seconds since the last access for the most recent item evicted from this class.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.evicted_unfetched_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which valid items evicted from the LRU which were never touched after being set.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.expired_unfetched_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which expired items reclaimed from the LRU which were never touched after being set.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.lrutail_reflocked_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which items found to be refcount locked in the LRU tail.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.moves_to_cold_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which items were moved from HOT or WARM into COLD.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.moves_to_warm_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which items were moved from COLD to WARM.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.moves_within_lru_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which active items were bumped within HOT or WARM.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.number

                                                                                                                                                                                                                                                                                                                                                            The number of items presently stored in this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.number_cold

                                                                                                                                                                                                                                                                                                                                                            The number of items presently stored in the COLD LRU.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.number_hot

                                                                                                                                                                                                                                                                                                                                                            The number of items presently stored in the HOT LRU.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.number_noexp

                                                                                                                                                                                                                                                                                                                                                            The number of items presently stored in the NOEXP class.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.number_warm

                                                                                                                                                                                                                                                                                                                                                            The number of items presently stored in the WARM LRU.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.outofmemory_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which the underlying slab class was unable to store a new item.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.reclaimed_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which entries were stored using memory from an expired entry.

                                                                                                                                                                                                                                                                                                                                                            memcache.items.tailrepairs_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which Memcached self-healed a slab with a refcount leak.

                                                                                                                                                                                                                                                                                                                                                            memcache.limit_maxbytes

                                                                                                                                                                                                                                                                                                                                                            The number of bytes this server is allowed to use for storage.

                                                                                                                                                                                                                                                                                                                                                            memcache.listen_disabled_num_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which the server has reached the max connection limit.

                                                                                                                                                                                                                                                                                                                                                            memcache.pointer_size

                                                                                                                                                                                                                                                                                                                                                            The default size of pointers on the host OS (generally 32 or 64).

                                                                                                                                                                                                                                                                                                                                                            memcache.rusage_system_rate

                                                                                                                                                                                                                                                                                                                                                            The fraction of user time the CPU spent executing this server process.

                                                                                                                                                                                                                                                                                                                                                            memcache.rusage_user_rate

                                                                                                                                                                                                                                                                                                                                                            The fraction of time the CPU spent executing kernel code on behalf of this server process.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.active_slabs

                                                                                                                                                                                                                                                                                                                                                            The total number of slab classes allocated.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.cas_badval_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which CAS commands failed to modify a value due to a bad CAS ID.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.cas_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which CAS commands modified this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.chunk_size

                                                                                                                                                                                                                                                                                                                                                            The amount of space each chunk uses.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.chunks_per_page

                                                                                                                                                                                                                                                                                                                                                            The number of chunks that exist within one page.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.cmd_set_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which set requests stored data in this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.decr_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which decrs commands modified this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.delete_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which delete commands succeeded in this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.free_chunks

                                                                                                                                                                                                                                                                                                                                                            The number of chunks not yet allocated to items or freed via delete.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.free_chunks_end

                                                                                                                                                                                                                                                                                                                                                            The number of free chunks at the end of the last allocated page.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.get_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which get requests were serviced by this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.incr_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which incrs commands modified this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.mem_requested

                                                                                                                                                                                                                                                                                                                                                            The number of bytes requested to be stored in this slab.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.total_chunks

                                                                                                                                                                                                                                                                                                                                                            The total number of chunks allocated to the slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.total_malloced

                                                                                                                                                                                                                                                                                                                                                            The total amount of memory allocated to slab pages.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.total_pages

                                                                                                                                                                                                                                                                                                                                                            The total number of pages allocated to the slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.touch_hits_rate

                                                                                                                                                                                                                                                                                                                                                            The rate of touches serviced by this slab class.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.used_chunks

                                                                                                                                                                                                                                                                                                                                                            The number of chunks that have been allocated to items.

                                                                                                                                                                                                                                                                                                                                                            memcache.slabs.used_chunks_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which chunks have been allocated to items.

                                                                                                                                                                                                                                                                                                                                                            memcache.threads

                                                                                                                                                                                                                                                                                                                                                            The number of threads used by the current Memcached server process.

                                                                                                                                                                                                                                                                                                                                                            memcache.total_connections_rate

                                                                                                                                                                                                                                                                                                                                                            The rate at which connections to this server are opened.

                                                                                                                                                                                                                                                                                                                                                            memcache.total_items

                                                                                                                                                                                                                                                                                                                                                            The total number of items stored by this server since it started.

                                                                                                                                                                                                                                                                                                                                                            memcache.uptime

                                                                                                                                                                                                                                                                                                                                                            The number of seconds this server has been running.

                                                                                                                                                                                                                                                                                                                                                            5.3.14 -

                                                                                                                                                                                                                                                                                                                                                            Mesos/Marathon Metrics

                                                                                                                                                                                                                                                                                                                                                            Contents

                                                                                                                                                                                                                                                                                                                                                            5.3.14.1 -

                                                                                                                                                                                                                                                                                                                                                            Mesos Agent Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Mesos/Marathon integration information.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.cpus_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPUs allocated to the slave.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.cpus_total

                                                                                                                                                                                                                                                                                                                                                            The total number of CPUs.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.cpus_used

                                                                                                                                                                                                                                                                                                                                                            The number of CPUs allocated to the slave.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.disk_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of disk space allocated to the slave.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.disk_total

                                                                                                                                                                                                                                                                                                                                                            The total disk space available.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.disk_used

                                                                                                                                                                                                                                                                                                                                                            The amount of disk space allocated to the slave.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.executors_registering

                                                                                                                                                                                                                                                                                                                                                            The number of executors registering.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.executors_running

                                                                                                                                                                                                                                                                                                                                                            The number of executors currently running.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.executors_terminated

                                                                                                                                                                                                                                                                                                                                                            The number of terminated executors.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.executors_terminating

                                                                                                                                                                                                                                                                                                                                                            The number of terminating executors.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.frameworks_active

                                                                                                                                                                                                                                                                                                                                                            The number of active frameworks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.invalid_framework_messages

                                                                                                                                                                                                                                                                                                                                                            The number of invalid framework messages.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.invalid_status_updates

                                                                                                                                                                                                                                                                                                                                                            The number of invalid status updates.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.mem_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of memory allocated to the slave.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.mem_total

                                                                                                                                                                                                                                                                                                                                                            The total memory available.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.mem_used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory allocated to the slave.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.recovery_errors

                                                                                                                                                                                                                                                                                                                                                            The number of errors encountered during slave recovery.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_failed

                                                                                                                                                                                                                                                                                                                                                            The number of failed tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_finished

                                                                                                                                                                                                                                                                                                                                                            The number of finished tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_killed

                                                                                                                                                                                                                                                                                                                                                            The number of killed tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_lost

                                                                                                                                                                                                                                                                                                                                                            The number of lost tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_running

                                                                                                                                                                                                                                                                                                                                                            The number of running tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_staging

                                                                                                                                                                                                                                                                                                                                                            The number of staging tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.tasks_starting

                                                                                                                                                                                                                                                                                                                                                            The number of starting tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.valid_framework_messages

                                                                                                                                                                                                                                                                                                                                                            The number of valid framework messages.

                                                                                                                                                                                                                                                                                                                                                            mesos.slave.valid_status_updates

                                                                                                                                                                                                                                                                                                                                                            The number of valid status updates.

                                                                                                                                                                                                                                                                                                                                                            mesos.state.task.cpu

                                                                                                                                                                                                                                                                                                                                                            The task CPU.

                                                                                                                                                                                                                                                                                                                                                            mesos.state.task.disk

                                                                                                                                                                                                                                                                                                                                                            The disk space available for the task.

                                                                                                                                                                                                                                                                                                                                                            mesos.state.task.mem

                                                                                                                                                                                                                                                                                                                                                            The amount of memory used by the task.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.registered

                                                                                                                                                                                                                                                                                                                                                            Defines whether this slave is registered with a master.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.cpus_total

                                                                                                                                                                                                                                                                                                                                                            The total number of CPUs available.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.load_1min

                                                                                                                                                                                                                                                                                                                                                            The average load for the last minute.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.load_5min

                                                                                                                                                                                                                                                                                                                                                            The average load for the last five minutes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.load_15min

                                                                                                                                                                                                                                                                                                                                                            The average load for the last 15 minutes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.mem_free_bytes

                                                                                                                                                                                                                                                                                                                                                            The amount of free memory.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.mem_total_bytes

                                                                                                                                                                                                                                                                                                                                                            The total amount of memory.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.uptime_secs

                                                                                                                                                                                                                                                                                                                                                            The current uptime for the slave.

                                                                                                                                                                                                                                                                                                                                                            5.3.14.2 -

                                                                                                                                                                                                                                                                                                                                                            Mesos Master Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Mesos/Marathon integration information.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.cpus_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPUs allocated to the cluster.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.cpus_total

                                                                                                                                                                                                                                                                                                                                                            The total number of CPUs.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.cpus_used

                                                                                                                                                                                                                                                                                                                                                            The number of CPUs used by the cluster.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.disk_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of disk space allocated to the cluster.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.disk_total

                                                                                                                                                                                                                                                                                                                                                            The total amount of disk space.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.disk_used

                                                                                                                                                                                                                                                                                                                                                            The amount of disk space used by the cluster.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.dropped_messages

                                                                                                                                                                                                                                                                                                                                                            The number of dropped messages.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.event_queue_dispatches

                                                                                                                                                                                                                                                                                                                                                            The number of dispatches in the event queue.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.event_queue_http_requests

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP requests in the event queue.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.event_queue_messages

                                                                                                                                                                                                                                                                                                                                                            The number of messages in the event queue.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.frameworks_active

                                                                                                                                                                                                                                                                                                                                                            The number of active frameworks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.frameworks_connected

                                                                                                                                                                                                                                                                                                                                                            The number of connected frameworks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.frameworks_disconnected

                                                                                                                                                                                                                                                                                                                                                            The number of disconnected frameworks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.frameworks_inactive

                                                                                                                                                                                                                                                                                                                                                            The number of inactive frameworks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.gpus_total

                                                                                                                                                                                                                                                                                                                                                            The total number of GPUs.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.invalid_framework_to_executor_messages

                                                                                                                                                                                                                                                                                                                                                            The number of invalid messages between the framework and the executor.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.invalid_status_update_acknowledgements

                                                                                                                                                                                                                                                                                                                                                            The number of invalid status update acknowledgements.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.invalid_status_updates

                                                                                                                                                                                                                                                                                                                                                            The number of invalid framework messages.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.mem_percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of memory allocated to the cluster.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.mem_total

                                                                                                                                                                                                                                                                                                                                                            The total amount of memory available.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.mem_used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory the cluster is using.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.outstanding_offers

                                                                                                                                                                                                                                                                                                                                                            The number of outstanding resource offers.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slave_registrations

                                                                                                                                                                                                                                                                                                                                                            The number of slaves able to rejoin the cluster after a disconnect.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slave_removals

                                                                                                                                                                                                                                                                                                                                                            The number of slaves that have been removed for any reason, including maintenance.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slave_reregistrations

                                                                                                                                                                                                                                                                                                                                                            The number of slaves that have re-registered.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slave_shutdowns_canceled

                                                                                                                                                                                                                                                                                                                                                            The number of slave shutdowns processes that have been cancelled.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slave_shutdowns_scheduled

                                                                                                                                                                                                                                                                                                                                                            The number of slaves that have failed health checks and are scheduled for removal.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slaves_active

                                                                                                                                                                                                                                                                                                                                                            The number of active slaves.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slaves_connected

                                                                                                                                                                                                                                                                                                                                                            The number of connected slaves.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slaves_disconnected

                                                                                                                                                                                                                                                                                                                                                            The number of disconnected slaves.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.slaves_inactive

                                                                                                                                                                                                                                                                                                                                                            The number of inactive slaves.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_error

                                                                                                                                                                                                                                                                                                                                                            The number of cluster tasks that resulted in an error.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_failed

                                                                                                                                                                                                                                                                                                                                                            The number of failed cluster tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_finished

                                                                                                                                                                                                                                                                                                                                                            The number of completed cluster tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_killed

                                                                                                                                                                                                                                                                                                                                                            The number of killed cluster tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_lost

                                                                                                                                                                                                                                                                                                                                                            The number of lost cluster tasks.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_running

                                                                                                                                                                                                                                                                                                                                                            The number of cluster tasks currently running.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_staging

                                                                                                                                                                                                                                                                                                                                                            The number of cluster tasks currently staging.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.tasks_starting

                                                                                                                                                                                                                                                                                                                                                            The number of cluster tasks starting.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.valid_framework_to_executor_messages

                                                                                                                                                                                                                                                                                                                                                            The number of valid framework messages.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.valid_status_update_acknowledgements

                                                                                                                                                                                                                                                                                                                                                            The number of valid status update acknowledgements.

                                                                                                                                                                                                                                                                                                                                                            mesos.cluster.valid_status_updates

                                                                                                                                                                                                                                                                                                                                                            The number of valid status updates.

                                                                                                                                                                                                                                                                                                                                                            mesos.framework.cpu

                                                                                                                                                                                                                                                                                                                                                            The CPU of the Mesos framework.

                                                                                                                                                                                                                                                                                                                                                            mesos.framework.disk

                                                                                                                                                                                                                                                                                                                                                            The total disk space of the Mesos framework, measured in mebibytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.framework.mem

                                                                                                                                                                                                                                                                                                                                                            The total memory of the Mesos framework, measured in mebibytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.queued_operations

                                                                                                                                                                                                                                                                                                                                                            The number of queued operations.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.registry_size_bytes

                                                                                                                                                                                                                                                                                                                                                            The size of the Mesos registry in bytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_fetch_ms

                                                                                                                                                                                                                                                                                                                                                            The Mesos registry’s read latency, in bytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms

                                                                                                                                                                                                                                                                                                                                                            The Mesos registry’s write latency, in bytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.count

                                                                                                                                                                                                                                                                                                                                                            The Mesos registry’s write count, in bytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.max

                                                                                                                                                                                                                                                                                                                                                            The maximum write latency for the registry, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.min

                                                                                                                                                                                                                                                                                                                                                            The minimum write latency for the registry, in miliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.p50

                                                                                                                                                                                                                                                                                                                                                            The median registry write latency, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.p90

                                                                                                                                                                                                                                                                                                                                                            The 90th percentile registry write latency, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.p95

                                                                                                                                                                                                                                                                                                                                                            The 95th percentile registry write latency, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.p99

                                                                                                                                                                                                                                                                                                                                                            The 99th percentile registry write latency, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.p999

                                                                                                                                                                                                                                                                                                                                                            The 99.9th percentile registry write latency, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.registrar.state_store_ms.p9999

                                                                                                                                                                                                                                                                                                                                                            The 99.99th percentile registry write latency, in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            mesos.role.cpu

                                                                                                                                                                                                                                                                                                                                                            The CPU capacity of the configured role.

                                                                                                                                                                                                                                                                                                                                                            mesos.role.disk

                                                                                                                                                                                                                                                                                                                                                            The total disk space available to the Mesos role, in mebibytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.role.mem

                                                                                                                                                                                                                                                                                                                                                            The total memory available to the Mesos role, in mebibytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.elected

                                                                                                                                                                                                                                                                                                                                                            Defines whether this is the elected master or not.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.cpus_total

                                                                                                                                                                                                                                                                                                                                                            The total number of CPUs in the system.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.load_1min

                                                                                                                                                                                                                                                                                                                                                            The average load for the last minute.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.load_5min

                                                                                                                                                                                                                                                                                                                                                            The average load for the last five minutes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.load_15min

                                                                                                                                                                                                                                                                                                                                                            The average load for the last fifteen minutes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.mem_free_bytes

                                                                                                                                                                                                                                                                                                                                                            The total amount of free system memory, in bytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.system.mem_total_bytes

                                                                                                                                                                                                                                                                                                                                                            The total cluster memory in bytes.

                                                                                                                                                                                                                                                                                                                                                            mesos.stats.uptime_secs

                                                                                                                                                                                                                                                                                                                                                            The current uptime of the cluster.

                                                                                                                                                                                                                                                                                                                                                            5.3.14.3 -

                                                                                                                                                                                                                                                                                                                                                            Marathon Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Mesos/Marathon integration information.

                                                                                                                                                                                                                                                                                                                                                            marathon.apps

                                                                                                                                                                                                                                                                                                                                                            The total number of applications.

                                                                                                                                                                                                                                                                                                                                                            marathon.backoffFactor

                                                                                                                                                                                                                                                                                                                                                            The multiplication factor for the delay between each consecutive failed task. This value is multiplied by the value of marathon.backoffSeconds each time the task fails until the maximum delay is reached, or the task succeeds.

                                                                                                                                                                                                                                                                                                                                                            marathon.backoffSeconds

                                                                                                                                                                                                                                                                                                                                                            The period of time between attempts to run a failed task. This value is multiplied by marathon.backoffFactor for each consecutive task failure, until either the task succeeds or the maximum delay is reached.

                                                                                                                                                                                                                                                                                                                                                            marathon.cpus

                                                                                                                                                                                                                                                                                                                                                            The number of CPUs configured for each application instance.

                                                                                                                                                                                                                                                                                                                                                            marathon.disk

                                                                                                                                                                                                                                                                                                                                                            The amount of disk space configured for each application instance.

                                                                                                                                                                                                                                                                                                                                                            marathon.instances

                                                                                                                                                                                                                                                                                                                                                            The number of instances of a specific application.

                                                                                                                                                                                                                                                                                                                                                            marathon.mem

                                                                                                                                                                                                                                                                                                                                                            The total amount of configured memory for each instance of a specific application.

                                                                                                                                                                                                                                                                                                                                                            marathon.tasksRunning

                                                                                                                                                                                                                                                                                                                                                            The number of tasks running for a specific application.

                                                                                                                                                                                                                                                                                                                                                            marathon.tasksStaged

                                                                                                                                                                                                                                                                                                                                                            The number of tasks staged for a specific application.

                                                                                                                                                                                                                                                                                                                                                            5.3.15 -

                                                                                                                                                                                                                                                                                                                                                            MongoDB Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: MongoDB integration information.

                                                                                                                                                                                                                                                                                                                                                            Metrics Introduced with Agent v9.7.0

                                                                                                                                                                                                                                                                                                                                                            The following metrics are supported by Sysdig Agent v9.7.0 and above.

                                                                                                                                                                                                                                                                                                                                                            Metric NameDescription
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.generic.current_allocated_bytesThe number of bytes used by the application.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.generic.heap_sizeBytes of system memory reserved by TCMalloc.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.aggressive_memory_decommitStatus of aggressive memory de-commit mode.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.central_cache_free_bytesThe number of free bytes in the central cache.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.current_total_thread_cache_bytesThe number of bytes used across all thread caches.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.max_total_thread_cache_bytesThe upper limit on the total number of bytes stored across all per-thread caches.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.pageheap_free_bytesThe number of bytes in free mapped pages in page heap.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.pageheap_unmapped_bytesThe number of bytes in free unmapped pages in page heap.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.spinlock_total_delay_nsGives the spinlock delay time.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.thread_cache_free_bytesThe number of free bytes in thread caches.
                                                                                                                                                                                                                                                                                                                                                            mongodb.tcmalloc.tcmalloc.transfer_cache_free_bytesThe number of free bytes that are waiting to be transferred between the central cache and a thread cache.

                                                                                                                                                                                                                                                                                                                                                            mongodb.asserts.msgps

                                                                                                                                                                                                                                                                                                                                                            Number of message assertions raised per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.asserts.regularps

                                                                                                                                                                                                                                                                                                                                                            Number of regular assertions raised per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.asserts.rolloversps

                                                                                                                                                                                                                                                                                                                                                            Number of times that the rollover counters roll over per second. The counters rollover to zero every 2^30 assertions.

                                                                                                                                                                                                                                                                                                                                                            mongodb.asserts.userps

                                                                                                                                                                                                                                                                                                                                                            Number of user assertions raised per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.asserts.warningps

                                                                                                                                                                                                                                                                                                                                                            Number of warnings raised per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.backgroundflushing.average_ms

                                                                                                                                                                                                                                                                                                                                                            Average time for each flush to disk.

                                                                                                                                                                                                                                                                                                                                                            mongodb.backgroundflushing.flushesps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database has flushed all writes to disk.

                                                                                                                                                                                                                                                                                                                                                            mongodb.backgroundflushing.last_ms

                                                                                                                                                                                                                                                                                                                                                            Amount of time that the last flush operation took to complete.

                                                                                                                                                                                                                                                                                                                                                            mongodb.backgroundflushing.total_ms

                                                                                                                                                                                                                                                                                                                                                            Total number of time that the `mongod` processes have spent writing (i.e. flushing) data to disk.

                                                                                                                                                                                                                                                                                                                                                            mongodb.connections.available

                                                                                                                                                                                                                                                                                                                                                            Number of unused available incoming connections the database can provide.

                                                                                                                                                                                                                                                                                                                                                            mongodb.connections.current

                                                                                                                                                                                                                                                                                                                                                            Number of connections to the database server from clients.

                                                                                                                                                                                                                                                                                                                                                            mongodb.connections.totalcreated

                                                                                                                                                                                                                                                                                                                                                            Total number of connections created.

                                                                                                                                                                                                                                                                                                                                                            mongodb.cursors.timedout

                                                                                                                                                                                                                                                                                                                                                            Total number of cursors that have timed out since the server process started.

                                                                                                                                                                                                                                                                                                                                                            mongodb.cursors.totalopen

                                                                                                                                                                                                                                                                                                                                                            Number of cursors that MongoDB is maintaining for clients

                                                                                                                                                                                                                                                                                                                                                            mongodb.dbs

                                                                                                                                                                                                                                                                                                                                                            Total number of existing databases

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.commits

                                                                                                                                                                                                                                                                                                                                                            Number of transactions written to the journal during the last journal group commit interval.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.commitsinwritelock

                                                                                                                                                                                                                                                                                                                                                            Count of the commits that occurred while a write lock was held.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.compression

                                                                                                                                                                                                                                                                                                                                                            Compression ratio of the data written to the journal.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.earlycommits

                                                                                                                                                                                                                                                                                                                                                            Number of times MongoDB requested a commit before the scheduled journal group commit interval.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.journaledmb

                                                                                                                                                                                                                                                                                                                                                            Amount of data written to journal during the last journal group commit interval.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.commits

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent for commits.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.commitsinwritelock

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent for commits that occurred while a write lock was held.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.dt

                                                                                                                                                                                                                                                                                                                                                            Amount of time over which MongoDB collected the `dur.timeMS` data.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.preplogbuffer

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent preparing to write to the journal.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.remapprivateview

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent remapping copy-on-write memory mapped views.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.writetodatafiles

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent writing to data files after journaling.

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.timems.writetojournal

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent writing to the journal

                                                                                                                                                                                                                                                                                                                                                            mongodb.dur.writetodatafilesmb

                                                                                                                                                                                                                                                                                                                                                            Amount of data written from journal to the data files during the last journal group commit interval.

                                                                                                                                                                                                                                                                                                                                                            mongodb.extra_info.page_faultsps

                                                                                                                                                                                                                                                                                                                                                            Number of page faults per second that require disk operations.

                                                                                                                                                                                                                                                                                                                                                            mongodb.fsynclocked

                                                                                                                                                                                                                                                                                                                                                            Number of fsynclocked performed on a mongo instance.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.activeclients.readers

                                                                                                                                                                                                                                                                                                                                                            Count of the active client connections performing read operations.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.activeclients.total

                                                                                                                                                                                                                                                                                                                                                            Total number of active client connections to the database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.activeclients.writers

                                                                                                                                                                                                                                                                                                                                                            Count of active client connections performing write operations.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.currentqueue.readers

                                                                                                                                                                                                                                                                                                                                                            Number of operations that are currently queued and waiting for the read lock.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.currentqueue.total

                                                                                                                                                                                                                                                                                                                                                            Total number of operations queued waiting for the lock.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.currentqueue.writers

                                                                                                                                                                                                                                                                                                                                                            Number of operations that are currently queued and waiting for the write lock.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.locktime

                                                                                                                                                                                                                                                                                                                                                            Time since the database last started that the globalLock has been held.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.ratio

                                                                                                                                                                                                                                                                                                                                                            Ratio of the time that the globalLock has been held to the total time since it was created.

                                                                                                                                                                                                                                                                                                                                                            mongodb.globallock.totaltime

                                                                                                                                                                                                                                                                                                                                                            Time since the database last started and created the global lock.

                                                                                                                                                                                                                                                                                                                                                            mongodb.indexcounters.accessesps

                                                                                                                                                                                                                                                                                                                                                            Number of times that operations have accessed indexes per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.indexcounters.hitsps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second that an index has been accessed and mongod is able to return the index from memory.

                                                                                                                                                                                                                                                                                                                                                            mongodb.indexcounters.missesps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second that an operation attempted to access an index that was not in memory.

                                                                                                                                                                                                                                                                                                                                                            mongodb.indexcounters.missratio

                                                                                                                                                                                                                                                                                                                                                            Ratio of index hits to misses.

                                                                                                                                                                                                                                                                                                                                                            mongodb.indexcounters.resetsps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second the index counters have been reset.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.acquirecount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the collection lock type was acquired in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.acquirecount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the collection lock type was acquired in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.acquirecount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the collection lock type was acquired in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.acquirecount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the collection lock type was acquired in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.acquirewaitcount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the collection lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.acquirewaitcount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the collection lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.timeacquiringmicros.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the collection lock type acquisitions in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.collection.timeacquiringmicros.sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the collection lock type acquisitions in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirecount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type was acquired in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirecount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type was acquired in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirecount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type was acquired in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirecount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type was acquired in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirewaitcount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirewaitcount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirewaitcount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.acquirewaitcount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the database lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.timeacquiringmicros.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the database lock type acquisitions in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.timeacquiringmicros.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the database lock type acquisitions in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.timeacquiringmicros.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the database lock type acquisitions in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.database.timeacquiringmicros.sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the database lock type acquisitions in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirecount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type was acquired in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirecount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type was acquired in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirecount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type was acquired in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirecount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type was acquired in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirewaitcount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirewaitcount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirewaitcount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.acquirewaitcount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the global lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.timeacquiringmicros.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the global lock type acquisitions in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.timeacquiringmicros.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the global lock type acquisitions in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.timeacquiringmicros.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the global lock type acquisitions in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.global.timeacquiringmicros.sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the global lock type acquisitions in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.metadata.acquirecount.exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the metadata lock type was acquired in the Exclusive (X) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.metadata.acquirecount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the metadata lock type was acquired in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.mmapv1journal.acquirecount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the MMAPv1 storage engine lock type was acquired in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.mmapv1journal.acquirecount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the MMAPv1 storage engine lock type was acquired in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.mmapv1journal.acquirewaitcount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the MMAPv1 storage engine lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.mmapv1journal.acquirewaitcount.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the MMAPv1 storage engine lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.mmapv1journal.timeacquiringmicros.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the MMAPv1 storage engine lock type acquisitions in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.mmapv1journal.timeacquiringmicros.intent_sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the MMAPv1 storage engine lock type acquisitions in the Intent Shared (IS) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.oplog.acquirecount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the oplog lock type was acquired in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.oplog.acquirecount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the oplog lock type was acquired in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.oplog.acquirewaitcount.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Number of times the oplog lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.oplog.acquirewaitcount.sharedps

                                                                                                                                                                                                                                                                                                                                                            Number of times the oplog lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.oplog.timeacquiringmicros.intent_exclusiveps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the oplog lock type acquisitions in the Intent Exclusive (IX) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.locks.oplog.timeacquiringmicros.sharedps

                                                                                                                                                                                                                                                                                                                                                            Wait time for the oplog lock type acquisitions in the Shared (S) mode.

                                                                                                                                                                                                                                                                                                                                                            mongodb.mem.bits

                                                                                                                                                                                                                                                                                                                                                            Size of the in-memory storage engine.

                                                                                                                                                                                                                                                                                                                                                            mongodb.mem.mapped

                                                                                                                                                                                                                                                                                                                                                            Amount of mapped memory by the database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.mem.mappedwithjournal

                                                                                                                                                                                                                                                                                                                                                            The amount of mapped memory, including the memory used for journaling.

                                                                                                                                                                                                                                                                                                                                                            mongodb.mem.resident

                                                                                                                                                                                                                                                                                                                                                            Amount of memory currently used by the database process.

                                                                                                                                                                                                                                                                                                                                                            mongodb.mem.virtual

                                                                                                                                                                                                                                                                                                                                                            Amount of virtual memory used by the mongod process.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.count.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times count failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.count.total

                                                                                                                                                                                                                                                                                                                                                            Number of times count executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.createIndexes.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times createIndexes failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.createIndexes.total

                                                                                                                                                                                                                                                                                                                                                            Number of times createIndexes executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.delete.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times delete failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.delete.total

                                                                                                                                                                                                                                                                                                                                                            Number of times delete executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.eval.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times eval failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.eval.total

                                                                                                                                                                                                                                                                                                                                                            Number of times eval executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.findAndModify.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times findAndModify failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.findAndModify.total

                                                                                                                                                                                                                                                                                                                                                            Number of times findAndModify executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.insert.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times insert failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.insert.total

                                                                                                                                                                                                                                                                                                                                                            Number of times insert executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.update.failed

                                                                                                                                                                                                                                                                                                                                                            Number of times update failed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.commands.update.total

                                                                                                                                                                                                                                                                                                                                                            Number of times update executed

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.cursor.open.notimeout

                                                                                                                                                                                                                                                                                                                                                            Number of open cursors with the option `DBQuery.Option.noTimeout` set to prevent timeout after a period of inactivity.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.cursor.open.pinned

                                                                                                                                                                                                                                                                                                                                                            Number of pinned open cursors.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.cursor.open.total

                                                                                                                                                                                                                                                                                                                                                            Number of cursors that MongoDB is maintaining for clients.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.cursor.timedoutps

                                                                                                                                                                                                                                                                                                                                                            Number of cursors that time out, per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.document.deletedps

                                                                                                                                                                                                                                                                                                                                                            Number of documents deleted per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.document.insertedps

                                                                                                                                                                                                                                                                                                                                                            Number of documents inserted per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.document.returnedps

                                                                                                                                                                                                                                                                                                                                                            Number of documents returned by queries per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.document.updatedps

                                                                                                                                                                                                                                                                                                                                                            Number of documents updated per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.getlasterror.wtime.numps

                                                                                                                                                                                                                                                                                                                                                            Number of getLastError operations per second with a specified write concern (i.e. w) that wait for one or more members of a replica set to acknowledge the write operation.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.getlasterror.wtime.totalmillisps

                                                                                                                                                                                                                                                                                                                                                            Fraction of time (ms/s) that the mongod has spent performing getLastError operations with write concern (i.e. w) that wait for one or more members of a replica set to acknowledge the write operation.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.getlasterror.wtimeoutsps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second that write concern operations have timed out as a result of the wtimeout threshold to getLastError

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.operation.fastmodps

                                                                                                                                                                                                                                                                                                                                                            Number of update operations per second that neither cause documents to grow nor require updates to the index.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.operation.idhackps

                                                                                                                                                                                                                                                                                                                                                            Number of queries per second that contain the _id field.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.operation.writeconflictsps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second that write concern operations has encounter a conflict.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.operation.scanandorderps

                                                                                                                                                                                                                                                                                                                                                            Number of queries per second that return sorted numbers that cannot perform the sort operation using an index.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.queryexecutor.scannedps

                                                                                                                                                                                                                                                                                                                                                            Number of index items scanned per second during queries and query-plan evaluation.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.record.movesps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second documents move within the on-disk representation of the MongoDB data set.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.apply.batches.numps

                                                                                                                                                                                                                                                                                                                                                            Number of batches applied across all databases per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.apply.batches.totalmillisps

                                                                                                                                                                                                                                                                                                                                                            Fraction of time (ms/s) the mongod has spent applying operations from the oplog.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.apply.opsps

                                                                                                                                                                                                                                                                                                                                                            Number of oplog operations applied per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.buffer.count

                                                                                                                                                                                                                                                                                                                                                            Number of operations in the oplog buffer.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.buffer.maxsizebytes

                                                                                                                                                                                                                                                                                                                                                            Maximum size of the buffer.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.buffer.sizebytes

                                                                                                                                                                                                                                                                                                                                                            Current size of the contents of the oplog buffer.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.network.bytesps

                                                                                                                                                                                                                                                                                                                                                            Amount of data read from the replication sync source per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.network.getmores.numps

                                                                                                                                                                                                                                                                                                                                                            Number of getmore operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.network.getmores.totalmillisps

                                                                                                                                                                                                                                                                                                                                                            Fraction of time (ms/s) required to collect data from getmore operations.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.network.opsps

                                                                                                                                                                                                                                                                                                                                                            Number of operations read from the replication source per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.network.readerscreatedps

                                                                                                                                                                                                                                                                                                                                                            Number of oplog query processes created per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.preload.docs.numps

                                                                                                                                                                                                                                                                                                                                                            Number of documents loaded during the pre-fetch stage of replication.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.preload.docs.totalmillisps

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent loading documents as part of the pre-fetch stage of replication.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.preload.indexes.numps

                                                                                                                                                                                                                                                                                                                                                            Number of index entries loaded by members before updating documents as part of the pre-fetch stage of replication.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.repl.preload.indexes.totalmillisps

                                                                                                                                                                                                                                                                                                                                                            Amount of time spent loading documents as part of the pre-fetch stage of replication.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.ttl.deleteddocumentsps

                                                                                                                                                                                                                                                                                                                                                            Number of documents deleted from collections with a ttl index per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.metrics.ttl.passesps

                                                                                                                                                                                                                                                                                                                                                            Number of times per second the background process removes documents from collections with a ttl index.

                                                                                                                                                                                                                                                                                                                                                            mongodb.network.bytesinps

                                                                                                                                                                                                                                                                                                                                                            The number of bytes that reflects the amount of network traffic received by this database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.network.bytesoutps

                                                                                                                                                                                                                                                                                                                                                            The number of bytes that reflects the amount of network traffic sent from this database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.network.numrequestsps

                                                                                                                                                                                                                                                                                                                                                            Number of distinct requests that the server has received.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcounters.commandps

                                                                                                                                                                                                                                                                                                                                                            Total number of commands per second issued to the database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcounters.deleteps

                                                                                                                                                                                                                                                                                                                                                            Number of delete operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcounters.getmoreps

                                                                                                                                                                                                                                                                                                                                                            Number of getmore operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcounters.insertps

                                                                                                                                                                                                                                                                                                                                                            Number of insert operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcounters.queryps

                                                                                                                                                                                                                                                                                                                                                            Total number of queries per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcounters.updateps

                                                                                                                                                                                                                                                                                                                                                            Number of update operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcountersrepl.commandps

                                                                                                                                                                                                                                                                                                                                                            Total number of replicated commands issued to the database per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcountersrepl.deleteps

                                                                                                                                                                                                                                                                                                                                                            Number of replicated delete operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcountersrepl.getmoreps

                                                                                                                                                                                                                                                                                                                                                            Number of replicated getmore operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcountersrepl.insertps

                                                                                                                                                                                                                                                                                                                                                            Number of replicated insert operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcountersrepl.queryps

                                                                                                                                                                                                                                                                                                                                                            Total number of replicated queries per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.opcountersrepl.updateps

                                                                                                                                                                                                                                                                                                                                                            Number of replicated update operations per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.oplog.logsizemb

                                                                                                                                                                                                                                                                                                                                                            Total size of the oplog.

                                                                                                                                                                                                                                                                                                                                                            mongodb.oplog.timediff

                                                                                                                                                                                                                                                                                                                                                            Oplog window: difference between the first and last operation in the oplog.

                                                                                                                                                                                                                                                                                                                                                            mongodb.oplog.usedsizemb

                                                                                                                                                                                                                                                                                                                                                            Total amount of space used by the oplog.

                                                                                                                                                                                                                                                                                                                                                            mongodb.replset.health

                                                                                                                                                                                                                                                                                                                                                            Member health value of the replica set: conveys if the member is up (i.e. 1) or down (i.e. 0).

                                                                                                                                                                                                                                                                                                                                                            mongodb.replset.replicationlag

                                                                                                                                                                                                                                                                                                                                                            Delay between a write operation on the primary and its copy to a secondary.

                                                                                                                                                                                                                                                                                                                                                            mongodb.replset.state

                                                                                                                                                                                                                                                                                                                                                            State of a replica that reflects its disposition within the set.

                                                                                                                                                                                                                                                                                                                                                            mongodb.replset.votefraction

                                                                                                                                                                                                                                                                                                                                                            Fraction of votes a server will cast in a replica set election.

                                                                                                                                                                                                                                                                                                                                                            mongodb.replset.votes

                                                                                                                                                                                                                                                                                                                                                            The number of votes a server will cast in a replica set election.

                                                                                                                                                                                                                                                                                                                                                            mongodb.stats.datasize

                                                                                                                                                                                                                                                                                                                                                            Total size of the data held in this database including the padding factor.

                                                                                                                                                                                                                                                                                                                                                            mongodb.stats.indexes

                                                                                                                                                                                                                                                                                                                                                            Total number of indexes across all collections in the database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.stats.indexsize

                                                                                                                                                                                                                                                                                                                                                            Total size of all indexes created on this database.

                                                                                                                                                                                                                                                                                                                                                            mongodb.stats.objects

                                                                                                                                                                                                                                                                                                                                                            Number of objects (documents) in the database across all collections.

                                                                                                                                                                                                                                                                                                                                                            mongodb.stats.storagesize

                                                                                                                                                                                                                                                                                                                                                            Total amount of space allocated to collections in this database for document storage.

                                                                                                                                                                                                                                                                                                                                                            mongodb.uptime

                                                                                                                                                                                                                                                                                                                                                            Number of seconds that the mongos or mongod process has been active.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.bytes_currently_in_cache

                                                                                                                                                                                                                                                                                                                                                            Size of the data currently in cache.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.failed_eviction_of_pages_exceeding_the_in_memory_maximumps

                                                                                                                                                                                                                                                                                                                                                            Number of failed eviction of pages that exceeded the in-memory maximum, per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.in_memory_page_splits

                                                                                                                                                                                                                                                                                                                                                            In-memory page splits.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.maximum_bytes_configured

                                                                                                                                                                                                                                                                                                                                                            Maximum cache size.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.maximum_page_size_at_eviction

                                                                                                                                                                                                                                                                                                                                                            Maximum page size at eviction.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.modified_pages_evicted

                                                                                                                                                                                                                                                                                                                                                            Number of pages, that have been modified, evicted from the cache.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.pages_currently_held_in_cache

                                                                                                                                                                                                                                                                                                                                                            Number of pages currently held in the cache.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.pages_evicted_by_application_threadsps

                                                                                                                                                                                                                                                                                                                                                            Number of page evicted by application threads per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.pages_evicted_exceeding_the_in_memory_maximumps

                                                                                                                                                                                                                                                                                                                                                            Number of pages evicted because they exceeded the cache in-memory maximum, per second.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.tracked_dirty_bytes_in_cache

                                                                                                                                                                                                                                                                                                                                                            Size of the dirty data in the cache.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.cache.unmodified_pages_evicted

                                                                                                                                                                                                                                                                                                                                                            Number of pages, that were not modified, evicted from the cache.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.concurrenttransactions.read.available

                                                                                                                                                                                                                                                                                                                                                            Number of available read tickets (concurrent transactions) remaining.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.concurrenttransactions.read.out

                                                                                                                                                                                                                                                                                                                                                            Number of read tickets (concurrent transactions) in use.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.concurrenttransactions.read.totaltickets

                                                                                                                                                                                                                                                                                                                                                            Total number of read tickets (concurrent transactions) available.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.concurrenttransactions.write.available

                                                                                                                                                                                                                                                                                                                                                            Number of available write tickets (concurrent transactions) remaining.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.concurrenttransactions.write.out

                                                                                                                                                                                                                                                                                                                                                            Number of write tickets (concurrent transactions) in use.

                                                                                                                                                                                                                                                                                                                                                            mongodb.wiredtiger.concurrenttransactions.write.totaltickets

                                                                                                                                                                                                                                                                                                                                                            Total number of write tickets (concurrent transactions) available.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.size

                                                                                                                                                                                                                                                                                                                                                            The total size in bytes of the data in the collection plus the size of every indexes on the mongodb.collection.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.avgObjSize

                                                                                                                                                                                                                                                                                                                                                            The size of the average object in the collection in bytes.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.count

                                                                                                                                                                                                                                                                                                                                                            Total number of objects in the collection.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.capped

                                                                                                                                                                                                                                                                                                                                                            Whether or not the collection is capped.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.max

                                                                                                                                                                                                                                                                                                                                                            Maximum number of documents in a capped collection.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.maxSize

                                                                                                                                                                                                                                                                                                                                                            Maximum size of a capped collection in bytes.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.storageSize

                                                                                                                                                                                                                                                                                                                                                            Total storage space allocated to this collection for document storage.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.nindexes

                                                                                                                                                                                                                                                                                                                                                            Total number of indices on the collection.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.indexSizes

                                                                                                                                                                                                                                                                                                                                                            Size of index in bytes.

                                                                                                                                                                                                                                                                                                                                                            mongodb.collection.indexes.accesses.ops

                                                                                                                                                                                                                                                                                                                                                            Number of time the index was used.

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.commands.countps

                                                                                                                                                                                                                                                                                                                                                            Number of commands per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.commands.count

                                                                                                                                                                                                                                                                                                                                                            Number of commands since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.commands.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing commands in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.getmore.countps

                                                                                                                                                                                                                                                                                                                                                            Number of getmore per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.getmore.count

                                                                                                                                                                                                                                                                                                                                                            Number of getmore since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.getmore.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing getmore in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.insert.countps

                                                                                                                                                                                                                                                                                                                                                            Number of inserts per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.insert.count

                                                                                                                                                                                                                                                                                                                                                            Number of inserts since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.insert.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing inserts in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.queries.countps

                                                                                                                                                                                                                                                                                                                                                            Number of queries per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.queries.count

                                                                                                                                                                                                                                                                                                                                                            Number of queries since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.queries.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing queries in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.readLock.countps

                                                                                                                                                                                                                                                                                                                                                            Number of read locks per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.readLock.count

                                                                                                                                                                                                                                                                                                                                                            Number of read locks since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.readLock.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing read locks in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.remove.countps

                                                                                                                                                                                                                                                                                                                                                            Number of removes per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.remove.count

                                                                                                                                                                                                                                                                                                                                                            Number of removes since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.remove.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing removes in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.total.countps

                                                                                                                                                                                                                                                                                                                                                            Number of operations per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.total.count

                                                                                                                                                                                                                                                                                                                                                            Number of operations since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.total.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing operations in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.update.countps

                                                                                                                                                                                                                                                                                                                                                            Number of updates per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.update.count

                                                                                                                                                                                                                                                                                                                                                            Number of updates since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.update.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing updates in microseconds

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.writeLock.countps

                                                                                                                                                                                                                                                                                                                                                            Number of write locks per second

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.writeLock.count

                                                                                                                                                                                                                                                                                                                                                            Number of write locks since server start (deprecated)

                                                                                                                                                                                                                                                                                                                                                            mongodb.usage.writeLock.time

                                                                                                                                                                                                                                                                                                                                                            Total time spent performing write locks in microseconds

                                                                                                                                                                                                                                                                                                                                                            5.3.16 -

                                                                                                                                                                                                                                                                                                                                                            MySQL Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: MySQL integration information.

                                                                                                                                                                                                                                                                                                                                                            mysql.galera.wsrep_cluster_size

                                                                                                                                                                                                                                                                                                                                                            The current number of nodes in the Galera cluster.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.buffer_pool_free

                                                                                                                                                                                                                                                                                                                                                            The number of free pages in the InnoDB Buffer Pool.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.buffer_pool_total

                                                                                                                                                                                                                                                                                                                                                            The total number of pages in the InnoDB Buffer Pool.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.buffer_pool_used

                                                                                                                                                                                                                                                                                                                                                            The number of used pages in the InnoDB Buffer Pool.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.buffer_pool_utilization

                                                                                                                                                                                                                                                                                                                                                            The utilization of the InnoDB Buffer Pool.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.current_row_locks

                                                                                                                                                                                                                                                                                                                                                            The number of current row locks.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.data_reads

                                                                                                                                                                                                                                                                                                                                                            The rate of data reads.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.data_writes

                                                                                                                                                                                                                                                                                                                                                            The rate of data writes.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.mutex_os_waits

                                                                                                                                                                                                                                                                                                                                                            The rate of mutex OS waits.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.mutex_spin_rounds

                                                                                                                                                                                                                                                                                                                                                            The rate of mutex spin rounds.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.mutex_spin_waits

                                                                                                                                                                                                                                                                                                                                                            The rate of mutex spin waits.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.os_log_fsyncs

                                                                                                                                                                                                                                                                                                                                                            The rate of fsync writes to the log file.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.row_lock_time

                                                                                                                                                                                                                                                                                                                                                            The fraction of time spent (ms/s) acquring row locks.

                                                                                                                                                                                                                                                                                                                                                            mysql.innodb.row_lock_waits

                                                                                                                                                                                                                                                                                                                                                            The number of times per second a row lock had to be waited for.

                                                                                                                                                                                                                                                                                                                                                            mysql.net.connections

                                                                                                                                                                                                                                                                                                                                                            The rate of connections to the server.

                                                                                                                                                                                                                                                                                                                                                            mysql.net.max_connections

                                                                                                                                                                                                                                                                                                                                                            The maximum number of connections that have been in use simultaneously since the server started.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_delete

                                                                                                                                                                                                                                                                                                                                                            The rate of delete statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_delete_multi

                                                                                                                                                                                                                                                                                                                                                            The rate of delete-multi statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_insert

                                                                                                                                                                                                                                                                                                                                                            The rate of insert statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_insert_select

                                                                                                                                                                                                                                                                                                                                                            The rate of insert-select statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_replace_select

                                                                                                                                                                                                                                                                                                                                                            The rate of replace-select statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_select

                                                                                                                                                                                                                                                                                                                                                            The rate of select statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_update

                                                                                                                                                                                                                                                                                                                                                            The rate of update statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.com_update_multi

                                                                                                                                                                                                                                                                                                                                                            The rate of update-multi.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.created_tmp_disk_tables

                                                                                                                                                                                                                                                                                                                                                            The rate of internal on-disk temporary tables created by second by the server while executing statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.created_tmp_files

                                                                                                                                                                                                                                                                                                                                                            The rate of temporary files created by second.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.created_tmp_tables

                                                                                                                                                                                                                                                                                                                                                            The rate of internal temporary tables created by second by the server while executing statements.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.kernel_time

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU time spent in kernel space by MySQL.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.key_cache_utilization

                                                                                                                                                                                                                                                                                                                                                            The key cache utilization ratio.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.open_files

                                                                                                                                                                                                                                                                                                                                                            The number of open files.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.open_tables

                                                                                                                                                                                                                                                                                                                                                            The number of of tables that are open.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.qcache_hits

                                                                                                                                                                                                                                                                                                                                                            The rate of query cache hits.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.queries

                                                                                                                                                                                                                                                                                                                                                            The rate of queries.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.questions

                                                                                                                                                                                                                                                                                                                                                            The rate of statements executed by the server.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.slow_queries

                                                                                                                                                                                                                                                                                                                                                            The rate of slow queries.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.table_locks_waited

                                                                                                                                                                                                                                                                                                                                                            The total number of times that a request for a table lock could not be granted immediately and a wait was needed.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.table_locks_waited.gauge

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.threads_connected

                                                                                                                                                                                                                                                                                                                                                            The number of currently open connections.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.threads_running

                                                                                                                                                                                                                                                                                                                                                            The number of threads that are not sleeping.

                                                                                                                                                                                                                                                                                                                                                            mysql.performance.user_time

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU time spent in user space by MySQL.

                                                                                                                                                                                                                                                                                                                                                            mysql.replication.seconds_behind_master

                                                                                                                                                                                                                                                                                                                                                            The lag in seconds between the master and the slave.

                                                                                                                                                                                                                                                                                                                                                            mysql.replication.slave_running

                                                                                                                                                                                                                                                                                                                                                            A boolean showing if this server is a replication slave that is connected to a replication master.

                                                                                                                                                                                                                                                                                                                                                            mysql.replication.slaves_connected

                                                                                                                                                                                                                                                                                                                                                            The number of slaves connected to a replication master.

                                                                                                                                                                                                                                                                                                                                                            5.3.17 -

                                                                                                                                                                                                                                                                                                                                                            NGINX and NGINX Plus Metrics

                                                                                                                                                                                                                                                                                                                                                            Contents

                                                                                                                                                                                                                                                                                                                                                            5.3.17.1 -

                                                                                                                                                                                                                                                                                                                                                            NGINX Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: NGINX and NGINX Plus integration information.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.conn_dropped_per_s

                                                                                                                                                                                                                                                                                                                                                            The rate of connections dropped.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.conn_opened_per_s

                                                                                                                                                                                                                                                                                                                                                            The rate of connections opened.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.connections

                                                                                                                                                                                                                                                                                                                                                            The total number of active connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.reading

                                                                                                                                                                                                                                                                                                                                                            The number of connections reading client requests.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.request_per_s

                                                                                                                                                                                                                                                                                                                                                            The rate of requests processed.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.waiting

                                                                                                                                                                                                                                                                                                                                                            The number of keep-alive connections waiting for work.

                                                                                                                                                                                                                                                                                                                                                            nginx.net.writing

                                                                                                                                                                                                                                                                                                                                                            The number of connections waiting on upstream responses and/or writing responses back to the client.

                                                                                                                                                                                                                                                                                                                                                            5.3.17.2 -

                                                                                                                                                                                                                                                                                                                                                            NGINX Plus Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: NGINX and NGINX Plus integration information.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.bypass.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the proxied server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.bypass.bytes_written

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes written to the cache.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.bypass.responses

                                                                                                                                                                                                                                                                                                                                                            The total number of responses from the cache.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.bypass.responses_written

                                                                                                                                                                                                                                                                                                                                                            The total number of responses written to the cache.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.cold

                                                                                                                                                                                                                                                                                                                                                            Boolean. Defines whether the cache loader process is still loading data from the disk into the cache or not.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.expired.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the proxied server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.expired.bytes_written

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes written to the cache.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.expired.responses

                                                                                                                                                                                                                                                                                                                                                            The total number of responses not taken from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.expired.responses_written

                                                                                                                                                                                                                                                                                                                                                            The total number of responses written to the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.hit.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.hit.responses

                                                                                                                                                                                                                                                                                                                                                            The total number of responses read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.max_size

                                                                                                                                                                                                                                                                                                                                                            The limit on the maximum size of the cache specified in the configuration

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.miss.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the proxied server

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.miss.bytes_written

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes written to the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.miss.responses

                                                                                                                                                                                                                                                                                                                                                            The total number of responses not taken from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.miss.responses_written

                                                                                                                                                                                                                                                                                                                                                            The total number of responses written to the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.revalidated.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.revalidated.response

                                                                                                                                                                                                                                                                                                                                                            The total number of responses read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.size

                                                                                                                                                                                                                                                                                                                                                            The current size of the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.stale.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.stale.responses

                                                                                                                                                                                                                                                                                                                                                            The total number of responses read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.updating.bytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.cache.updating.responses

                                                                                                                                                                                                                                                                                                                                                            The total number of responses read from the cache

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.connections.accepted

                                                                                                                                                                                                                                                                                                                                                            The total number of accepted client connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.connections.active

                                                                                                                                                                                                                                                                                                                                                            The current number of active client connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.connections.dropped

                                                                                                                                                                                                                                                                                                                                                            The total number of dropped client connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.connections.idle

                                                                                                                                                                                                                                                                                                                                                            The current number of idle client connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.generation

                                                                                                                                                                                                                                                                                                                                                            The total number of configuration reloads

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.load_timestamp

                                                                                                                                                                                                                                                                                                                                                            Time of the last reload of configuration (time since Epoch).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.pid

                                                                                                                                                                                                                                                                                                                                                            The ID of the worker process that handled status request.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.plus.upstream.peers.fails

                                                                                                                                                                                                                                                                                                                                                            The total number of unsuccessful attempts to communicate with the server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.ppid

                                                                                                                                                                                                                                                                                                                                                            The ID of the master process that started the worker process

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.processes.respawned

                                                                                                                                                                                                                                                                                                                                                            The total number of abnormally terminated and re-spawned child processes.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.requests.current

                                                                                                                                                                                                                                                                                                                                                            The current number of client requests.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.requests.total

                                                                                                                                                                                                                                                                                                                                                            The total number of client requests.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.discarded

                                                                                                                                                                                                                                                                                                                                                            The total number of requests completed without sending a response.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.processing

                                                                                                                                                                                                                                                                                                                                                            The number of client requests that are currently being processed.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.received

                                                                                                                                                                                                                                                                                                                                                            The total amount of data received from clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.requests

                                                                                                                                                                                                                                                                                                                                                            The total number of client requests received from clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.responses.1xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 1xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.responses.2xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 2xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.responses.3xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 3xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.responses.4xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 4xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.responses.5xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 5xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.responses.total

                                                                                                                                                                                                                                                                                                                                                            The total number of responses sent to clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.server_zone.sent

                                                                                                                                                                                                                                                                                                                                                            The total amount of data sent to clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.slab.pages.free

                                                                                                                                                                                                                                                                                                                                                            The current number of free memory pages

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.slab.pages.used

                                                                                                                                                                                                                                                                                                                                                            The current number of used memory pages

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.slab.slots.fails

                                                                                                                                                                                                                                                                                                                                                            The number of unsuccessful attempts to allocate memory of specified size

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.slab.slots.free

                                                                                                                                                                                                                                                                                                                                                            The current number of free memory slots

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.slab.slots.reqs

                                                                                                                                                                                                                                                                                                                                                            The total number of attempts to allocate memory of specified size

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.slab.slots.used

                                                                                                                                                                                                                                                                                                                                                            The current number of used memory slots

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.ssl.handshakes

                                                                                                                                                                                                                                                                                                                                                            The total number of successful SSL handshakes.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.ssl.handshakes_failed

                                                                                                                                                                                                                                                                                                                                                            The total number of failed SSL handshakes.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.ssl.session_reuses

                                                                                                                                                                                                                                                                                                                                                            The total number of session reuses during SSL handshake.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.connections

                                                                                                                                                                                                                                                                                                                                                            The total number of connections accepted from clients

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.connections

                                                                                                                                                                                                                                                                                                                                                            The total number of connections accepted from clients

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.discarded

                                                                                                                                                                                                                                                                                                                                                            The total number of requests completed without sending a response.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.discarded

                                                                                                                                                                                                                                                                                                                                                            The total number of requests completed without sending a response.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.processing

                                                                                                                                                                                                                                                                                                                                                            The number of client requests that are currently being processed.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.processing

                                                                                                                                                                                                                                                                                                                                                            The number of client requests that are currently being processed.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.received

                                                                                                                                                                                                                                                                                                                                                            The total amount of data received from clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.received

                                                                                                                                                                                                                                                                                                                                                            The total amount of data received from clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sent

                                                                                                                                                                                                                                                                                                                                                            The total amount of data sent to clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sent

                                                                                                                                                                                                                                                                                                                                                            The total amount of data sent to clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sessions.1xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 1xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sessions.2xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 2xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sessions.3xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 3xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sessions.4xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 4xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sessions.5xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 5xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.server_zone.sessions.total

                                                                                                                                                                                                                                                                                                                                                            The total number of responses sent to clients.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.active

                                                                                                                                                                                                                                                                                                                                                            The current number of connections

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.backup

                                                                                                                                                                                                                                                                                                                                                            A boolean value indicating whether the server is a backup server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.connections

                                                                                                                                                                                                                                                                                                                                                            The total number of client connections forwarded to this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.downstart

                                                                                                                                                                                                                                                                                                                                                            The time (time since Epoch) when the server became “unavail” or “checking” or “unhealthy”

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.downtime

                                                                                                                                                                                                                                                                                                                                                            Total time the server was in the “unavail” or “checking” or “unhealthy” states.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.fails

                                                                                                                                                                                                                                                                                                                                                            The total number of unsuccessful attempts to communicate with the server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.health_checks.checks

                                                                                                                                                                                                                                                                                                                                                            The total number of health check requests made.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.health_checks.fails

                                                                                                                                                                                                                                                                                                                                                            The number of failed health checks.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.health_checks.last_passed

                                                                                                                                                                                                                                                                                                                                                            Boolean indicating if the last health check request was successful and passed tests.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.health_checks.unhealthy

                                                                                                                                                                                                                                                                                                                                                            How many times the server became unhealthy (state “unhealthy”).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.id

                                                                                                                                                                                                                                                                                                                                                            The ID of the server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.received

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes received from this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.selected

                                                                                                                                                                                                                                                                                                                                                            The time (time since Epoch) when the server was last selected to process a connection.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.sent

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes sent to this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.unavail

                                                                                                                                                                                                                                                                                                                                                            How many times the server became unavailable for client connections (state “unavail”).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.peers.weight

                                                                                                                                                                                                                                                                                                                                                            Weight of the server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.stream.upstream.zombies

                                                                                                                                                                                                                                                                                                                                                            The current number of servers removed from the group but still processing active client connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.timestamp

                                                                                                                                                                                                                                                                                                                                                            Current time since Epoch.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.keepalive

                                                                                                                                                                                                                                                                                                                                                            The current number of idle keepalive connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.active

                                                                                                                                                                                                                                                                                                                                                            The current number of active connections.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.backup

                                                                                                                                                                                                                                                                                                                                                            A boolean value indicating whether the server is a backup server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.downstart

                                                                                                                                                                                                                                                                                                                                                            The time (since Epoch) when the server became “unavail” or “unhealthy”.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.downtime

                                                                                                                                                                                                                                                                                                                                                            Total time the server was in the “unavail” and “unhealthy” states.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.health_checks.checks

                                                                                                                                                                                                                                                                                                                                                            The total number of health check requests made.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.health_checks.fails

                                                                                                                                                                                                                                                                                                                                                            The number of failed health checks.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.health_checks.last_passed

                                                                                                                                                                                                                                                                                                                                                            Boolean indicating if the last health check request was successful and passed tests.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.health_checks.unhealthy

                                                                                                                                                                                                                                                                                                                                                            How many times the server became unhealthy (state “unhealthy”).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.id

                                                                                                                                                                                                                                                                                                                                                            he ID of the server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.received

                                                                                                                                                                                                                                                                                                                                                            The total amount of data received from this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.requests

                                                                                                                                                                                                                                                                                                                                                            The total number of client requests forwarded to this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.1xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 1xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.1xx_count

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 1xx status code (shown as count).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.2xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 2xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.2xx_count

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 2xx status code (shown as count).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.3xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 3xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.3xx_count

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 3xx status code (shown as count).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.4xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 4xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.4xx_count

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 4xx status code (shown as count).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.5xx

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 5xx status code.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.5xx_count

                                                                                                                                                                                                                                                                                                                                                            The number of responses with 5xx status code (shown as count).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.responses.total

                                                                                                                                                                                                                                                                                                                                                            The total number of responses obtained from this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.selected

                                                                                                                                                                                                                                                                                                                                                            The time (since Epoch) when the server was last selected to process a request (1.7.5).

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.sent

                                                                                                                                                                                                                                                                                                                                                            The total amount of data sent to this server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.unavail

                                                                                                                                                                                                                                                                                                                                                            How many times the server became unavailable for client requests (state “unavail”) due to the number of unsuccessful attempts reaching the max_fails threshold.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.upstream.peers.weight

                                                                                                                                                                                                                                                                                                                                                            The weight of the server.

                                                                                                                                                                                                                                                                                                                                                            nginx.plus.version

                                                                                                                                                                                                                                                                                                                                                            The NGINX version.

                                                                                                                                                                                                                                                                                                                                                            5.3.18 -

                                                                                                                                                                                                                                                                                                                                                            NTP Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: NTP integration information.

                                                                                                                                                                                                                                                                                                                                                            ntp.offset

                                                                                                                                                                                                                                                                                                                                                            The time difference between the local clock and the NTP reference clock, in seconds.

                                                                                                                                                                                                                                                                                                                                                            5.3.19 -

                                                                                                                                                                                                                                                                                                                                                            PGBouncer Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: PGBouncer integration information.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.cl_active

                                                                                                                                                                                                                                                                                                                                                            The number of client connections linked to a server connection and able to process queries.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.cl_waiting

                                                                                                                                                                                                                                                                                                                                                            The number of client connections waiting on a server connection.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.maxwait

                                                                                                                                                                                                                                                                                                                                                            The age of the oldest unserved client connection.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.sv_active

                                                                                                                                                                                                                                                                                                                                                            The number of server connections linked to a client connection.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.sv_idle

                                                                                                                                                                                                                                                                                                                                                            The number of server connections idle and ready for a client query.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.sv_login

                                                                                                                                                                                                                                                                                                                                                            The number of server connections currently in the process of logging in.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.sv_tested

                                                                                                                                                                                                                                                                                                                                                            The number of server connections currently running either server_reset_query or server_check_query.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.pools.sv_used

                                                                                                                                                                                                                                                                                                                                                            The number of server connections idle more than server_check_delay, needing server_check_query.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.avg_query

                                                                                                                                                                                                                                                                                                                                                            The average query duration.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.avg_recv

                                                                                                                                                                                                                                                                                                                                                            The average amount of client network traffic received.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.avg_req

                                                                                                                                                                                                                                                                                                                                                            The average number of requests per second in the last stat period.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.avg_sent

                                                                                                                                                                                                                                                                                                                                                            The average amount of client network traffic sent.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.bytes_received_per_second

                                                                                                                                                                                                                                                                                                                                                            The total network traffic received.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.bytes_sent_per_second

                                                                                                                                                                                                                                                                                                                                                            The total network traffic sent.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.requests_per_second

                                                                                                                                                                                                                                                                                                                                                            The request rate.

                                                                                                                                                                                                                                                                                                                                                            pgbouncer.stats.total_query_time

                                                                                                                                                                                                                                                                                                                                                            The time spent by PgBouncer actively querying PostgreSQL.

                                                                                                                                                                                                                                                                                                                                                            5.3.20 -

                                                                                                                                                                                                                                                                                                                                                            PHP-FPM Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: PHP-FPM integration information.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.listen_queue.size

                                                                                                                                                                                                                                                                                                                                                            The size of the socket queue of pending connections.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.processes.active

                                                                                                                                                                                                                                                                                                                                                            The total number of active processes.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.processes.idle

                                                                                                                                                                                                                                                                                                                                                            The total number of idle processes.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.processes.max_reached

                                                                                                                                                                                                                                                                                                                                                            The number of times the process limit has been reached.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.processes.total

                                                                                                                                                                                                                                                                                                                                                            The total number of processes.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.requests.accepted

                                                                                                                                                                                                                                                                                                                                                            The total number of accepted requests.

                                                                                                                                                                                                                                                                                                                                                            php_fpm.requests.slow

                                                                                                                                                                                                                                                                                                                                                            The total number of slow requests.

                                                                                                                                                                                                                                                                                                                                                            5.3.21 -

                                                                                                                                                                                                                                                                                                                                                            PostgreSQL Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: PostgreSQL integration information.

                                                                                                                                                                                                                                                                                                                                                            Metric NameTypeDescription
                                                                                                                                                                                                                                                                                                                                                            postgresql.seq_scansgaugeThe number of sequential scans initiated on this table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_scansgaugeThe number of index scans initiated on this table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_rows_fetchedgaugeThe number of live rows fetched by index scans.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_hot_updatedgaugeThe number of rows HOT updated, meaning no separate index update was needed.
                                                                                                                                                                                                                                                                                                                                                            postgresql.live_rowsgaugeThe estimated number of live rows.
                                                                                                                                                                                                                                                                                                                                                            postgresql.dead_rowsgaugeThe estimated number of dead rows.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_rows_readgaugeThe number of index entries returned by scans on this index.
                                                                                                                                                                                                                                                                                                                                                            postgresql.table_sizegaugeThe total disk space used by the specified table. Includes TOAST, free space map, and visibility map. Excludes indexes.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_sizegaugeThe total disk space used by indexes attached to the specified table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.total_sizegaugeThe total disk space used by the table, including indexes and TOAST data.
                                                                                                                                                                                                                                                                                                                                                            postgresql.heap_blocks_readgaugeThe number of disk blocks read from this table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.heap_blocks_hitgaugeThe number of buffer hits in this table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_blocks_readgaugeThe number of disk blocks read from all indexes on this table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_blocks_hitgaugeThe number of buffer hits in all indexes on this table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.toast_blocks_readgaugeThe number of disk blocks read from this table’s TOAST table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.toast_blocks_hitgaugeThe number of buffer hits in this table’s TOAST table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.toast_index_blocks_readgaugeThe number of disk blocks read from this table’s TOAST table index.
                                                                                                                                                                                                                                                                                                                                                            postgresql.toast_index_blocks_hitgaugeThe number of buffer hits in this table’s TOAST table index.
                                                                                                                                                                                                                                                                                                                                                            postgresql.active_queriesgaugeThe number of active queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.archiver.archived_countgaugeThe number of WAL files that have been successfully archived.
                                                                                                                                                                                                                                                                                                                                                            postgresql.archiver.failed_countgaugeThe number of failed attempts for archiving WAL files.
                                                                                                                                                                                                                                                                                                                                                            postgresql.before_xid_wraparoundgaugeThe number of transactions that can occur until a transaction wraparound.
                                                                                                                                                                                                                                                                                                                                                            postgresql.index_rel_rows_fetchedrateThe number of live rows fetched by index scans.
                                                                                                                                                                                                                                                                                                                                                            postgresql.transactions.idle_in_transactiongaugeThe number of ‘idle in transaction’ transactions in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.transactions.opengaugeThe number of open transactions in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.waiting_queriesgaugeThe number of waiting queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.waiting_queriesgaugeThe number of buffers allocated
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.buffers_backendgaugeThe number of buffers written directly by a backend.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.buffers_backend_fsyncgaugeThe of times a backend had to execute its own fsync call instead of the background writer.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.buffers_checkpointgaugeThe number of buffers written during checkpoints.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.buffers_cleangaugeThe number of buffers written by the background writer.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.checkpoints_requestedgaugeThe number of requested checkpoints that were performed.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.checkpoints_timedgaugeThe number of scheduled checkpoints that were performed.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.maxwritten_cleangauge.The number of times the background writer stopped a cleaning scan due to writing too many buffers.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.sync_timegaugeThe total amount of checkpoint processing time spent synchronizing files to disk.
                                                                                                                                                                                                                                                                                                                                                            postgresql.bgwriter.write_timegaugeThe total amount of checkpoint processing time spent writing files to disk.
                                                                                                                                                                                                                                                                                                                                                            postgresql.buffer_hitgaugeThe number of times disk blocks were found in the buffer cache, preventing the need to read from the database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.commitsgaugeThe number of transactions that have been committed in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.connectionsgaugeThe number of active connections to this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.database_sizegaugeThe disk space used by this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.deadlocksgaugeThe number of deadlocks detected in this database
                                                                                                                                                                                                                                                                                                                                                            postgresql.disk_readgaugeThe number of disk blocks read in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.locksgaugeThe number of locks active for this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.max_connectionsgaugeThe maximum number of client connections allowed to this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.percent_usage_connectionsgaugeThe number of connections to this database as a fraction of the maximum number of allowed connections.
                                                                                                                                                                                                                                                                                                                                                            postgresql.replication_delaygaugeThe current replication delay in seconds. Only available with PostgreSQL 9.1 and newer.
                                                                                                                                                                                                                                                                                                                                                            postgresql.replication_delay_bytesgaugeThe current replication delay in bytes. Only available with PostgreSQL 9.2 and newer.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rollbacksgaugeThe number of transactions that have been rolled back in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_deletedgaugeThe number of rows deleted by queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_fetchedgaugeThe number of rows fetched by queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_insertedgaugeThe number of rows inserted by queries in this database. The metrics can be segmented by ‘db’ or ‘table’ and can be viewed per-relation.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_returnedgaugeThe number of rows returned by queries in this database. The metrics can be segmented by ‘db’ or ‘table’ and can be viewed per-relation.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_updatedgaugeThe number of rows updated by queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.rows_deletedgaugeThe number of rows deleted by queries in this database. The metrics can be segmented by ‘db’ or ‘table’ and can be viewed per-relation.
                                                                                                                                                                                                                                                                                                                                                            postgresql.table.countgaugeThe number of user tables in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.temp_bytesgaugeThe amount of data written to temporary files by queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.temp_filesgaugeThe number of temporary files created by queries in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.toast_blocks_readgaugeThe number of disk blocks read from this table’s TOAST table.
                                                                                                                                                                                                                                                                                                                                                            postgresql.transactions.idle_in_transactiongaugeThe number of ‘idle in transaction’ transactions in this database.
                                                                                                                                                                                                                                                                                                                                                            postgresql.transactions.opengaugeThe number of open transactions in this database.

                                                                                                                                                                                                                                                                                                                                                            5.3.22 -

                                                                                                                                                                                                                                                                                                                                                            RabbitMQ Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: RabbitMQ integration information.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.connections

                                                                                                                                                                                                                                                                                                                                                            The number of current connections to a given rabbitmq vhost. Each connection is tagged as rabbitmq_vhost:<vhost_name>.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.connections.state

                                                                                                                                                                                                                                                                                                                                                            The number of connections in the specified connection state.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.ack.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered to clients and acknowledged.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.ack.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages delivered to clients and acknowledged per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.confirm.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages confirmed.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.confirm.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages confirmed per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.deliver_get.count

                                                                                                                                                                                                                                                                                                                                                            The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.deliver_get.rate

                                                                                                                                                                                                                                                                                                                                                            The rate per second of the sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.publish_in.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published from channels into this exchange.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.publish_in.rate

                                                                                                                                                                                                                                                                                                                                                            The amount of messages published from channels into this exchange per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.publish_out.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published from this exchange into queues.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.publish_out.rate

                                                                                                                                                                                                                                                                                                                                                            The amount of messages published from this exchange into queues per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.publish.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.publish.rate

                                                                                                                                                                                                                                                                                                                                                            The amount of messages published per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.redeliver.count

                                                                                                                                                                                                                                                                                                                                                            The number of subset of messages in deliver_get which had the redelivered flag set.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.redeliver.rate

                                                                                                                                                                                                                                                                                                                                                            The amount of subset of messages in deliver_get which had the redelivered flag set per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.return_unroutable.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages returned to the publisher as unroutable.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.exchange.messages.return_unroutable.rate

                                                                                                                                                                                                                                                                                                                                                            The amount of messages returned to publisher as unroutable per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.disk_alarm

                                                                                                                                                                                                                                                                                                                                                            Defines whether the node has a disk alarm configured.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.disk_free

                                                                                                                                                                                                                                                                                                                                                            The current free disk space.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.fd_used

                                                                                                                                                                                                                                                                                                                                                            Used file descriptors.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.mem_alarm

                                                                                                                                                                                                                                                                                                                                                            Defines whether the node has a memory alarm configured.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.mem_used

                                                                                                                                                                                                                                                                                                                                                            The total memory used in bytes.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.partitions

                                                                                                                                                                                                                                                                                                                                                            The number of network partitions this node is seeing.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.run_queue

                                                                                                                                                                                                                                                                                                                                                            The average number of Erlang processes waiting to run.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.running

                                                                                                                                                                                                                                                                                                                                                            Defines whether the node is running or not.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.node.sockets_used

                                                                                                                                                                                                                                                                                                                                                            The number of file descriptors used as sockets.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.ack.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered to clients and acknowledged.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.ack.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages delivered to clients and acknowledged per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.confirm.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages confirmed.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.confirm.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages confirmed per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.deliver_get.count

                                                                                                                                                                                                                                                                                                                                                            The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.deliver_get.rate

                                                                                                                                                                                                                                                                                                                                                            The rate per second of the sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.publish_in.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published from channels into this overview.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.publish_in.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages published from channels into this overview per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.publish_out.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published from this overview into queues.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.publish_out.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages published from this overview into queues per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.publish.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.publish.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages published per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.redeliver.count

                                                                                                                                                                                                                                                                                                                                                            The number of subset of messages in deliver_get which had the redelivered flag set.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.redeliver.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of subset of messages in deliver_get which had the redelivered flag set per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.return_unroutable.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages returned to publisher as unroutable.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.messages.return_unroutable.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages returned to publisher as unroutable per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.object_totals.channels

                                                                                                                                                                                                                                                                                                                                                            The total number of channels.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.object_totals.connections

                                                                                                                                                                                                                                                                                                                                                            The total number of connections.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.object_totals.consumers

                                                                                                                                                                                                                                                                                                                                                            The total number of consumers.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.object_totals.queues

                                                                                                                                                                                                                                                                                                                                                            The total number of queues.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.queue_totals.messages_ready.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages ready for delivery.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.queue_totals.messages_ready.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages ready for delivery.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.queue_totals.messages_unacknowledged.count

                                                                                                                                                                                                                                                                                                                                                            The number of unacknowledged messages.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.queue_totals.messages_unacknowledged.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of unacknowledged messages.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.queue_totals.messages.count

                                                                                                                                                                                                                                                                                                                                                            The total number of messages (ready plus unacknowledged).

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.overview.queue_totals.messages.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages (ready plus unacknowledged).

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.active_consumers

                                                                                                                                                                                                                                                                                                                                                            The number of active consumers, consumers that can immediately receive any messages sent to the queue.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.bindings.count

                                                                                                                                                                                                                                                                                                                                                            The number of bindings for a specific queue.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.consumer_utilisation

                                                                                                                                                                                                                                                                                                                                                            The ratio of time that a queue’s consumers can take new messages.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.consumers

                                                                                                                                                                                                                                                                                                                                                            The number of consumers.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.memory

                                                                                                                                                                                                                                                                                                                                                            The number of bytes of memory consumed by the Erlang process associated with the queue, including stack, heap and internal structures.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages

                                                                                                                                                                                                                                                                                                                                                            The total number of messages in the queue.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages_ready

                                                                                                                                                                                                                                                                                                                                                            The number of messages ready to be delivered to clients.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages_ready.rate

                                                                                                                                                                                                                                                                                                                                                            The number of messages ready to be delivered to clients per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages_unacknowledged

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered to clients but not yet acknowledged.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages_unacknowledged.rate

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered to clients but not yet acknowledged per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.ack.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered to clients and acknowledged.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.ack.rate

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered to clients and acknowledged per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.deliver_get.count

                                                                                                                                                                                                                                                                                                                                                            The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.deliver_get.rate

                                                                                                                                                                                                                                                                                                                                                            The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.deliver.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered in acknowledgement mode to consumers.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.deliver.rate

                                                                                                                                                                                                                                                                                                                                                            The number of messages delivered in acknowledgement mode to consumers.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.publish.count

                                                                                                                                                                                                                                                                                                                                                            The number of messages published.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.publish.rate

                                                                                                                                                                                                                                                                                                                                                            The rate of messages published per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.rate

                                                                                                                                                                                                                                                                                                                                                            The total number of messages in the queue per second.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.redeliver.count

                                                                                                                                                                                                                                                                                                                                                            The number of subset of messages in deliver_get which had the redelivered flag set.

                                                                                                                                                                                                                                                                                                                                                            rabbitmq.queue.messages.redeliver.rate

                                                                                                                                                                                                                                                                                                                                                            The rate per second of subset of messages in deliver_get which had the redelivered flag set.

                                                                                                                                                                                                                                                                                                                                                            5.3.23 -

                                                                                                                                                                                                                                                                                                                                                            Supervisord Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Supervisord integration information.

                                                                                                                                                                                                                                                                                                                                                            supervisord.process.count

                                                                                                                                                                                                                                                                                                                                                            The number of supervisord monitored processes.

                                                                                                                                                                                                                                                                                                                                                            supervisord.process.uptime

                                                                                                                                                                                                                                                                                                                                                            The process uptime.

                                                                                                                                                                                                                                                                                                                                                            5.3.24 -

                                                                                                                                                                                                                                                                                                                                                            TCP Metrics

                                                                                                                                                                                                                                                                                                                                                            See TCP integration information.

                                                                                                                                                                                                                                                                                                                                                            network.tcp.response_time

                                                                                                                                                                                                                                                                                                                                                            The response time of a given host and TCP port.

                                                                                                                                                                                                                                                                                                                                                            5.3.25 -

                                                                                                                                                                                                                                                                                                                                                            Varnish Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: Varnish integration information.

                                                                                                                                                                                                                                                                                                                                                            All Varnish metrics have the type gauge except varnish.n_purgesps, which has the type rate.

                                                                                                                                                                                                                                                                                                                                                            varnish.accept_fail

                                                                                                                                                                                                                                                                                                                                                            Accept failures. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_busy

                                                                                                                                                                                                                                                                                                                                                            Maximum number of connections to a given backend.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_conn

                                                                                                                                                                                                                                                                                                                                                            Successful connections to a given backend.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_fail

                                                                                                                                                                                                                                                                                                                                                            Failed connections for a given backend.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_recycle

                                                                                                                                                                                                                                                                                                                                                            Backend connections with keep-alive that are returned to the pool of connections.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_req

                                                                                                                                                                                                                                                                                                                                                            Backend requests.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_retry

                                                                                                                                                                                                                                                                                                                                                            Backend connection retries.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_reuse

                                                                                                                                                                                                                                                                                                                                                            Recycled connections that has were reused.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_toolate

                                                                                                                                                                                                                                                                                                                                                            Backend connections closed because they were idle too long.

                                                                                                                                                                                                                                                                                                                                                            varnish.backend_unhealthy

                                                                                                                                                                                                                                                                                                                                                            Backend connections not tried because the backend was unhealthy.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans

                                                                                                                                                                                                                                                                                                                                                            Bans in system, including bans superseded by newer bans and bans already checked by the ban-lurker. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_added

                                                                                                                                                                                                                                                                                                                                                            Bans added to ban list. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_completed

                                                                                                                                                                                                                                                                                                                                                            Bans which are no longer active, either because they got checked by the ban-lurker or superseded by newer identical bans. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_deleted

                                                                                                                                                                                                                                                                                                                                                            Bans deleted from ban list. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_dups

                                                                                                                                                                                                                                                                                                                                                            Bans replaced by later identical bans. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_lurker_contention

                                                                                                                                                                                                                                                                                                                                                            Times the ban-lurker waited for lookups. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_lurker_obj_killed

                                                                                                                                                                                                                                                                                                                                                            Objects killed by ban-lurker. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_lurker_tested

                                                                                                                                                                                                                                                                                                                                                            Bans and objects tested against each other by the ban-lurker. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_lurker_tests_tested

                                                                                                                                                                                                                                                                                                                                                            Tests and objects tested against each other by the ban-lurker. ‘ban req.url == foo && req.http.host == bar’ counts as one in ‘bans_tested’ and as two in ‘bans_tests_tested’. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_obj

                                                                                                                                                                                                                                                                                                                                                            Bans which use obj.* variables. These bans can possibly be washed by the ban-lurker. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_obj_killed

                                                                                                                                                                                                                                                                                                                                                            Objects killed by bans during object lookup. This metric is only provided by varnish 4.x

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_persisted_bytes

                                                                                                                                                                                                                                                                                                                                                            Bytes used by the persisted ban lists. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_persisted_fragmentation

                                                                                                                                                                                                                                                                                                                                                            Extra bytes accumulated through dropped and completed bans in the persistent ban lists. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_req

                                                                                                                                                                                                                                                                                                                                                            Bans which use req.* variables. These bans can not be washed by the ban-lurker. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_tested

                                                                                                                                                                                                                                                                                                                                                            Bans and objects tested against each other during hash lookup. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.bans_tests_tested

                                                                                                                                                                                                                                                                                                                                                            Tests and objects tested against each other during lookup. ‘ban req.url == foo && req.http.host == bar’ counts as one in ‘bans_tested’ and as two in ‘bans_tests_tested’. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.busy_sleep

                                                                                                                                                                                                                                                                                                                                                            Requests sent to sleep without a worker thread because they found a busy object. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.busy_wakeup

                                                                                                                                                                                                                                                                                                                                                            Requests taken off the busy object sleep list and and rescheduled. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.cache_hit

                                                                                                                                                                                                                                                                                                                                                            Requests served from the cache.

                                                                                                                                                                                                                                                                                                                                                            varnish.cache_hitpass

                                                                                                                                                                                                                                                                                                                                                            Requests passed to a backend where the decision to pass them found in the cache.

                                                                                                                                                                                                                                                                                                                                                            varnish.cache_miss

                                                                                                                                                                                                                                                                                                                                                            Requests fetched from a backend server.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_conn

                                                                                                                                                                                                                                                                                                                                                            Client connections accepted. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_drop

                                                                                                                                                                                                                                                                                                                                                            Client connection dropped, no session. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_drop_late

                                                                                                                                                                                                                                                                                                                                                            Client connection dropped late. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_req

                                                                                                                                                                                                                                                                                                                                                            Parseable client requests seen.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_req_400

                                                                                                                                                                                                                                                                                                                                                            Requests that were malformed in some drastic way. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_req_411

                                                                                                                                                                                                                                                                                                                                                            Requests that were missing a Content-Length: header. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_req_413

                                                                                                                                                                                                                                                                                                                                                            Requests that were too big. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.client_req_417

                                                                                                                                                                                                                                                                                                                                                            Requests with a bad Expect: header. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.dir_dns_cache_full

                                                                                                                                                                                                                                                                                                                                                            DNS director full DNS cache. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.dir_dns_failed

                                                                                                                                                                                                                                                                                                                                                            DNS director failed lookup. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.dir_dns_hit

                                                                                                                                                                                                                                                                                                                                                            DNS director cached lookup hit. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.dir_dns_lookups

                                                                                                                                                                                                                                                                                                                                                            DNS director lookups. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.esi_errors

                                                                                                                                                                                                                                                                                                                                                            Edge Side Includes (ESI) parse errors.

                                                                                                                                                                                                                                                                                                                                                            varnish.esi_warnings

                                                                                                                                                                                                                                                                                                                                                            Edge Side Includes (ESI) parse warnings.

                                                                                                                                                                                                                                                                                                                                                            varnish.exp_mailed

                                                                                                                                                                                                                                                                                                                                                            Objects mailed to expiry thread for handling. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.exp_received

                                                                                                                                                                                                                                                                                                                                                            Objects received by expiry thread for handling. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_1xx

                                                                                                                                                                                                                                                                                                                                                            Back end response with no body because of 1XX response (Informational).

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_204

                                                                                                                                                                                                                                                                                                                                                            Back end response with no body because of 204 response (No Content).

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_304

                                                                                                                                                                                                                                                                                                                                                            Back end response with no body because of 304 response (Not Modified).

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_bad

                                                                                                                                                                                                                                                                                                                                                            Back end response’s body length could not be determined and/or had bad headers.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_chunked

                                                                                                                                                                                                                                                                                                                                                            Back end response bodies that were chunked.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_close

                                                                                                                                                                                                                                                                                                                                                            Fetch wanted close.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_eof

                                                                                                                                                                                                                                                                                                                                                            Back end response bodies with EOF.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_failed

                                                                                                                                                                                                                                                                                                                                                            Back end response fetches that failed.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_head

                                                                                                                                                                                                                                                                                                                                                            Back end HEAD requests.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_length

                                                                                                                                                                                                                                                                                                                                                            Back end response bodies with Content-Length.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_no_thread

                                                                                                                                                                                                                                                                                                                                                            Back end fetches that failed because no thread was available. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_oldhttp

                                                                                                                                                                                                                                                                                                                                                            Number of responses served by backends with http < 1.1

                                                                                                                                                                                                                                                                                                                                                            varnish.fetch_zero

                                                                                                                                                                                                                                                                                                                                                            Number of responses that have zero length.

                                                                                                                                                                                                                                                                                                                                                            varnish.hcb_insert

                                                                                                                                                                                                                                                                                                                                                            HCB inserts.

                                                                                                                                                                                                                                                                                                                                                            varnish.hcb_lock

                                                                                                                                                                                                                                                                                                                                                            HCB lookups with lock.

                                                                                                                                                                                                                                                                                                                                                            varnish.hcb_nolock

                                                                                                                                                                                                                                                                                                                                                            HCB lookups without lock.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.backend.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.backend.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.backend.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.backend.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.ban.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.ban.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.ban.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.ban.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.busyobj.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.busyobj.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.busyobj.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.cli.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.cli.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.cli.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.cli.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.exp.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.exp.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.exp.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.exp.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcb.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcb.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcb.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcb.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcl.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcl.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcl.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hcl.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.herder.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.herder.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.herder.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.herder.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hsl.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hsl.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hsl.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.hsl.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.lru.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.lru.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.lru.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.lru.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.mempool.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.mempool.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.mempool.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.nbusyobj.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.nbusyobj.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.nbusyobj.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.objhdr.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.objhdr.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.objhdr.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.objhdr.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.pipestat.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.pipestat.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.pipestat.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sess.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sess.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sess.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sessmem.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sessmem.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sessmem.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sessmem.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sma.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sma.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sma.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sma.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smf.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smf.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smf.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smf.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smp.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smp.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smp.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.smp.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sms.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sms.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sms.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.sms.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.stat.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.stat.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.stat.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.stat.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbe.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbe.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbe.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbe.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbp.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbp.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbp.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vbp.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcapace.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcapace.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcapace.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcl.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcl.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcl.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vcl.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vxid.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vxid.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.vxid.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wq.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wq.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wq.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wq.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wstat.colls

                                                                                                                                                                                                                                                                                                                                                            Collisions. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wstat.creat

                                                                                                                                                                                                                                                                                                                                                            Created locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wstat.destroy

                                                                                                                                                                                                                                                                                                                                                            Destroyed locks.

                                                                                                                                                                                                                                                                                                                                                            varnish.LCK.wstat.locks

                                                                                                                                                                                                                                                                                                                                                            Lock operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.losthdr

                                                                                                                                                                                                                                                                                                                                                            HTTP header overflows.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.allocs

                                                                                                                                                                                                                                                                                                                                                            Allocations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.frees

                                                                                                                                                                                                                                                                                                                                                            Frees. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.live

                                                                                                                                                                                                                                                                                                                                                            In use. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.pool

                                                                                                                                                                                                                                                                                                                                                            In pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.randry

                                                                                                                                                                                                                                                                                                                                                            Pool ran dry. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.recycle

                                                                                                                                                                                                                                                                                                                                                            Recycled from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.surplus

                                                                                                                                                                                                                                                                                                                                                            Too many for pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.sz_needed

                                                                                                                                                                                                                                                                                                                                                            Size allocated. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.sz_wanted

                                                                                                                                                                                                                                                                                                                                                            Size requested. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.timeout

                                                                                                                                                                                                                                                                                                                                                            Timed out from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.busyobj.toosmall

                                                                                                                                                                                                                                                                                                                                                            Too small to recycle. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.allocs

                                                                                                                                                                                                                                                                                                                                                            Allocations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.frees

                                                                                                                                                                                                                                                                                                                                                            Frees. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.live

                                                                                                                                                                                                                                                                                                                                                            In use. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.pool

                                                                                                                                                                                                                                                                                                                                                            In pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.randry

                                                                                                                                                                                                                                                                                                                                                            Pool ran dry. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.recycle

                                                                                                                                                                                                                                                                                                                                                            Recycled from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.surplus

                                                                                                                                                                                                                                                                                                                                                            Too many for pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.sz_needed

                                                                                                                                                                                                                                                                                                                                                            Size allocated. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.sz_wanted

                                                                                                                                                                                                                                                                                                                                                            Size requested. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.timeout

                                                                                                                                                                                                                                                                                                                                                            Timed out from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req0.toosmall

                                                                                                                                                                                                                                                                                                                                                            Too small to recycle. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.allocs

                                                                                                                                                                                                                                                                                                                                                            Allocations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.frees

                                                                                                                                                                                                                                                                                                                                                            Frees. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.live

                                                                                                                                                                                                                                                                                                                                                            In use. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.pool

                                                                                                                                                                                                                                                                                                                                                            In pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.randry

                                                                                                                                                                                                                                                                                                                                                            Pool ran dry. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.recycle

                                                                                                                                                                                                                                                                                                                                                            Recycled from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.surplus

                                                                                                                                                                                                                                                                                                                                                            Too many for pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.sz_needed

                                                                                                                                                                                                                                                                                                                                                            Size allocated. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.sz_wanted

                                                                                                                                                                                                                                                                                                                                                            Size requested. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.timeout

                                                                                                                                                                                                                                                                                                                                                            Timed out from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.req1.toosmall

                                                                                                                                                                                                                                                                                                                                                            Too small to recycle. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.allocs

                                                                                                                                                                                                                                                                                                                                                            Allocations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.frees

                                                                                                                                                                                                                                                                                                                                                            Frees. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.live

                                                                                                                                                                                                                                                                                                                                                            In use. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.pool

                                                                                                                                                                                                                                                                                                                                                            In pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.randry

                                                                                                                                                                                                                                                                                                                                                            Pool ran dry. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.recycle

                                                                                                                                                                                                                                                                                                                                                            Recycled from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.surplus

                                                                                                                                                                                                                                                                                                                                                            Too many for pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.sz_needed

                                                                                                                                                                                                                                                                                                                                                            Size allocated. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.sz_wanted

                                                                                                                                                                                                                                                                                                                                                            Size requested. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.timeout

                                                                                                                                                                                                                                                                                                                                                            Timed out from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess0.toosmall

                                                                                                                                                                                                                                                                                                                                                            Too small to recycle. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.allocs

                                                                                                                                                                                                                                                                                                                                                            Allocations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.frees

                                                                                                                                                                                                                                                                                                                                                            Frees. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.live

                                                                                                                                                                                                                                                                                                                                                            In use. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.pool

                                                                                                                                                                                                                                                                                                                                                            In pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.randry

                                                                                                                                                                                                                                                                                                                                                            Pool ran dry. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.recycle

                                                                                                                                                                                                                                                                                                                                                            Recycled from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.surplus

                                                                                                                                                                                                                                                                                                                                                            Too many for pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.sz_needed

                                                                                                                                                                                                                                                                                                                                                            Size allocated. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.sz_wanted

                                                                                                                                                                                                                                                                                                                                                            Size requested. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.timeout

                                                                                                                                                                                                                                                                                                                                                            Timed out from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.sess1.toosmall

                                                                                                                                                                                                                                                                                                                                                            Too small to recycle. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.allocs

                                                                                                                                                                                                                                                                                                                                                            Allocations. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.frees

                                                                                                                                                                                                                                                                                                                                                            Frees. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.live

                                                                                                                                                                                                                                                                                                                                                            In use. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.pool

                                                                                                                                                                                                                                                                                                                                                            In pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.randry

                                                                                                                                                                                                                                                                                                                                                            Pool ran dry. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.recycle

                                                                                                                                                                                                                                                                                                                                                            Recycled from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.surplus

                                                                                                                                                                                                                                                                                                                                                            Too many for pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.sz_needed

                                                                                                                                                                                                                                                                                                                                                            Size allocated. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.sz_wanted

                                                                                                                                                                                                                                                                                                                                                            Size requested. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.timeout

                                                                                                                                                                                                                                                                                                                                                            Timed out from pool. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MEMPOOL.vbc.toosmall

                                                                                                                                                                                                                                                                                                                                                            Too small to recycle. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.child_died

                                                                                                                                                                                                                                                                                                                                                            Child processes that died due to signals. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.child_dump

                                                                                                                                                                                                                                                                                                                                                            Child processes that produced core dumps. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.child_exit

                                                                                                                                                                                                                                                                                                                                                            Child processes the were cleanly stopped. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.child_panic

                                                                                                                                                                                                                                                                                                                                                            Child processes that panicked. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.child_start

                                                                                                                                                                                                                                                                                                                                                            Child processes that started. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.child_stop

                                                                                                                                                                                                                                                                                                                                                            Child processes that exited with an unexpected return code. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.MGT.uptime

                                                                                                                                                                                                                                                                                                                                                            This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_backend

                                                                                                                                                                                                                                                                                                                                                            Number of backends.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_ban

                                                                                                                                                                                                                                                                                                                                                            Active bans. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_ban_add

                                                                                                                                                                                                                                                                                                                                                            New bans added. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_ban_dups

                                                                                                                                                                                                                                                                                                                                                            Duplicate bans removed. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_ban_obj_test

                                                                                                                                                                                                                                                                                                                                                            Objects tested. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_ban_re_test

                                                                                                                                                                                                                                                                                                                                                            Regexps tested against. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_ban_retire

                                                                                                                                                                                                                                                                                                                                                            Old bans deleted. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_expired

                                                                                                                                                                                                                                                                                                                                                            Objects that expired from cache because of TTL.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_gunzip

                                                                                                                                                                                                                                                                                                                                                            Gunzip operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_gzip

                                                                                                                                                                                                                                                                                                                                                            Gzip operations.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_lru_moved

                                                                                                                                                                                                                                                                                                                                                            Move operations done on the LRU list.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_lru_nuked

                                                                                                                                                                                                                                                                                                                                                            Objects forcefully evicted from storage to make room for new objects.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_obj_purged

                                                                                                                                                                                                                                                                                                                                                            Purged objects. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_object

                                                                                                                                                                                                                                                                                                                                                            object structs made.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_objectcore

                                                                                                                                                                                                                                                                                                                                                            objectcore structs made.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_objecthead

                                                                                                                                                                                                                                                                                                                                                            objecthead structs made.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_objoverflow

                                                                                                                                                                                                                                                                                                                                                            Objects overflowing workspace. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_objsendfile

                                                                                                                                                                                                                                                                                                                                                            Objects sent with sendfile. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_objwrite

                                                                                                                                                                                                                                                                                                                                                            Objects sent with write. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_purges

                                                                                                                                                                                                                                                                                                                                                            Purges executed. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_sess

                                                                                                                                                                                                                                                                                                                                                            sess structs made. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_sess_mem

                                                                                                                                                                                                                                                                                                                                                            sess_mem structs made. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_vampireobject

                                                                                                                                                                                                                                                                                                                                                            Unresurrected objects.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_vbc

                                                                                                                                                                                                                                                                                                                                                            vbc structs made. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_vcl

                                                                                                                                                                                                                                                                                                                                                            Total VCLs loaded.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_vcl_avail

                                                                                                                                                                                                                                                                                                                                                            Available VCLs.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_vcl_discard

                                                                                                                                                                                                                                                                                                                                                            Discarded VCLs.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_waitinglist

                                                                                                                                                                                                                                                                                                                                                            waitinglist structs made.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk

                                                                                                                                                                                                                                                                                                                                                            Worker threads. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk_create

                                                                                                                                                                                                                                                                                                                                                            Worker threads created. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk_drop

                                                                                                                                                                                                                                                                                                                                                            Dropped work requests. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk_failed

                                                                                                                                                                                                                                                                                                                                                            Worker threads not created. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk_lqueue

                                                                                                                                                                                                                                                                                                                                                            Work request queue length. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk_max

                                                                                                                                                                                                                                                                                                                                                            Worker threads limited. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_wrk_queued

                                                                                                                                                                                                                                                                                                                                                            Queued work requests. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.pools

                                                                                                                                                                                                                                                                                                                                                            Thread pools. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_bodybytes

                                                                                                                                                                                                                                                                                                                                                            Total body size. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_fetch

                                                                                                                                                                                                                                                                                                                                                            Backend fetches.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_hdrbytes

                                                                                                                                                                                                                                                                                                                                                            Total header size. This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_pass

                                                                                                                                                                                                                                                                                                                                                            Passed requests.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_pipe

                                                                                                                                                                                                                                                                                                                                                            Pipe sessions seen.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_pipe_hdrbytes

                                                                                                                                                                                                                                                                                                                                                            Total request bytes received for piped sessions. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_pipe_in

                                                                                                                                                                                                                                                                                                                                                            Total number of bytes forwarded from clients in pipe sessions. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_pipe_out

                                                                                                                                                                                                                                                                                                                                                            Total number of bytes forwarded to clients in pipe sessions. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_req

                                                                                                                                                                                                                                                                                                                                                            Requests.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_req_bodybytes

                                                                                                                                                                                                                                                                                                                                                            Total request body bytes received. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_req_hdrbytes

                                                                                                                                                                                                                                                                                                                                                            Total request header bytes received. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_resp_bodybytes

                                                                                                                                                                                                                                                                                                                                                            Total response body bytes transmitted. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_resp_hdrbytes

                                                                                                                                                                                                                                                                                                                                                            Total response header bytes transmitted. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_sess

                                                                                                                                                                                                                                                                                                                                                            Client connections.

                                                                                                                                                                                                                                                                                                                                                            varnish.s_synth

                                                                                                                                                                                                                                                                                                                                                            Synthetic responses made. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_closed

                                                                                                                                                                                                                                                                                                                                                            Client connections closed.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_conn

                                                                                                                                                                                                                                                                                                                                                            Client connections accepted. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_drop

                                                                                                                                                                                                                                                                                                                                                            Client connections dropped due to lack of worker thread. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_dropped

                                                                                                                                                                                                                                                                                                                                                            Client connections dropped due to a full queue. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_fail

                                                                                                                                                                                                                                                                                                                                                            Failures to accept a TCP connection. Either the client changed its mind, or the kernel ran out of some resource like file descriptors. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_herd varnish.sess_linger

                                                                                                                                                                                                                                                                                                                                                            This metric is only provided by varnish 3.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_pipe_overflow

                                                                                                                                                                                                                                                                                                                                                            This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_pipeline varnish.sess_queued

                                                                                                                                                                                                                                                                                                                                                            Client connections queued to wait for a thread. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.sess_readahead varnish.shm_cont

                                                                                                                                                                                                                                                                                                                                                            SHM MTX contention.

                                                                                                                                                                                                                                                                                                                                                            varnish.shm_cycles

                                                                                                                                                                                                                                                                                                                                                            SHM cycles through buffer.

                                                                                                                                                                                                                                                                                                                                                            varnish.shm_flushes

                                                                                                                                                                                                                                                                                                                                                            SHM flushes due to overflow.

                                                                                                                                                                                                                                                                                                                                                            varnish.shm_records

                                                                                                                                                                                                                                                                                                                                                            SHM records.

                                                                                                                                                                                                                                                                                                                                                            varnish.shm_writes

                                                                                                                                                                                                                                                                                                                                                            SHM writes.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.c_bytes

                                                                                                                                                                                                                                                                                                                                                            Total space allocated by this storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.c_fail

                                                                                                                                                                                                                                                                                                                                                            Times the storage has failed to provide a storage segment.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.c_freed

                                                                                                                                                                                                                                                                                                                                                            Total space returned to this storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.c_req

                                                                                                                                                                                                                                                                                                                                                            Times the storage has been asked to provide a storage segment.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.g_alloc

                                                                                                                                                                                                                                                                                                                                                            Storage allocations outstanding.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.g_bytes

                                                                                                                                                                                                                                                                                                                                                            Space allocated from the storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.s0.g_space

                                                                                                                                                                                                                                                                                                                                                            Space left in the storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.c_bytes

                                                                                                                                                                                                                                                                                                                                                            Total space allocated by this storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.c_fail

                                                                                                                                                                                                                                                                                                                                                            Times the storage has failed to provide a storage segment.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.c_freed

                                                                                                                                                                                                                                                                                                                                                            Total space returned to this storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.c_req

                                                                                                                                                                                                                                                                                                                                                            Times the storage has been asked to provide a storage segment.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.g_alloc

                                                                                                                                                                                                                                                                                                                                                            Storage allocations outstanding.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.g_bytes

                                                                                                                                                                                                                                                                                                                                                            Space allocated from the storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.SMA.Transient.g_space

                                                                                                                                                                                                                                                                                                                                                            Space left in the storage.

                                                                                                                                                                                                                                                                                                                                                            varnish.sms_balloc

                                                                                                                                                                                                                                                                                                                                                            SMS space allocated.

                                                                                                                                                                                                                                                                                                                                                            varnish.sms_bfree

                                                                                                                                                                                                                                                                                                                                                            SMS space freed.

                                                                                                                                                                                                                                                                                                                                                            varnish.sms_nbytes

                                                                                                                                                                                                                                                                                                                                                            SMS outstanding space.

                                                                                                                                                                                                                                                                                                                                                            varnish.sms_nobj

                                                                                                                                                                                                                                                                                                                                                            SMS outstanding allocations.

                                                                                                                                                                                                                                                                                                                                                            varnish.sms_nreq

                                                                                                                                                                                                                                                                                                                                                            SMS allocator requests.

                                                                                                                                                                                                                                                                                                                                                            varnish.thread_queue_len

                                                                                                                                                                                                                                                                                                                                                            Length of session queue waiting for threads. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.threads

                                                                                                                                                                                                                                                                                                                                                            Number of threads. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.threads_created

                                                                                                                                                                                                                                                                                                                                                            Threads created. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.threads_destroyed

                                                                                                                                                                                                                                                                                                                                                            Threads destroyed. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.threads_failed

                                                                                                                                                                                                                                                                                                                                                            Threads that failed to get created. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.threads_limited

                                                                                                                                                                                                                                                                                                                                                            Threads that were needed but couldn’t be created because of a thread pool limit. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.uptime

                                                                                                                                                                                                                                                                                                                                                            varnish.vmods

                                                                                                                                                                                                                                                                                                                                                            Loaded VMODs. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.vsm_cooling

                                                                                                                                                                                                                                                                                                                                                            Space which will soon (max 1 minute) be freed in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.vsm_free

                                                                                                                                                                                                                                                                                                                                                            Free space in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.vsm_overflow

                                                                                                                                                                                                                                                                                                                                                            Data which does not fit in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.vsm_overflowed

                                                                                                                                                                                                                                                                                                                                                            Total data which did not fit in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.vsm_used

                                                                                                                                                                                                                                                                                                                                                            Used space in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            varnish.n_purgesps

                                                                                                                                                                                                                                                                                                                                                            Purges executed. This metric is only provided by varnish 4.x.

                                                                                                                                                                                                                                                                                                                                                            5.4 -

                                                                                                                                                                                                                                                                                                                                                            Benchmarks and Compliance

                                                                                                                                                                                                                                                                                                                                                            Compliance metrics are generated from scheduled CIS Benchmark scans that occur in Sysdig Secure. These metrics cover aggregate results of the various CIS Benchmark sections, as well as granular details about how many running containers are failing specific run-time compliance checks.

                                                                                                                                                                                                                                                                                                                                                            Contents

                                                                                                                                                                                                                                                                                                                                                            5.4.1 -

                                                                                                                                                                                                                                                                                                                                                            Docker/CIS Benchmarks

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-images-and-build-file.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of successful Docker benchmark tests run on the container images and build files.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-images-and-build-file.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of failed Docker benchmark tests run against the container images and build file.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-images-and-build-file.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of successful Docker benchmark tests run against the container images and build file.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-images-and-build-file.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of tests run against the container images and build file.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-runtime.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of successful container runtime Docker benchmark tests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-runtime.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of failed container runtime benchmark tests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-runtime.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of successful container runtime Docker benchmark tests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.container-runtime.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Docker benchmark tests run against container runtimes.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-caps-added

                                                                                                                                                                                                                                                                                                                                                            The number of containers running without kernel restrictions in place.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-maxretry-not-set

                                                                                                                                                                                                                                                                                                                                                            The number of containers configured to not limit installation retries if the initial attempt fails.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-mount-prop-shared

                                                                                                                                                                                                                                                                                                                                                            The number of containers that use mount propagation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-networking-host

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share the host’s network namespace.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-apparmor

                                                                                                                                                                                                                                                                                                                                                            The number of containers running without an AppArmor profile.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-cpu-limits

                                                                                                                                                                                                                                                                                                                                                            The number of containers running with no CPU limits configured.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-health-check

                                                                                                                                                                                                                                                                                                                                                            The number of containers that have no HEALTHCHECK instruction configured.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-mem-limits

                                                                                                                                                                                                                                                                                                                                                            The number of containers configured to run without memory limitations.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-pids-cgroup-limit

                                                                                                                                                                                                                                                                                                                                                            The number of containers that do not use a cgroup for PIDs.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-restricted-privs

                                                                                                                                                                                                                                                                                                                                                            The number of containers running that can have additional privileges configured.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-seccomp

                                                                                                                                                                                                                                                                                                                                                            The number of containers that disable the default seccomp profile.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-securityopts

                                                                                                                                                                                                                                                                                                                                                            The number of containers running without SELinux options configured.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-no-ulimit-override

                                                                                                                                                                                                                                                                                                                                                            The number of containers running that override the default ulimit.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-privileged-ports

                                                                                                                                                                                                                                                                                                                                                            The number of containers that have privileged ports mapped into them.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-root-mounted-rw

                                                                                                                                                                                                                                                                                                                                                            The number of containers that mount the host’s root filesystem with read/write privileges.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-running-privileged

                                                                                                                                                                                                                                                                                                                                                            The number of containers running with the --privileged configuration option set.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sensitive-dirs

                                                                                                                                                                                                                                                                                                                                                            The number of containers that have mounted a sensitive directory from the host.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sharing-docker-sock

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share the host’s docker socket.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sharing-host-devs

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share one or more host devices.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sharing-host-ipc-ns

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share the host’s IPC namespace.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sharing-host-pid-ns

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share the host’s PID namespace.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sharing-host-user-ns

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share the host’s user namespace.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sharing-host-uts-ns

                                                                                                                                                                                                                                                                                                                                                            The number of containers that share the host’s UTS namespace.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-sshd-docker-exec-failures

                                                                                                                                                                                                                                                                                                                                                            The number of containers running an SSH daemon.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-unexpected-cgroup

                                                                                                                                                                                                                                                                                                                                                            The number of containers running without a dedicated cgroup configured.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-using-docker0-net

                                                                                                                                                                                                                                                                                                                                                            The number of containers using the default docker bridge network docker0.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.c-wildcard-bound-port

                                                                                                                                                                                                                                                                                                                                                            The number of containers that do not bind incoming traffic to a specific interface.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of successful Docker benchmark tests run against the Docker daemon configuration.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the Docker daemon configuration that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the Docker daemon configuration that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of benchmark tests run against the Docker daemon configuration.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration-files.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of successful Docker benchmark tests run against the Docker daemon configuration files.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration-files.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the Docker daemon configuration files that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration-files.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the Docker daemon configuration files that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-daemon-configuration-files.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of benchmark tests run against the Docker daemon configuration files.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-security-operations.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of benchmark tests run against Docker security operations that were successful.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-security-operations.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against Docker security operations that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-security-operations.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against Docker security operations that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-security-operations.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of benchmark tests run against Docker security operations.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-swarm-configuration.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of benchmark tests run against the Docker swarm configuration that were successful.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-swarm-configuration.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the Docker swarm configuration that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Maxv

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-swarm-configuration.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the Docker swarm configuration that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-swarm-configuration.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of benchmark tests run against the Docker swarm configuration.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.docker-users

                                                                                                                                                                                                                                                                                                                                                            The number of user accounts with permission to access the Docker daemon socket.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.host-configuration.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of benchmark tests run against the host configuration that were successful.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.host-configuration.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the host configuration that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.host-configuration.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of benchmark tests run against the host configuration that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.host-configuration.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of benchmark tests run against the host configuration.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.img-images-using-add

                                                                                                                                                                                                                                                                                                                                                            The number of images that use the COPY function rather than the ADD function in Dockerfile.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.img-no-healthcheck

                                                                                                                                                                                                                                                                                                                                                            The number of images with no HEALTHCHECK instruction configured.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.img-running-root

                                                                                                                                                                                                                                                                                                                                                            The number of images that use the root user.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.img-update-insts-found

                                                                                                                                                                                                                                                                                                                                                            The number of images that run a package update step without a package installation step.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Docker benchmark tests run that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.score

                                                                                                                                                                                                                                                                                                                                                            The current pass/fail score for Docker benchmark tests run. The value of this metric is calculated by starting at zero, and incrementing once for every successful test, and decrementing once for every test that returns a WARN result or worse.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The total number of Docker benchmark tests that have failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The total number of Docker benchmark tests that have passed

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.docker-bench.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Docker benchmark tests that have been run.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.4.2 -

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Benchmarks

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.api-server.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the API server that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.api-server.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the API server that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.api-server.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the API server that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.api-server.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the API server.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.api-server.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the API server that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configuration-files.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the configuration files of non-master nodes that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configuration-files.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the configuration files of non-master nodes that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configuration-files.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the configuration files that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configuration-files.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the configuration files of non-master nodes.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configuration-files.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the configuration files of non-master nodes that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configure-files.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the master node configuration files that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configure-files.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the master node configuration files that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configure-files.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the master node configuration files that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configure-files.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the master node configuration files.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.configure-files.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the master node configuration files that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.controller-manager.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the controller manager that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.controller-manager.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the controller manager that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.controller-manager.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the controller manager that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.controller-manager.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the controller manager.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.controller-manager.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the controller manager that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.etcd.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the etcd key value store that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.etcd.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the etcd key value store that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.etcd.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the etcd key value store that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.etcd.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the etcd key value store.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.etcd.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the etcd key value store that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.general-security-primitives.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the security primitives that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.general-security-primitives.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the security primitives that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.general-security-primitives.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the security primitives that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.general-security-primitives.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the security primitives.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.general-security-primitives.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the security primitives that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.kubelet.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the non-master node Kubernetes agent that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.kubelet.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the non-master node Kubernetes agent that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.kubelet.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the non-master node Kubernetes agent that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.kubelet.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the non-master node Kubernetes agent.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.kubelet.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the non-master node Kubernetes agent that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.scheduler.pass_pct

                                                                                                                                                                                                                                                                                                                                                            The percentage of Kubernetes benchmark tests run on the scheduler that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.scheduler.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the scheduler that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.scheduler.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the scheduler that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.scheduler.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run on the scheduler.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.scheduler.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests run on the scheduler that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.tests_fail

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests that failed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.tests_pass

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests that passed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.tests_total

                                                                                                                                                                                                                                                                                                                                                            The total number of Kubernetes benchmark tests run.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            compliance.k8s-bench.tests_warn

                                                                                                                                                                                                                                                                                                                                                            The number of Kubernetes benchmark tests that returned a result of WARN.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.5 -

                                                                                                                                                                                                                                                                                                                                                            Containers

                                                                                                                                                                                                                                                                                                                                                            This topic introduces you to the Container metrics.

                                                                                                                                                                                                                                                                                                                                                            container.count

                                                                                                                                                                                                                                                                                                                                                            The number of containers in the infrastructure.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            The container’s identifier.

                                                                                                                                                                                                                                                                                                                                                            For Docker containers, this value is a 12 digit hex number.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            The name of the image used to run the container.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            The name of the container.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            container.type

                                                                                                                                                                                                                                                                                                                                                            The type of container (for example, Docker, LXC, or Mesos).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByContainer
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cpu.quota.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU quota a container actually used over a defined period of time.

                                                                                                                                                                                                                                                                                                                                                            CPU quotas are a common way of creating a CPU limit for a container. A container can only spend its quota of time on CPU cycles across a given time period. The default time period is 100ms.

                                                                                                                                                                                                                                                                                                                                                            Unlike CPU shares, CPU quota is a hard limit for the amount of CPU the container can use. For this reason, the CPU quota should not exceed 100% for an extended period of time. For a shorter time, containers are allowed to consume higher than the CPU quota.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.shares.count

                                                                                                                                                                                                                                                                                                                                                            The amount of CPU shares assigned to the container’s cgroup. CPU shares represent a relative weight used by the kernel to distribute CPU cycles across different containers. Each container receives its own allocation of CPU cycles, based on the ratio of share allocation for the container versus the total share allocation for all containers. For example, if an environment has three containers, each with 1024 shares, then each will receive 1/3 of the CPU cycles.

                                                                                                                                                                                                                                                                                                                                                            The default value for a container is 1024.

                                                                                                                                                                                                                                                                                                                                                            Defining a CPU shares count is a common way to create a CPU limit for a container.

                                                                                                                                                                                                                                                                                                                                                            The CPU shares count is not a hard limit. A container can consume more than its allocation, as long as the CPU has cycles that are not being consumed by the container they were originally allocated to.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.shares.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of a container’s allocated CPU shares that are used. CPU shares are a common way of creating a CPU limit for a container, as they represent a relative weight used by the kernel to distribute CPU cycles across different containers. Each container receives its own allocation of CPU cycles, according to the ratio of share count vs the total number of shares claimed by all containers. For example, in an infrastructure with three containers, each with 1024 shares, each container receives 1/3 of the CPU cycles.

                                                                                                                                                                                                                                                                                                                                                            A container can use more CPU cycles than allocated if the CPU has cycles that are not being consumed by the container they were originally allocated to. This means that the value of cpu.shares.used.percent can exceed 100%.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.limit.bytes

                                                                                                                                                                                                                                                                                                                                                            The RAM limit assigned to a container. The default value is 0.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.limit.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of the memory limit used by a container.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            swap.limit.bytes

                                                                                                                                                                                                                                                                                                                                                            The swap limit assigned to a container.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            swap.limit.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of swap limit used by the container.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6 -

                                                                                                                                                                                                                                                                                                                                                            Cloud Provider

                                                                                                                                                                                                                                                                                                                                                            At this time, all cloudProvider metrics are AWS-related.

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            The cloud provider instance account number.

                                                                                                                                                                                                                                                                                                                                                            This metric is useful if there are multiple accounts linked with Sysdig Monitor.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            The AWS Availability Zone where the entity or entities are located. Each availability zone is an isolated subsection of an AWS region. See cloudProvider.region.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.host.ip.private

                                                                                                                                                                                                                                                                                                                                                            The private IP address allocated by the cloud provider for the instance. This address can be used for communication between instances in the same network.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.host.ip.public

                                                                                                                                                                                                                                                                                                                                                            Public IP address of the selected host.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.host.name

                                                                                                                                                                                                                                                                                                                                                            The name of the host as reported by the cloud provider.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.id

                                                                                                                                                                                                                                                                                                                                                            The ID number as assigned and reported by the cloud provider.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.instance.type

                                                                                                                                                                                                                                                                                                                                                            The type of instance (for example, AWS or Rackspace).

                                                                                                                                                                                                                                                                                                                                                            This metric is extremely useful to segment instances and compare their resource usage and saturation. You can use it as a grouping criteria for the explore table to quickly explore AWS usage on a per-instance-type basis. You can also use it to compare things like CPU usage, number of requests or network utilization for different instance types.

                                                                                                                                                                                                                                                                                                                                                            Use this grouping criteria in conjunction with the host.count metric to easily create a report on how many instances of each type you have.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.name

                                                                                                                                                                                                                                                                                                                                                            The name of the instance (for example, AWS or Rackspace).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            The region the cloud provider host (or group of hosts) is located in.

                                                                                                                                                                                                                                                                                                                                                            Use this grouping criteria in conjunction with the host.count metric to easily create a report on how many instances you have in each region.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.resource.endPoint

                                                                                                                                                                                                                                                                                                                                                            The DNS name for which the resource can be accessed.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.resource.name

                                                                                                                                                                                                                                                                                                                                                            The cloud provider service name (for example, Amazon EC2 or Amazon ELB).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.resource.type

                                                                                                                                                                                                                                                                                                                                                            The cloud provider service type (for example, INSTANCE, LOAD_BALANCER, DATABASE).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.status

                                                                                                                                                                                                                                                                                                                                                            Resource status.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            5.6.1.1 -

                                                                                                                                                                                                                                                                                                                                                            Elasticache

                                                                                                                                                                                                                                                                                                                                                            Amazon ElastiCache is a cloud-caching service that increases the performance, speed, and redundancy with which applications can retrieve data by providing an in-memory database caching system.

                                                                                                                                                                                                                                                                                                                                                            aws.elasticache.CPUUtilization

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU utilization.

                                                                                                                                                                                                                                                                                                                                                            When reaching high utilization and your main workload is from read requests, scale your cache cluster out by adding read replicas. If the main workload is from write requests, scale up by using a larger cache instance type.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the ElastiCache documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverave
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elasticache.FreeableMemory

                                                                                                                                                                                                                                                                                                                                                            The amount of memory considered free, or that could be made available, for use by the node.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the ElastiCache documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elasticache.NetworkBytesIn

                                                                                                                                                                                                                                                                                                                                                            The number of bytes the host has read from the network.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the ElastiCache documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elasticache.NetworkBytesOut

                                                                                                                                                                                                                                                                                                                                                            The number of bytes the host has written to the network.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the ElastiCache documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elasticache.SwapUsage

                                                                                                                                                                                                                                                                                                                                                            The amount of swap space used on the host.

                                                                                                                                                                                                                                                                                                                                                            If swap is being utilized, the node probably needs more memory than is available and cache performance may be negatively impacted. Consider adding more nodes or using larger ones to reduce or eliminate swapping.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the ElastiCache documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6.1.2 -

                                                                                                                                                                                                                                                                                                                                                            Elastic Application Load Balancing (ALB)

                                                                                                                                                                                                                                                                                                                                                            Application Load Balancer is best suited for load balancing of HTTP and HTTPS traffic and provides advanced request routing targeted at the delivery of modern application architectures, including microservices and containers. For more information, refer to the Elastic Application Load Balancer documentation.

                                                                                                                                                                                                                                                                                                                                                            aws.alb.ActiveConnectionCount

                                                                                                                                                                                                                                                                                                                                                            The total number of concurrent TCP connections active from clients to the load balancer and from the load balancer to the targets.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.ClientTLSNegotiationErrorCount

                                                                                                                                                                                                                                                                                                                                                            The number of TLS connections initiated by the client that did not establish a session with the load balancer.

                                                                                                                                                                                                                                                                                                                                                            Possible causes include a mismatch of ciphers or protocols.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.ConsumedLCUs

                                                                                                                                                                                                                                                                                                                                                            The number of load balancer capacity units (LCU) used by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HTTPCode_ELB_4XX_Count

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP 4XX client error codes that originate form the load balancer. Client errors are generated when requests are malformed or incomplete. These requests have not been received by the target.

                                                                                                                                                                                                                                                                                                                                                            This count does not include any response codes generated by the targets.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HTTPCode_ELB_5XX_Count

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP 5XX server error codes that originate from the load balancer. Server errors are generated when requests are malformed or incomplete. These requests have not been received by the target.

                                                                                                                                                                                                                                                                                                                                                            This count does not include any response codes generated by the targets.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HTTPCode_Target_2XX_Count

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP 2XX response codes generated by the target.

                                                                                                                                                                                                                                                                                                                                                            This count does not include any response codes generated by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HTTPCode_Target_3XX_Count

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP 3XX response codes generated by the target.

                                                                                                                                                                                                                                                                                                                                                            This count does not include any response codes generated by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HTTPCode_Target_4XX_Count

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP 4XX response codes generated by the target.

                                                                                                                                                                                                                                                                                                                                                            This count does not include any response codes generated by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HTTPCode_Target_5XX_Count

                                                                                                                                                                                                                                                                                                                                                            The number of HTTP 5XX response codes generated by the target.

                                                                                                                                                                                                                                                                                                                                                            This count does not include any response codes generated by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.HealthyHostCount

                                                                                                                                                                                                                                                                                                                                                            The number of targets that are considered healthy.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.IPv6ProcessedBytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes processed by the load balancer over IPv6.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.IPv6RequestCount

                                                                                                                                                                                                                                                                                                                                                            The total number of data requested by the load balancer over IPv6.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.NewConnectionCount

                                                                                                                                                                                                                                                                                                                                                            The total number of new TCP connections established from clients to the load balancer and from the load balancer to targets.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.ProcessedBytes

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes processed by the load balancer over IPv4 and IPv6.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.RejectedConnectionCount

                                                                                                                                                                                                                                                                                                                                                            The number of connections that were rejected because the load balancer had reached its maximum number of connections.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.RequestCount

                                                                                                                                                                                                                                                                                                                                                            The number of requests processed over IPv4 and IPv6. This count only includes the requests with a response generated by a target of the load balancer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.RequestCountPerTarget

                                                                                                                                                                                                                                                                                                                                                            The average number of requests received by each target in a target group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.RuleEvaluations

                                                                                                                                                                                                                                                                                                                                                            The number of rules processed by the load balancer given a request rate averaged over an hour.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.TargetConnectionErrorCount

                                                                                                                                                                                                                                                                                                                                                            The number of connections that were not successfully established between the load balancer and target.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.TargetResponseTime

                                                                                                                                                                                                                                                                                                                                                            The time elapsed, in seconds, after the request leaves the load balancer until a response from the target is received.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.TargetTLSNegotiationErrorCount

                                                                                                                                                                                                                                                                                                                                                            The number of TLS connections initiated by the load balancer that did not establish a session with the target.

                                                                                                                                                                                                                                                                                                                                                            Possible causes include a mismatch of ciphers or protocols.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.alb.UnHealthyHostCount

                                                                                                                                                                                                                                                                                                                                                            The number of targets that are considered unhealthy.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6.1.3 -

                                                                                                                                                                                                                                                                                                                                                            Elastic Cloud Compute (EC2)

                                                                                                                                                                                                                                                                                                                                                            Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers.

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.CPUCreditBalance

                                                                                                                                                                                                                                                                                                                                                            The CPU credit balance of an instance, based on what has accrued since it started. For more information, refer to the Elastic Compute Cloud metric definition table.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.CPUCreditUsage

                                                                                                                                                                                                                                                                                                                                                            The CPU credit usage by the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.CPUUtilization

                                                                                                                                                                                                                                                                                                                                                            The percentage of allocated EC2 compute units currently in use on the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            This metric identifies the processing power required to run an application upon a selected instance.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.DiskReadBytes

                                                                                                                                                                                                                                                                                                                                                            The total bytes read from all ephemeral disks available to the instance. This metric is used to determine the volume of the data the application reads from the disk and can be used to determine the speed of the application.

                                                                                                                                                                                                                                                                                                                                                            The number reported is the number of bytes received during a specified period. For a basic (five-minute) monitoring, divide this number by 300 to find Bytes/second. For a detailed (one-minute) monitoring, divide it by 60.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.DiskReadOps

                                                                                                                                                                                                                                                                                                                                                            Total completed read operations from all ephemeral disks available to the instance in a specified period of time. For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.DiskWriteBytes

                                                                                                                                                                                                                                                                                                                                                            It is the total bytes written to all ephemeral disks available to the instance. This metric is used to determine the volume of the data the application writes to the disk and can be used to determine the speed of the application.

                                                                                                                                                                                                                                                                                                                                                            The number reported is the number of bytes received during a specified period. For a basic (five-minute) monitoring, divide this number by 300 to find Bytes/second. For a detailed (one-minute) monitoring, divide it by 60.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.DiskWriteOps

                                                                                                                                                                                                                                                                                                                                                            The completed write operations to all ephemeral disks available to the instance in a specified period of time. If your instance uses Amazon EBS volumes, see Amazon EBS Metrics. For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.NetworkIn

                                                                                                                                                                                                                                                                                                                                                            The number of bytes received on all network interfaces by the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.ec2.NetworkOut

                                                                                                                                                                                                                                                                                                                                                            The number of bytes sent out on all network interfaces by the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

                                                                                                                                                                                                                                                                                                                                                            This metric identifies the volume of outgoing network traffic to an application on a single instance.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6.1.4 -

                                                                                                                                                                                                                                                                                                                                                            Elastic Container Service (ECS)

                                                                                                                                                                                                                                                                                                                                                            Amazon Elastic Container Service (Amazon ECS) is a highly scalable, high-performance container orchestration service that supports Docker containers and allows you to easily run and scale containerized applications on AWS. Amazon ECS eliminates the need for you to install and operate your own container orchestration software, manage and scale a cluster of virtual machines, or schedule containers on those virtual machines.

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            The name of the cluster. For more information, refer to the AWS CloudFormation documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            The name of the Elastic Container Service (Amazon ECS) service. For more information, refer to the AWS CloudFormation documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            The name of the task definition family. For more information, refer to the AWS CloudFormation documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            5.6.1.5 -

                                                                                                                                                                                                                                                                                                                                                            Elastic Load Balancing (ELB)

                                                                                                                                                                                                                                                                                                                                                            Elastic Load Balancing automatically distributes incoming application traffic across multiple targets, such as Amazon EC2 instances, containers, IP addresses, and Lambda functions.

                                                                                                                                                                                                                                                                                                                                                            aws.elb.BackendConnectionErrors

                                                                                                                                                                                                                                                                                                                                                            The number of errors encountered by the load balancer while attempting to connect to your application.

                                                                                                                                                                                                                                                                                                                                                            For high error counts, look for network related issues or check that your servers are operating correctly. The ELB is having problems connecting to them.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HealthyHostCount

                                                                                                                                                                                                                                                                                                                                                            A count of the number of healthy instances that are bound to the load balancer.

                                                                                                                                                                                                                                                                                                                                                            Hosts are declared healthy if they meet the threshold for the number of consecutive health checks that are successful. Hosts that have failed more health checks than the value of the unhealthy threshold are considered unhealthy. If cross-zone is enabled, the count of the number of healthy instances is calculated for all Availability Zones.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HTTPCode_Backend_2XX

                                                                                                                                                                                                                                                                                                                                                            The count of the number of HTTP 2XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            The 2XX class status codes represent successful actions (e.g., 200-OK, 201-Created, 202-Accepted, 203-Non-Authoritative Info).

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HTTPCode_Backend_3XX

                                                                                                                                                                                                                                                                                                                                                            The count of the number of HTTP 3XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer.

                                                                                                                                                                                                                                                                                                                                                            The 3XX class status code indicates that the user agent requires action (e.g., 301-Moved Permanently, 302-Found, 305-Use Proxy, 307-Temporary Redirect).

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HTTPCode_Backend_4XX

                                                                                                                                                                                                                                                                                                                                                            The count of the number of HTTP 4XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            The 4XX class status code represents client errors (e.g., 400-Bad Request, 401-Unauthorized, 403-Forbidden, 404-Not Found).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HTTPCode_Backend_5XX

                                                                                                                                                                                                                                                                                                                                                            The count of the number of HTTP 5XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            The 5XX class status code represents back-end server errors e.g., 500-Internal Server Error, 501-Not implemented, 503-Service Unavailable).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HTTPCode_ELB_4XX

                                                                                                                                                                                                                                                                                                                                                            The count of the number of HTTP 4XX client error codes generated by the load balancer when the listener is configured to use HTTP or HTTPS protocols. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            Client errors are generated when a request is malformed or is incomplete.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.HTTPCode_ELB_5XX

                                                                                                                                                                                                                                                                                                                                                            The count of the number of HTTP 5XX server error codes generated by the load balancer when the listener is configured to use HTTP or HTTPS protocols. This metric does not include any responses generated by back-end instances.For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            The metric is reported if there are no back-end instances that are healthy or registered to the load balancer, or if the request rate exceeds the capacity of the instances or the load balancers.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.Latency

                                                                                                                                                                                                                                                                                                                                                            A measurement of the time backend requests require to process. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            Latency metrics from the ELB are good indicators of the overall performance of your application.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.RequestCount

                                                                                                                                                                                                                                                                                                                                                            The number of requests handled by the load balancer. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.SpilloverCount

                                                                                                                                                                                                                                                                                                                                                            A count of the total number of requests that were rejected due to the queue being full. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            Positive numbers indicate some requests are not being forwarded to any server. Clients are not notified that their request was dropped.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.SurgeQueueLength

                                                                                                                                                                                                                                                                                                                                                            A count of the total number of requests that are pending submission to a registered instance. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            Positive numbers indicate clients are waiting for their requests to be forwarded to a server for processing.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.elb.UnHealthyHostCount

                                                                                                                                                                                                                                                                                                                                                            The count of the number of unhealthy instances that are bound to the load balancer. For more information, refer to the Elastic Load Balancing documentation.

                                                                                                                                                                                                                                                                                                                                                            Hosts are declared healthy if they meet the threshold for the number of consecutive health checks that are successful. Hosts that have failed more health checks than the value of the unhealthy threshold are considered unhealthy.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6.1.6 -

                                                                                                                                                                                                                                                                                                                                                            DynamoDB

                                                                                                                                                                                                                                                                                                                                                            DynamoDB is a fully managed proprietary NoSQL database service that supports key-value and document data structures and is offered by Amazon as part of the Amazon Web Services portfolio. Amazon CloudWatch aggregates the DynamoDB metrics at one-minute intervals.

                                                                                                                                                                                                                                                                                                                                                            In DynamoDB, provisioned throughput requirements are specified in terms of capacity units: Read Capacity unit and Write Capacity unit. A unit of read capacity represents one strongly consistent read per second for items up to 4 KB in size. One write capacity unit represents one write per second for items up to 1 KB in size. Larger items will require more capacity. You can calculate the number of units of read and write capacity by estimating the number of reads or writes required per second and multiplying by the size of the items rounded up to the nearest KB.

                                                                                                                                                                                                                                                                                                                                                            For more information, see the Amazon DynamoDB documentation.

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ConditionalCheckFailedRequests

                                                                                                                                                                                                                                                                                                                                                            The number of failed attempts to perform conditional writes.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ConsumedReadCapacityUnits

                                                                                                                                                                                                                                                                                                                                                            The amount of read capacity units consumed over the defined time period. Amazon CloudWatch aggregates the metrics at one-minute intervals. Use the Sum aggregation to calculate the consumed throughput. For example, get the Sum value over a span of one minute, and divide it by the number of seconds in a minute (60) to calculate the average ConsumedReadCapacityUnits per second.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ConsumedWriteCapacityUnits

                                                                                                                                                                                                                                                                                                                                                            The amount of write capacity units consumed over the specified time interval. Amazon CloudWatch aggregates the metrics at one-minute intervals. Use the Sum aggregation to calculate the consumed throughput. For example, get the Sum value over a span of one minute, and divide it by the number of seconds in a minute (60) to calculate the average ConsumedWriteCapacityUnits per second.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ProvisionedReadCapacityUnits

                                                                                                                                                                                                                                                                                                                                                            The number of read capacity units provisioned for a table or a global secondary index.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ProvisionedWriteCapacityUnits

                                                                                                                                                                                                                                                                                                                                                            The number of write capacity units provisioned for a table or global secondary table.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ReadThrottleEvents

                                                                                                                                                                                                                                                                                                                                                            The number of DynamoDB requests that exceed the amount of read capacity units provisioned.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ReturnedBytes.GetRecords

                                                                                                                                                                                                                                                                                                                                                            The number of bytes returned by GetRecords operation during the specified time period.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ReturnedItemCount

                                                                                                                                                                                                                                                                                                                                                            The number of items returned by query or scan operations during the specified time period.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ReturnedRecordsCount.GetRecords

                                                                                                                                                                                                                                                                                                                                                            The number of stream records returned by the GetRecords operations during the specific period.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.SuccessfulRequestLatency

                                                                                                                                                                                                                                                                                                                                                            The number of successful requests to DynamoDB or Amazon DynamoDB Streams during the specified time period. The time period is in milliseconds.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.SystemErrors

                                                                                                                                                                                                                                                                                                                                                            The number of requests made to DynamoDB or Amazon DynamoDB Streams that resulted in an HTTP 500 status code during the specified time period. HTTP 500 usually indicates an internal service error.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.ThrottledRequests

                                                                                                                                                                                                                                                                                                                                                            The number of requests to DynamoDB that exceed the provisioned throughput limits on a resource, such as a table or an index. ThrottledRequests is incremented by one if any event within a request exceeds a provisioned throughput limit.

                                                                                                                                                                                                                                                                                                                                                            If any individual request for read or write events within the batch is throttled, ReadThrottleEvents metrics or WriteThrottleEvents metrics is incremented respectively.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.UserErrors

                                                                                                                                                                                                                                                                                                                                                            The number of requests to DynamoDB or Amazon DynamoDB Streams that returned an HTTP 400 status code during the specified time period. HTTP 400 usually indicates a client-side error.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.dynamodb.WriteThrottleEvents

                                                                                                                                                                                                                                                                                                                                                            The number of requests to DynamoDB that exceed the provisioned write capacity units for a table or a global secondary index.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6.1.7 -

                                                                                                                                                                                                                                                                                                                                                            Relational Database Service (RDS)

                                                                                                                                                                                                                                                                                                                                                            Amazon Relational Database Service (Amazon RDS) is a managed SQL database service provided by Amazon Web Services (AWS). Amazon RDS supports an array of database engines to store and organize data and helps with database management tasks, such as migration, backup, recovery, and patching.

                                                                                                                                                                                                                                                                                                                                                            aws.rds.BinLogDiskUsage

                                                                                                                                                                                                                                                                                                                                                            The amount of disk space occupied by binary logs on the master. Applies to MySQL read replicas.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.CPUUtilization

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU utilization.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.DatabaseConnections

                                                                                                                                                                                                                                                                                                                                                            The number of database connections in use.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.DiskQueueDepth

                                                                                                                                                                                                                                                                                                                                                            The number of outstanding I/Os (read/write requests) waiting to access the disk.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.FreeableMemory

                                                                                                                                                                                                                                                                                                                                                            The amount of available random access memory, in megabytes.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.FreeStorageSpace

                                                                                                                                                                                                                                                                                                                                                            The amount of available storage space in bytes.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.NetworkReceiveThroughput

                                                                                                                                                                                                                                                                                                                                                            The incoming (Receive) network traffic on the DB instance, including both customer database traffic and Amazon RDS traffic used for monitoring and replication. The metric is measured in bytes per second.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.NetworkTransmitThroughput

                                                                                                                                                                                                                                                                                                                                                            The outgoing (Transmit) network traffic on the DB instance, including both customer database traffic and Amazon RDS traffic used for monitoring and replication. The metric is measured in bytes per second.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.ReadIOPS

                                                                                                                                                                                                                                                                                                                                                            The average number of read I/O operations per second.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.ReadLatency

                                                                                                                                                                                                                                                                                                                                                            The average amount of seconds taken per read I/O operation.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.ReadThroughput

                                                                                                                                                                                                                                                                                                                                                            The average number of bytes read from disk per second.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.ReplicaLag

                                                                                                                                                                                                                                                                                                                                                            The amount of time, in nanoseconds, a Read Replica DB instance lags behind the source DB instance.

                                                                                                                                                                                                                                                                                                                                                            This metric applies to MySQL read replicas.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.SwapUsage

                                                                                                                                                                                                                                                                                                                                                            The amount of swap space used by the database, measured in megabytes.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.WriteIOPS

                                                                                                                                                                                                                                                                                                                                                            The average number of write I/O operations per second.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.WriteLatency

                                                                                                                                                                                                                                                                                                                                                            The average amount of time taken per write I/O operation.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.rds.WriteThroughput

                                                                                                                                                                                                                                                                                                                                                            The average number of bytes written to disk per second.

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Amazon Relational Database (RDS) documentation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.6.1.8 -

                                                                                                                                                                                                                                                                                                                                                            Simple Queue Service (SQS)

                                                                                                                                                                                                                                                                                                                                                            Amazon Simple Queue Service (Amazon SQS) is a pay-per-use web service for storing messages in transit between computers. Developers use SQS to build distributed applications with decoupled components without having to deal with the overhead of creating and maintaining message queues.

                                                                                                                                                                                                                                                                                                                                                            Amazon Simple Queue Service (Amazon SQS) is a pay-per-use web service for storing messages in transit between computers. Developers use SQS to build distributed applications with decoupled components without having to deal with the overhead of creating and maintaining message queues. For more information, see Amazon SQS Resources.

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.ApproximateNumberOfMessagesDelayed

                                                                                                                                                                                                                                                                                                                                                            The number of messages in the queue that are delayed or currently unavailable for reading. Messages are stuck like this when the queue is configured as a delay queue or when a message has been sent with a delay parameter.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAvg
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.ApproximateNumberOfMessagesNotVisible

                                                                                                                                                                                                                                                                                                                                                            The number of undelivered messages. These messages are still in the queue, on their way to a client (in flight), but have not yet been deleted or have not yet reached the destination.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAvg
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.ApproximateNumberOfMessagesVisible

                                                                                                                                                                                                                                                                                                                                                            The number of messages available for retrieval from the queue. These are the messages which have not yet been locked by an SQS worker.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAvg
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.NumberOfEmptyReceives

                                                                                                                                                                                                                                                                                                                                                            The number of ReceiveMessage API calls that did not return a message. This metric is populated every 5 minutes.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.NumberOfMessagesDeleted

                                                                                                                                                                                                                                                                                                                                                            The number of messages deleted from the queue. Amazon SQS considers every successful deletion that uses a valid receipt handle, including duplicate deletions, to generate the NumberOfMessagesDeleted metric. Therefore, this number could include duplicate deletions.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.NumberOfMessagesReceived

                                                                                                                                                                                                                                                                                                                                                            The number of messages returned by calls to the ReceiveMessage API action.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.NumberOfMessagesSent

                                                                                                                                                                                                                                                                                                                                                            The number of messages added to a queue.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            aws.sqs.SentMessageSize

                                                                                                                                                                                                                                                                                                                                                            The size of messages in bytes added to a queue. The SentMessageSize does not display as an available metric in the CloudWatch console until at least one message is sent to the corresponding queue.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByCloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.7 -

                                                                                                                                                                                                                                                                                                                                                            File

                                                                                                                                                                                                                                                                                                                                                            file.bytes.in

                                                                                                                                                                                                                                                                                                                                                            The number of bytes read from the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.bytes.out

                                                                                                                                                                                                                                                                                                                                                            The number of bytes written to the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.bytes.total

                                                                                                                                                                                                                                                                                                                                                            The total number of bytes written to, and read from, the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.error.open.count

                                                                                                                                                                                                                                                                                                                                                            The number of errors that occurred when opening files. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.error.total.count

                                                                                                                                                                                                                                                                                                                                                            The number of errors encountered by file system calls, such as open(), close(), and create(). By default, this metric displays the total value for the defined scope. For example, if the scope is defined as a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.iops.in

                                                                                                                                                                                                                                                                                                                                                            The number of file read operations per second. This metric is calculated by measuring the actual number of read requests made by a process. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The value of file.iops.in can differ from the value other tools show, as they are usually based on interpolating this value from the number of bytes read and written to the file system.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.iops.out

                                                                                                                                                                                                                                                                                                                                                            The number of file write operations per second. This metric is calculated by measuring the actual number of write requests made by a process. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The value of file.iops.out can differ from the value other tools show, as they are usually based on interpolating this value from the number of bytes read and written to the file system.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.iops.total

                                                                                                                                                                                                                                                                                                                                                            The number of file read and write operations per second. This metric is calculated by measuring the actual number of read/write requests made by a process. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The value of file.iops.total can differ from the value other tools show, as they are usually based on interpolating this value from the number of bytes read and written to the file system.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.name

                                                                                                                                                                                                                                                                                                                                                            The name of the file.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            file.open.count

                                                                                                                                                                                                                                                                                                                                                            The number of times the file has been opened.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.time.in

                                                                                                                                                                                                                                                                                                                                                            The time spent reading the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.time.out

                                                                                                                                                                                                                                                                                                                                                            The time spent writing in the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            file.time.total

                                                                                                                                                                                                                                                                                                                                                            The time spent during file I/O. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.8 -

                                                                                                                                                                                                                                                                                                                                                            File System

                                                                                                                                                                                                                                                                                                                                                            fs.used.percent

                                                                                                                                                                                                                                                                                                                                                            Specifies what percentage of the file system has been used.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Percent

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.free.percent

                                                                                                                                                                                                                                                                                                                                                            Specifies what percentage of the file system is free.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Percent

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.free

                                                                                                                                                                                                                                                                                                                                                            The number of bytes free in the file system.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Byte

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.used

                                                                                                                                                                                                                                                                                                                                                            The number of bytes used in the file system.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Byte

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.total

                                                                                                                                                                                                                                                                                                                                                            The size of the file system.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Byte

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.total.count

                                                                                                                                                                                                                                                                                                                                                            The number of inodes in the file system.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Integer

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.used.count

                                                                                                                                                                                                                                                                                                                                                            The number of inodes used in the file system.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Integer

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.used.percent

                                                                                                                                                                                                                                                                                                                                                            Percentage of filesystem inodes usage.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Percent

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.root.used.percent

                                                                                                                                                                                                                                                                                                                                                            Percentage of root filesystem usage.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Percent

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            fs.largest.used.percent

                                                                                                                                                                                                                                                                                                                                                            Percentage of the largest filesystem.

                                                                                                                                                                                                                                                                                                                                                            Metadata

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            Value Type

                                                                                                                                                                                                                                                                                                                                                            Percent

                                                                                                                                                                                                                                                                                                                                                            Scope

                                                                                                                                                                                                                                                                                                                                                            Host, Container

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            agent.tag

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.account.id

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.availabilityZone

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.region

                                                                                                                                                                                                                                                                                                                                                            cloudProvider.tag

                                                                                                                                                                                                                                                                                                                                                            container.id

                                                                                                                                                                                                                                                                                                                                                            container.image

                                                                                                                                                                                                                                                                                                                                                            container.name

                                                                                                                                                                                                                                                                                                                                                            ecs.clusterName

                                                                                                                                                                                                                                                                                                                                                            ecs.serviceName

                                                                                                                                                                                                                                                                                                                                                            ecs.taskFamilyName

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Rate, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregation

                                                                                                                                                                                                                                                                                                                                                            Average

                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formats

                                                                                                                                                                                                                                                                                                                                                            Average, Sum, Minimum, Maximum

                                                                                                                                                                                                                                                                                                                                                            5.9 -

                                                                                                                                                                                                                                                                                                                                                            Host

                                                                                                                                                                                                                                                                                                                                                            agent.id

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            agent.mode

                                                                                                                                                                                                                                                                                                                                                            For more information on agent modes, see Configure Agent Modes.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeString
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time Aggregationconcat
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation Formatsconcat, distinct, count
                                                                                                                                                                                                                                                                                                                                                            Default Group Aggregationconcat
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation Formatsconcat, distinct, count

                                                                                                                                                                                                                                                                                                                                                            agent.version

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            cpu.core

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.container.mappings

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.count

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            host.domain

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.hostName

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.ip.all

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.ip.private

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.ip.public

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.isClientServer

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.isInstrumented

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.isInternal

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.procList.main

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            host.uname

                                                                                                                                                                                                                                                                                                                                                            host.uname provides the following system information:

                                                                                                                                                                                                                                                                                                                                                            • kernel name

                                                                                                                                                                                                                                                                                                                                                            • kernel release number

                                                                                                                                                                                                                                                                                                                                                            • kernel version

                                                                                                                                                                                                                                                                                                                                                            • machine hardware name

                                                                                                                                                                                                                                                                                                                                                            Agents send this metric along with a number of labels that map with the uname information. host.uname is supported on agent versions 10.1 and above.

                                                                                                                                                                                                                                                                                                                                                            Metrics Details

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment BySee Segmentation Details.
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time AggregationAverage, Rate, Sum, Min, Max, Rate of Change
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group RollupAverage, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            Segmentation Details

                                                                                                                                                                                                                                                                                                                                                            The labels are given below:

                                                                                                                                                                                                                                                                                                                                                            LabelDescriptionMapping to the uname toolingExample
                                                                                                                                                                                                                                                                                                                                                            host.uname.kernel.nameThe kernel nameuname -sLinux
                                                                                                                                                                                                                                                                                                                                                            host.uname.kernel.releaseThe kernel releaseuname -r5.4.0-31-generic
                                                                                                                                                                                                                                                                                                                                                            host.uname.kernel.versionThe kernel versionuname -v#35-Ubuntu SMP Thu May 7 20:20:34 UTC 2020
                                                                                                                                                                                                                                                                                                                                                            host.machineThe hardware name of the machineuname -mx86_64

                                                                                                                                                                                                                                                                                                                                                            Example: Kernel Versions in the Infrastructure

                                                                                                                                                                                                                                                                                                                                                            The image depicts host.uname being segmented by host.uname.kernel.version. The resulting dashboard gives the distribution of kernel versions in the infrastructure.

                                                                                                                                                                                                                                                                                                                                                            Count Limits StasD Metrics

                                                                                                                                                                                                                                                                                                                                                            The count limits metrics report the upper limit of the number of metrics of the same type. The values the metrics report can be changed by modifying the dragent.yaml file.

                                                                                                                                                                                                                                                                                                                                                            Metric NameConfiguration Parameter in the dragent.yaml fileDefault Value
                                                                                                                                                                                                                                                                                                                                                            metricCount.limit.appCheckapp_checks_limit500
                                                                                                                                                                                                                                                                                                                                                            metricCount.limit.statsdstatsd.limit100
                                                                                                                                                                                                                                                                                                                                                            metricCount.limit.jmxjmx.limit500
                                                                                                                                                                                                                                                                                                                                                            metricCount.limit.prometheusprometheus.max+metrics3000

                                                                                                                                                                                                                                                                                                                                                            metricCount.appCheck

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            metricCount.jmx

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            metricCount.statsd

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            metricCount.prometheus

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            5.10 -

                                                                                                                                                                                                                                                                                                                                                            JVM

                                                                                                                                                                                                                                                                                                                                                            jvm.class.loaded

                                                                                                                                                                                                                                                                                                                                                            The number of classes currently loaded in the JVM. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            jvm.class.unloaded

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.ConcurrentMarkSweep.count

                                                                                                                                                                                                                                                                                                                                                            The number of times the Concurrent Mark-Sweep garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.ConcurrentMarkSweep.time

                                                                                                                                                                                                                                                                                                                                                            The total time the Concurrent Mark-Sweep garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.Copy.count

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.Copy.time

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.G1_Old_Generation.count

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.G1_Old_Generation.time

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.G1_Young_Generation.count

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.G1_Young_Generation.time

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.global.time

                                                                                                                                                                                                                                                                                                                                                            The total time the garbage collection has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.MarkSweepCompact.count

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.MarkSweepCompact.time

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.PS_MarkSweep.count

                                                                                                                                                                                                                                                                                                                                                            The number of times the parallel scavenge Mark-Sweep old generation garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.PS_MarkSweep.time

                                                                                                                                                                                                                                                                                                                                                            The total time the parallel scavenge Mark-Sweep old generation garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.PS_Scavenge.count

                                                                                                                                                                                                                                                                                                                                                            The number of times the parallel eden/survivor space garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.PS_Scavenge.time

                                                                                                                                                                                                                                                                                                                                                            The total time the parallel eden/survivor space garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.ParNew.count

                                                                                                                                                                                                                                                                                                                                                            The number of times the parallel garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.ParNew.time

                                                                                                                                                                                                                                                                                                                                                            The total time the parallel garbage collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.gc.scavenge.time

                                                                                                                                                                                                                                                                                                                                                            The total time the scavenge collector has run.

                                                                                                                                                                                                                                                                                                                                                            jvm.heap.committed

                                                                                                                                                                                                                                                                                                                                                            The amount of memory that is currently allocated to the JVM for heap memory. Heap memory is the storage area for Java objects. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The JVM may release memory to the system and Heap Committed could decrease below Heap Init; but Heap Committed can never increase above Heap Max.

                                                                                                                                                                                                                                                                                                                                                            jvm.heap.init

                                                                                                                                                                                                                                                                                                                                                            The initial amount of memory that the JVM requests from the operating system for heap memory during startup (defined by the –Xms option).The value of Heap Init may be undefined. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The JVM may request additional memory from the operating system and may also release memory to the system over time.

                                                                                                                                                                                                                                                                                                                                                            jvm.heap.max

                                                                                                                                                                                                                                                                                                                                                            The maximum size allocation of heap memory for the JVM (defined by the –Xmx option). By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            Any memory allocation attempt that would exceed this limit will cause an OutOfMemoryError exception to be thrown.

                                                                                                                                                                                                                                                                                                                                                            jvm.heap.used

                                                                                                                                                                                                                                                                                                                                                            The amount of allocated heap memory (ie Heap Committed) currently in use. The number of classes currently loaded in the JVM. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            Heap memory is the storage area for Java objects.

                                                                                                                                                                                                                                                                                                                                                            An object in the heap that is referenced by another object is ‘live’, and will remain in the heap as long as it continues to be referenced. Objects that are no longer referenced are garbage and will be cleared out of the heap to reclaim space.

                                                                                                                                                                                                                                                                                                                                                            jvm.heap.used.percent

                                                                                                                                                                                                                                                                                                                                                            The ratio between Heap Used and Heap Committed. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            jvm.nonHeap.committed

                                                                                                                                                                                                                                                                                                                                                            The amount of memory that is currently allocated to the JVM for non-heap memory. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            Non-heap memory is used by Java to store loaded classes and other meta-data.

                                                                                                                                                                                                                                                                                                                                                            The JVM may release memory to the system and Non-Heap Committed could decrease below Non-Heap Init; but Non-Heap Committed can never increase above Non-Heap Max.

                                                                                                                                                                                                                                                                                                                                                            jvm.nonHeap.init

                                                                                                                                                                                                                                                                                                                                                            The initial amount of memory that the JVM requests from the operating system for non-heap memory during startup. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The value of Non-Heap Init may be undefined.

                                                                                                                                                                                                                                                                                                                                                            The JVM may request additional memory from the operating system and may also release memory to the system over time.

                                                                                                                                                                                                                                                                                                                                                            jvm.nonHeap.max

                                                                                                                                                                                                                                                                                                                                                            The maximum size allocation of non-heap memory for the JVM. This memory is used by Java to store loaded classes and other meta-data. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            jvm.nonHeap.used

                                                                                                                                                                                                                                                                                                                                                            The amount of allocated non-heap memory (Non-Heap Committed) currently in use. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            Non-heap memory is used by Java to store loaded classes and other meta-data.

                                                                                                                                                                                                                                                                                                                                                            jvm.nonHeap.used.percent

                                                                                                                                                                                                                                                                                                                                                            The ratio between Non-Heap Used and Non-Heap Committed. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            jvm.thread.count

                                                                                                                                                                                                                                                                                                                                                            The current number of live daemon and non-daemon threads. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            jvm.thread.daemon

                                                                                                                                                                                                                                                                                                                                                            The current number of live daemon threads. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            Daemon threads are used for background supporting tasks and are only needed while normal threads are executing.

                                                                                                                                                                                                                                                                                                                                                            5.11 -

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            Contents

                                                                                                                                                                                                                                                                                                                                                            5.11.1 -

                                                                                                                                                                                                                                                                                                                                                            Kubernetes State

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.min

                                                                                                                                                                                                                                                                                                                                                            The lower limit for the number of pods that can be set by the Horizontal Pod Autoscaler. The default value is 1.

                                                                                                                                                                                                                                                                                                                                                            The lower limit determines the minimum number of replicas that the autoscaler can periodically adjust in a replication controller or deployment to the target specified by the user in order to match the observed average CPU utilization.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge

                                                                                                                                                                                                                                                                                                                                                            Segmented by:

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.hpa.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.id

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace.name

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.max

                                                                                                                                                                                                                                                                                                                                                            The upper limit for the number of pods that can be set by the Horizontal Pod Autoscaler. This value cannot be smaller than that of kubernetes.hpa.replicas.min.

                                                                                                                                                                                                                                                                                                                                                            The upper limit determines the maximum number of replicas that the autoscaler can periodically adjust in a replication controller or deployment to the target specified by the user in order to match the observed average CPU utilization .

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge

                                                                                                                                                                                                                                                                                                                                                            Segmented by:

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.hpa.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.id

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace.name

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.current

                                                                                                                                                                                                                                                                                                                                                            The current number of replicas of pods managed by the Horizontal Pod Autoscaler.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge

                                                                                                                                                                                                                                                                                                                                                            Segmented by:

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.hpa.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.id

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace.name

                                                                                                                                                                                                                                                                                                                                                            kubernetes.hpa.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            The desired number of replicas of pods managed by the Horizontal Pod Autoscaler.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge

                                                                                                                                                                                                                                                                                                                                                            Segmented by:

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.hpa.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.id

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace.name

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.configmaps.hard

                                                                                                                                                                                                                                                                                                                                                            The number of config maps that can be created in each Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.configmaps.used

                                                                                                                                                                                                                                                                                                                                                            The current number of config maps in each Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.cpu.hard

                                                                                                                                                                                                                                                                                                                                                            The total CPU limit across all pods in a non-terminal state in the cluster, determined by adding each pod’s CPU limit together.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.cpu.used

                                                                                                                                                                                                                                                                                                                                                            The current amount of CPU used across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.memory.hard

                                                                                                                                                                                                                                                                                                                                                            The total memory limit across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.limits.memory.used

                                                                                                                                                                                                                                                                                                                                                            The current amount of memory used across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.persistentvolumeclaims.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of persistent volume claims that can exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.persistentvolumeclaims.used

                                                                                                                                                                                                                                                                                                                                                            The current number of persistent volume claims that exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.cpu.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of CPU cores assigned in the namespace or at the resource quota scope level. Across all the pods in a non-terminal state, the sum of CPU requests cannot exceed this value.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Segmented by:

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.resourcequota

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.memory.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum memory assigned in the namespace or at the resource quota scope level. Across all the pods in a non-terminal state, the sum of memory requests cannot exceed this value

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Segmented by:

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.cluster

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.resourcequota

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.pods.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of pods in a non-terminal state that can exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.pods.used

                                                                                                                                                                                                                                                                                                                                                            The current number of pods in a non-terminal state that exists in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.replicationcontrollers.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of replication controllers that can exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.replicationcontrollers.used

                                                                                                                                                                                                                                                                                                                                                            The current number of replication controllers that can exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.cpu.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of CPU requests allowed across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.cpu.used

                                                                                                                                                                                                                                                                                                                                                            The current number of CPU requests across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.memory.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of memory requests allowed across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.memory.used

                                                                                                                                                                                                                                                                                                                                                            The current total number of memory requests across all cluster pods in a non-terminal state.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.storage.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of storage requests allowed across all persistent volume claims in the cluster.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.requests.storage.used

                                                                                                                                                                                                                                                                                                                                                            The current total number of storage requests across all persistent volume claims.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.resourcequotas.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of resource quotas that can exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.resourcequotas.used

                                                                                                                                                                                                                                                                                                                                                            The current number of resource quotas that exist in the Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.secrets.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of secrets that can exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.secrets.used

                                                                                                                                                                                                                                                                                                                                                            The current number of secrets that exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of services that can exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.used

                                                                                                                                                                                                                                                                                                                                                            The current number of services that exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.loadbalancers.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of load balancer services that can exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.loadbalancers.used

                                                                                                                                                                                                                                                                                                                                                            The current number of load balancer services that exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.nodeports.hard

                                                                                                                                                                                                                                                                                                                                                            The maximum number of node port services that can exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.resourcequota.services.nodeports.used

                                                                                                                                                                                                                                                                                                                                                            The current number of node port services that exist in the namespace.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.desired

                                                                                                                                                                                                                                                                                                                                                            The number of nodes that should be running the daemon pod.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.misscheduled

                                                                                                                                                                                                                                                                                                                                                            The number of nodes running a daemon pod but are not supposed to.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.ready

                                                                                                                                                                                                                                                                                                                                                            The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.daemonSet.pods.scheduled

                                                                                                                                                                                                                                                                                                                                                            The number of nodes that running at least one daemon pod and are supposed to.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.available

                                                                                                                                                                                                                                                                                                                                                            The number of available pods per deployment.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            The number of desired pods per deployment.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.paused

                                                                                                                                                                                                                                                                                                                                                            The number of paused pods per deployment. These pods will not be processed by the deployment controller.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.running

                                                                                                                                                                                                                                                                                                                                                            The number of running pods per deployment.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.unavailable

                                                                                                                                                                                                                                                                                                                                                            The number of unavailable pods per deployment.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.deployment.replicas.updated

                                                                                                                                                                                                                                                                                                                                                            The number of updated pods per deployment.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.completions

                                                                                                                                                                                                                                                                                                                                                            The desired number of successfully finished pods that the job should be run with.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.numFailed

                                                                                                                                                                                                                                                                                                                                                            The number of pods which reached Phase Failed.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.numSucceeded

                                                                                                                                                                                                                                                                                                                                                            The number of pods which reached Phase Succeeded.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.parallelism

                                                                                                                                                                                                                                                                                                                                                            The maximum desired number of pods that the job should run at any given time.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.job.status.active

                                                                                                                                                                                                                                                                                                                                                            The number of actively running pods.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.count

                                                                                                                                                                                                                                                                                                                                                            The number of namespaces.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.deployment.count

                                                                                                                                                                                                                                                                                                                                                            The number of deployments per namespace.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.job.count

                                                                                                                                                                                                                                                                                                                                                            The number of jobs per namespaces.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.pod.status.count

                                                                                                                                                                                                                                                                                                                                                            Supported by Sysdig Agent 9.5.0 and above.

                                                                                                                                                                                                                                                                                                                                                            The metric gives the number of pods in each aggregate state per Namespace. This is the value that the kubectl get pods command returns in the STATUS column. This metric does not represent the pod condition or the pod phase.

                                                                                                                                                                                                                                                                                                                                                            Segmentable by kubernetes.namespace.name and kubernetes.namespace.pod.status.name.

                                                                                                                                                                                                                                                                                                                                                            Due to performance implications, Sysdig Monitor shows only a subset of the pod aggregate statuses. The statuses displayed on the UI are:

                                                                                                                                                                                                                                                                                                                                                            • Evicted

                                                                                                                                                                                                                                                                                                                                                            • DeadlineExceeded

                                                                                                                                                                                                                                                                                                                                                            • Error

                                                                                                                                                                                                                                                                                                                                                            • ContainerCreating

                                                                                                                                                                                                                                                                                                                                                            • CrashLoopBackOff

                                                                                                                                                                                                                                                                                                                                                            • Pending

                                                                                                                                                                                                                                                                                                                                                            • Running

                                                                                                                                                                                                                                                                                                                                                            To view other statuses, override the default list by adding the following property in dragent.yaml

                                                                                                                                                                                                                                                                                                                                                            k8s_pod_status_reason_strings:
                                                                                                                                                                                                                                                                                                                                                              - Pending
                                                                                                                                                                                                                                                                                                                                                              - ImagePullBackOff
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.pod.running.count

                                                                                                                                                                                                                                                                                                                                                            Required: agent 9.6.0+

                                                                                                                                                                                                                                                                                                                                                            The number of all the running pods in a Namespace. The metric takes free pods also into account, that is, pods that do not belong to any controller. Therefore, its value is not the sum of (statefulset|daemonset|deployment).pod.running.count.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.pod.running.count is supported by Agent v9.6.0 and above.

                                                                                                                                                                                                                                                                                                                                                            Metric Type: Gauge

                                                                                                                                                                                                                                                                                                                                                            Segmented by: Namespace

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.replicaSet.count

                                                                                                                                                                                                                                                                                                                                                            The number of replicaSets per namespace.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.namespace.service.count

                                                                                                                                                                                                                                                                                                                                                            The number of services per namespace.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.allocatable.cpuCores

                                                                                                                                                                                                                                                                                                                                                            The CPU resources of a node that are available for scheduling.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.allocatable.memBytes

                                                                                                                                                                                                                                                                                                                                                            The memory resources of a node that are available for scheduling.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.allocatable.pods

                                                                                                                                                                                                                                                                                                                                                            The pod resources of a node that are available for scheduling.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.capacity.cpuCores

                                                                                                                                                                                                                                                                                                                                                            The maximum CPU resources of the node.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.capacity.memBytes

                                                                                                                                                                                                                                                                                                                                                            The maximum memory resources of the node.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.capacity.pods

                                                                                                                                                                                                                                                                                                                                                            The maximum number of pods of the node.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.diskPressure

                                                                                                                                                                                                                                                                                                                                                            The number of nodes with disk pressure.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.memoryPressure

                                                                                                                                                                                                                                                                                                                                                            The number of nodes with memory pressure.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.networkUnavailable

                                                                                                                                                                                                                                                                                                                                                            The number of nodes with network unavailable.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.outOfDisk

                                                                                                                                                                                                                                                                                                                                                            The number of nodes that are out of disk space.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.ready

                                                                                                                                                                                                                                                                                                                                                            The number of nodes that are ready.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.node.unschedulable

                                                                                                                                                                                                                                                                                                                                                            The number of nodes unavailable to schedule new pods.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.containers.waiting

                                                                                                                                                                                                                                                                                                                                                            The number of containers waiting for a pod.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceLimits.cpuCores

                                                                                                                                                                                                                                                                                                                                                            The limit on CPU cores to be used by a container.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceLimits.memBytes

                                                                                                                                                                                                                                                                                                                                                            The limit on memory to be used by a container in bytes.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceRequests.cpuCores

                                                                                                                                                                                                                                                                                                                                                            The number of CPU cores requested by containers in the pod.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.resourceRequests.memBytes

                                                                                                                                                                                                                                                                                                                                                            The number of memory bytes requested by containers in the pod.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.status.ready

                                                                                                                                                                                                                                                                                                                                                            The number of pods ready to serve requests.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.fullyLabeled

                                                                                                                                                                                                                                                                                                                                                            The number of fully labeled pods per ReplicaSet.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.ready

                                                                                                                                                                                                                                                                                                                                                            The number of ready pods per ReplicaSet.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.replicas

                                                                                                                                                                                                                                                                                                                                                            The desired number of pods per StatefulSet.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas

                                                                                                                                                                                                                                                                                                                                                            The total number of pods created by the StatefulSet.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas.current

                                                                                                                                                                                                                                                                                                                                                            The number of pods created by the current version of the StatefulSet.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas.ready

                                                                                                                                                                                                                                                                                                                                                            The number of ready pods created by this StatefulSet.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.statefulset.status.replicas.updated

                                                                                                                                                                                                                                                                                                                                                            The number of pods updated to the new version of this StatefulSet.

                                                                                                                                                                                                                                                                                                                                                            5.11.2 -

                                                                                                                                                                                                                                                                                                                                                            Resource Usage

                                                                                                                                                                                                                                                                                                                                                            Compatibility Mapping

                                                                                                                                                                                                                                                                                                                                                            Before using Kubernetes resource metrics, review their compatibility with Sysdig components. The newly supported Kubernetes metrics are not available to older versions of Sysdig Agent.

                                                                                                                                                                                                                                                                                                                                                            Note also that you must edit the agent config file, dragent.yaml, to enable these metrics. See Enable Kube State Metrics Collection with K8s_extra_resources.

                                                                                                                                                                                                                                                                                                                                                            Metric NameAgentPlatform
                                                                                                                                                                                                                                                                                                                                                            PVC metrics0.89.3 and beyondRelease 2172
                                                                                                                                                                                                                                                                                                                                                            Resource Quota metrics0.87.1 and beyondRelease 2172
                                                                                                                                                                                                                                                                                                                                                            HPA metrics0.79.0 and beyondRelease 2172

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Resource Metrics

                                                                                                                                                                                                                                                                                                                                                            Metric Name

                                                                                                                                                                                                                                                                                                                                                            Metric Description

                                                                                                                                                                                                                                                                                                                                                            Metric Type

                                                                                                                                                                                                                                                                                                                                                            Segment By

                                                                                                                                                                                                                                                                                                                                                            kubernetes.persistentvolumeclaim.storage

                                                                                                                                                                                                                                                                                                                                                            The storage capacity requested by the persistent volume claim.

                                                                                                                                                                                                                                                                                                                                                            kubernetes.persistentvolumeclaim.storage provides Sysdig users with a single overarching metric for persistent volume claims (PVCs), rather than a series of metrics that often repeat/duplicate information. Each Kubernetes PVC metric is mapped to a kubernetes.persistentvolumeclaim.storage label, which can then be used to segment the overarching metric.

                                                                                                                                                                                                                                                                                                                                                            See Grouping, Scoping, and Segmenting Metrics for more information on segmenting metrics.

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.namespace.name

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.persistentvolumeclaim.label.accessmode

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.persistentvolumeclaim.label.app

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.persistentvolumeclaim.label.status.phase

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.persistentvolumeclaim.label.storage

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.persistentvolumeclaim.label.storageclassname

                                                                                                                                                                                                                                                                                                                                                            • kubernetes.persistentvolumeclaim.label.volumename

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.restart.count

                                                                                                                                                                                                                                                                                                                                                            The cumulative number of container restarts for the pod over its lifetime.

                                                                                                                                                                                                                                                                                                                                                            This metric is not useful for alerts. Sysdig recommends using kubernetes.pod.restart.rate instead.

                                                                                                                                                                                                                                                                                                                                                            Counter - Integer

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.pod.restart.rate

                                                                                                                                                                                                                                                                                                                                                            The number of container restarts for the pod within the defined scope/time period.

                                                                                                                                                                                                                                                                                                                                                            Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            The number of replica pods the replicaSet is configured to maintain.

                                                                                                                                                                                                                                                                                                                                                            Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicaSet.replicas.running

                                                                                                                                                                                                                                                                                                                                                            The current number of replica pods running in the replicaSet.

                                                                                                                                                                                                                                                                                                                                                            Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicationController.replicas.desired

                                                                                                                                                                                                                                                                                                                                                            The number of replica pods the replicationController is configured to maintain.

                                                                                                                                                                                                                                                                                                                                                            Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            kubernetes.replicationController.replicas.running

                                                                                                                                                                                                                                                                                                                                                            The current number of replica pods running in the replication controller.

                                                                                                                                                                                                                                                                                                                                                            Gauge - Integer

                                                                                                                                                                                                                                                                                                                                                            Kubernetes

                                                                                                                                                                                                                                                                                                                                                            5.12 -

                                                                                                                                                                                                                                                                                                                                                            Network

                                                                                                                                                                                                                                                                                                                                                            For more information heuristic metrics described in this topic, refer to the Heuristic and Deprecated Metrics.

                                                                                                                                                                                                                                                                                                                                                            net.bytes.in

                                                                                                                                                                                                                                                                                                                                                            Inbound network bytes. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.bytes.out

                                                                                                                                                                                                                                                                                                                                                            Outbound network bytes. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.bytes.total

                                                                                                                                                                                                                                                                                                                                                            Total network bytes. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.client.ip

                                                                                                                                                                                                                                                                                                                                                            The client IP address.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.in

                                                                                                                                                                                                                                                                                                                                                            The number of currently established client (inbound) connections.

                                                                                                                                                                                                                                                                                                                                                            This metric is especially useful when segmented by port, process, or protocol.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Protocol, Port, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.out

                                                                                                                                                                                                                                                                                                                                                            The number of currently established server (outbound) connections.

                                                                                                                                                                                                                                                                                                                                                            This metric is especially useful when segmented by port, process, or protocol.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Port, Protocol, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.connection.count.total

                                                                                                                                                                                                                                                                                                                                                            The number of currently established connections. This value may exceed the sum of the inbound and outbound metrics since it represents client and server inter-host connections as well as internal only connections.

                                                                                                                                                                                                                                                                                                                                                            This metric is especially useful when segmented by port, process, or protocol.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Port, Protocol, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.error.count

                                                                                                                                                                                                                                                                                                                                                            The number of errors encountered by network system calls, such as connect(), send(), and recv(). By default, this metric displays the total value for the defined scope. For example, if the scope is defined as a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.http.error.count

                                                                                                                                                                                                                                                                                                                                                            net.http.error.count is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The number of failed HTTP requests, determined by the total number of 4xx/5xx status codes.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.http.method

                                                                                                                                                                                                                                                                                                                                                            The HTTP request method.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment Byhost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.http.request.count

                                                                                                                                                                                                                                                                                                                                                            net.http.request.count is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            HTTP request count.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.http.request.time

                                                                                                                                                                                                                                                                                                                                                            net.http.request.time is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Average HTTP request time.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.http.request.time.worst

                                                                                                                                                                                                                                                                                                                                                            The maximum time for HTTP requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.http.statusCode

                                                                                                                                                                                                                                                                                                                                                            The HTTP response status code.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.http.url

                                                                                                                                                                                                                                                                                                                                                            The HTTP request URL.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.link.clientServer.bytes

                                                                                                                                                                                                                                                                                                                                                            The number of bytes passing through the link from client to server.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.link.delay.perRequest

                                                                                                                                                                                                                                                                                                                                                            Average delay in the network link per request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.link.serverClient.bytes

                                                                                                                                                                                                                                                                                                                                                            The number of bytes passing through the link from server to client.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.local.endpoint

                                                                                                                                                                                                                                                                                                                                                            The local endpoint for a connection. This metric is resolved to a user-friendly host name, if available.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.local.service

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.collection

                                                                                                                                                                                                                                                                                                                                                            The MongoDB collection.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.error.count

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.error.count is a heuristic metric. For more information, refer to the Heuristic and Deprecated Metrics.

                                                                                                                                                                                                                                                                                                                                                            The number of Failed MongoDB requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.operation

                                                                                                                                                                                                                                                                                                                                                            The MongoDB operation.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.count

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.count is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The total number of MongoDB requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.time

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.time is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The average time to complete a MongoDB request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.mongodb.request.time.worst (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The maximum time to complete a MongoDB request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.protocol

                                                                                                                                                                                                                                                                                                                                                            The network protocol of a request (for example, HTTP or MySQL).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.remote.endpoint

                                                                                                                                                                                                                                                                                                                                                            The remote endpoint of a connection. This metric automatically resolves as a user-friendly host name, if available.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.remote.service

                                                                                                                                                                                                                                                                                                                                                            Service (port number) of a remote node.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.request.count

                                                                                                                                                                                                                                                                                                                                                            net.request.count is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Total number of network requests.

                                                                                                                                                                                                                                                                                                                                                            This value may exceed the sum of inbound and outbound requests, because this count includes requests over internal connections.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.count.in

                                                                                                                                                                                                                                                                                                                                                            net.request.count.in is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Number of inbound network requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.count.out

                                                                                                                                                                                                                                                                                                                                                            Number of outbound network requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time

                                                                                                                                                                                                                                                                                                                                                            net.request.time is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            A measure of response time which includes app + network latency. For server side it is purely a measure of app latency. This is calculated by measuring when we see the arrival of the last request buffer to when we see the departure of the first response buffer.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.file (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The amount of time for serving a request that is spent doing file I/O. See also net.request.time.net (network I/O time) and net.request.time.processing (CPU processing time).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.file.percent

                                                                                                                                                                                                                                                                                                                                                            net.request.time.file.percent is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The percentage of time for serving a request that is spent doing file I/O. See also net.request.time.net (network I/O time) and net.request.time.processing (CPU processing time).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.in

                                                                                                                                                                                                                                                                                                                                                            net.request.time.in is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Average time to serve an inbound request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.local (deprecated)

                                                                                                                                                                                                                                                                                                                                                            Average per request delay introduced by this node when it serves requests coming from the previous tiers. In other words, this is the time spent serving incoming requests minus the time spent waiting for outgoing requests to complete.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.local.percent

                                                                                                                                                                                                                                                                                                                                                            net.request.time.local.percent is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The percentage of time spent in the local node versus the next tiers, when serving requests that come from previous tiers.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.net (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The amount of time for serving a request that is spent doing network I/O. See also net.request.time.file (file I/O time) and net.request.time.processing (CPU processing time).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.net.percent

                                                                                                                                                                                                                                                                                                                                                            net.request.time.net.percent is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The percent of time for serving a request that is spent doing network I/O. See also net.request.time.file (file I/O time) and net.request.time.processing (CPU processing time).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.nextTiers (deprecated)

                                                                                                                                                                                                                                                                                                                                                            Delay introduced by the successive tiers when serving requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.nextTiers.percent

                                                                                                                                                                                                                                                                                                                                                            net.request.time.nextTiers.percent is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The percentage of time spent in the next tiers versus the local node, when serving requests that come from previous tiers.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.out

                                                                                                                                                                                                                                                                                                                                                            net.request.time.out is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Average time spent waiting for an outbound request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.processing (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The amount of time for serving a request that is spent doing CPU processing. See also net.request.time.file (file I/O time) and net.request.time.net (network I/O time).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.processing.percent

                                                                                                                                                                                                                                                                                                                                                            net.request.time.processing.percent is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The percent of time for serving a request that is spent doing CPU processing. See also net.request.time.file (file I/O time) and net.request.time.net (network I/O time).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.worst.in

                                                                                                                                                                                                                                                                                                                                                            net.request.time.worst.in is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Maximum time to serve an inbound request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.request.time.worst.out

                                                                                                                                                                                                                                                                                                                                                            net.request.time.worst.out is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Maximum time spent waiting for an outbound request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.role

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.server.ip

                                                                                                                                                                                                                                                                                                                                                            Server IP address.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.server.port

                                                                                                                                                                                                                                                                                                                                                            TCP/UDP Server port number.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.sql.error.count

                                                                                                                                                                                                                                                                                                                                                            net.sql.error.count is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The number of Failed SQL requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.sql.query

                                                                                                                                                                                                                                                                                                                                                            The full SQL query. If the query string is longer than 512 characters, it will be truncated to 512 characters.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.sql.query.type

                                                                                                                                                                                                                                                                                                                                                            The SQL query type (for example, SELECT, INSERT, or DELETE).

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.sql.request.count

                                                                                                                                                                                                                                                                                                                                                            net.sql.request.count is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            The number of SQL requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.sql.request.time

                                                                                                                                                                                                                                                                                                                                                            net.sql.request.time is a heuristic metric.

                                                                                                                                                                                                                                                                                                                                                            Average time to complete an SQL request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.sql.request.time.worst (deprecated)

                                                                                                                                                                                                                                                                                                                                                            Maximum time to complete a SQL request.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TyperelativeTime
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            net.sql.table

                                                                                                                                                                                                                                                                                                                                                            The SQL query table name.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            net.tcp.queue.len

                                                                                                                                                                                                                                                                                                                                                            The length of the TCP request queue.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.13 -

                                                                                                                                                                                                                                                                                                                                                            Process

                                                                                                                                                                                                                                                                                                                                                            fd.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of used file descriptors out of the maximum available. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            This metric should be monitored carefully, and used for alerts, as when a process reaches its file descriptor limit, the process will stop operating correctly, and potentially crash.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            proc.commandLine

                                                                                                                                                                                                                                                                                                                                                            Command line used to start the process.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByProcess
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/a
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            proc.count

                                                                                                                                                                                                                                                                                                                                                            The number of processes on host or container, excluding any processes that do not have .exe or command line parameters in the process table. These processes typically are kernel or system level, and are typically identified by square brackets (for example, [kthreadd]).

                                                                                                                                                                                                                                                                                                                                                            As some processes are excluded, the host level proc.count value will be lower than the value reported by the ps -ef command on the host.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            proc.name

                                                                                                                                                                                                                                                                                                                                                            Name of the process.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByProcess
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            proc.name.client

                                                                                                                                                                                                                                                                                                                                                            Name of the Client process.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByProcess
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            proc.name.server

                                                                                                                                                                                                                                                                                                                                                            Name of the server process.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByProcess
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            proc.start.count

                                                                                                                                                                                                                                                                                                                                                            Number of process starts on host or container.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            5.14 -

                                                                                                                                                                                                                                                                                                                                                            RedisDB Metrics

                                                                                                                                                                                                                                                                                                                                                            See also: RedisDBintegration information.

                                                                                                                                                                                                                                                                                                                                                            redis.aof.buffer_length

                                                                                                                                                                                                                                                                                                                                                            The size of the AOF buffer.

                                                                                                                                                                                                                                                                                                                                                            redis.aof.last_rewrite_time

                                                                                                                                                                                                                                                                                                                                                            The duration of the last AOF rewrite.

                                                                                                                                                                                                                                                                                                                                                            redis.aof.rewrite

                                                                                                                                                                                                                                                                                                                                                            A flag indicating that a AOF rewrite operation is on-going.

                                                                                                                                                                                                                                                                                                                                                            redis.clients.biggest_input_buf

                                                                                                                                                                                                                                                                                                                                                            The biggest input buffer among current client connections.

                                                                                                                                                                                                                                                                                                                                                            redis.clients.blocked

                                                                                                                                                                                                                                                                                                                                                            The number of connections waiting on a blocking call.

                                                                                                                                                                                                                                                                                                                                                            redis.clients.longest_output_list

                                                                                                                                                                                                                                                                                                                                                            The longest output list among current client connections.

                                                                                                                                                                                                                                                                                                                                                            redis.command.calls

                                                                                                                                                                                                                                                                                                                                                            The number of times a redis command has been called. The commands are tagged with command (for example, command:append).

                                                                                                                                                                                                                                                                                                                                                            redis.command.usec_per_call

                                                                                                                                                                                                                                                                                                                                                            The CPU time consumed per redis command call. The commands are tagged with command (for example, command:append).

                                                                                                                                                                                                                                                                                                                                                            redis.cpu.sys

                                                                                                                                                                                                                                                                                                                                                            The system CPU consumed by the Redis server.

                                                                                                                                                                                                                                                                                                                                                            redis.cpu.sys_children

                                                                                                                                                                                                                                                                                                                                                            The system CPU consumed by the background processes.

                                                                                                                                                                                                                                                                                                                                                            redis.cpu.user

                                                                                                                                                                                                                                                                                                                                                            The user CPU consumed by the Redis server.

                                                                                                                                                                                                                                                                                                                                                            redis.cpu.user_children

                                                                                                                                                                                                                                                                                                                                                            The user CPU consumed by the background processes.

                                                                                                                                                                                                                                                                                                                                                            redis.expires

                                                                                                                                                                                                                                                                                                                                                            The number of keys that have expired.

                                                                                                                                                                                                                                                                                                                                                            redis.expires.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of total keys that have been expired.

                                                                                                                                                                                                                                                                                                                                                            redis.info.latency_ms

                                                                                                                                                                                                                                                                                                                                                            The latency of the redis INFO command.

                                                                                                                                                                                                                                                                                                                                                            redis.key.length

                                                                                                                                                                                                                                                                                                                                                            The number of elements in a given key. Each element is tagged by key (for example, key:mykeyname).

                                                                                                                                                                                                                                                                                                                                                            redis.keys

                                                                                                                                                                                                                                                                                                                                                            The total number of keys.

                                                                                                                                                                                                                                                                                                                                                            redis.keys.evicted

                                                                                                                                                                                                                                                                                                                                                            The total number of keys evicted due to the maxmemory limit.

                                                                                                                                                                                                                                                                                                                                                            redis.keys.expired

                                                                                                                                                                                                                                                                                                                                                            The total number of keys expired from the database.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.fragmentation_ratio

                                                                                                                                                                                                                                                                                                                                                            The ratio between used_memory_rss and used_memory.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.lua

                                                                                                                                                                                                                                                                                                                                                            The amount of memory used by the Lua engine.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.maxmemory

                                                                                                                                                                                                                                                                                                                                                            The maximum amount of memory allotted to the RedisDB system.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.overhead

                                                                                                                                                                                                                                                                                                                                                            Sum of all the overheads allocated by Redis for managing its internal data structures.

                                                                                                                                                                                                                                                                                                                                                            Supported by Sysdig Agent v9.7.0 and above.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.peak

                                                                                                                                                                                                                                                                                                                                                            The peak amount of memory used by Redis.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.startup

                                                                                                                                                                                                                                                                                                                                                            Amount of memory consumed by Redis while initializing.

                                                                                                                                                                                                                                                                                                                                                            Supported by Sysdig Agent v9.7.0 and above.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.rss

                                                                                                                                                                                                                                                                                                                                                            The amount of memory that Redis allocated as seen by the operating system.

                                                                                                                                                                                                                                                                                                                                                            redis.mem.used

                                                                                                                                                                                                                                                                                                                                                            The amount of memory allocated by Redis.

                                                                                                                                                                                                                                                                                                                                                            redis.net.clients

                                                                                                                                                                                                                                                                                                                                                            The number of connected clients (excluding slaves).

                                                                                                                                                                                                                                                                                                                                                            redis.net.commands

                                                                                                                                                                                                                                                                                                                                                            The number of commands processed by the server.

                                                                                                                                                                                                                                                                                                                                                            redis.net.commands.instantaneous_ops_per_sec

                                                                                                                                                                                                                                                                                                                                                            The number of commands processed by the server per second.

                                                                                                                                                                                                                                                                                                                                                            redis.net.rejected

                                                                                                                                                                                                                                                                                                                                                            The number of rejected connections.

                                                                                                                                                                                                                                                                                                                                                            redis.net.slaves

                                                                                                                                                                                                                                                                                                                                                            The number of connected slaves.

                                                                                                                                                                                                                                                                                                                                                            redis.perf.latest_fork_usec

                                                                                                                                                                                                                                                                                                                                                            The duration of the latest fork.

                                                                                                                                                                                                                                                                                                                                                            redis.persist

                                                                                                                                                                                                                                                                                                                                                            The number of keys persisted. The formula for this metric is redis.keys - redis.expires.

                                                                                                                                                                                                                                                                                                                                                            redis.persist.percent

                                                                                                                                                                                                                                                                                                                                                            Percentage of total keys that are persisted.

                                                                                                                                                                                                                                                                                                                                                            redis.pubsub.channels

                                                                                                                                                                                                                                                                                                                                                            The number of active pubsub channels.

                                                                                                                                                                                                                                                                                                                                                            redis.pubsub.patterns

                                                                                                                                                                                                                                                                                                                                                            The number of active pubsub patterns.

                                                                                                                                                                                                                                                                                                                                                            redis.rdb.bgsave

                                                                                                                                                                                                                                                                                                                                                            Determines whether a bgsave is in progress. The value is one if a bgsave is in progress, and zero at all other times.

                                                                                                                                                                                                                                                                                                                                                            redis.rdb.changes_since_last

                                                                                                                                                                                                                                                                                                                                                            The number of changes since the last background save.

                                                                                                                                                                                                                                                                                                                                                            redis.rdb.last_bgsave_time

                                                                                                                                                                                                                                                                                                                                                            The duration of the last bg_save operation.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.backlog_histlen

                                                                                                                                                                                                                                                                                                                                                            The amount of data in the backlog sync buffer.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.delay

                                                                                                                                                                                                                                                                                                                                                            The replication delay in offsets.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.last_io_seconds_ago

                                                                                                                                                                                                                                                                                                                                                            The amount of time since the last interaction with master.

                                                                                                                                                                                                                                                                                                                                                            The amount of time that the master link has been down.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.master_repl_offset

                                                                                                                                                                                                                                                                                                                                                            The replication offset reported by the master.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.slave_repl_offset

                                                                                                                                                                                                                                                                                                                                                            The replication offset reported by the slave.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.sync

                                                                                                                                                                                                                                                                                                                                                            Determines whether a sync is in progress. The value is one if a sync is in progress, and zero at all other times.

                                                                                                                                                                                                                                                                                                                                                            redis.replication.sync_left_bytes

                                                                                                                                                                                                                                                                                                                                                            The amount of data left before syncing is complete.

                                                                                                                                                                                                                                                                                                                                                            redis.slowlog.micros.95percentile

                                                                                                                                                                                                                                                                                                                                                            The 95th percentile of the duration of queries reported in the slow log.

                                                                                                                                                                                                                                                                                                                                                            redis.slowlog.micros.avg

                                                                                                                                                                                                                                                                                                                                                            The average duration of queries reported in the slow log.

                                                                                                                                                                                                                                                                                                                                                            redis.slowlog.micros.count

                                                                                                                                                                                                                                                                                                                                                            The rate of queries reported in the slow log.

                                                                                                                                                                                                                                                                                                                                                            redis.slowlog.micros.max

                                                                                                                                                                                                                                                                                                                                                            The maximum duration of queries reported in the slow log.

                                                                                                                                                                                                                                                                                                                                                            redis.slowlog.micros.median

                                                                                                                                                                                                                                                                                                                                                            The median duration of queries reported in the slow log.

                                                                                                                                                                                                                                                                                                                                                            redis.stats.keyspace_hits

                                                                                                                                                                                                                                                                                                                                                            The total number of successful lookups in the database.

                                                                                                                                                                                                                                                                                                                                                            redis.stats.keyspace_misses

                                                                                                                                                                                                                                                                                                                                                            The total number of missed lookups in the database.

                                                                                                                                                                                                                                                                                                                                                            5.15 -

                                                                                                                                                                                                                                                                                                                                                            Security Policy Metrics

                                                                                                                                                                                                                                                                                                                                                            Metrics

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Type

                                                                                                                                                                                                                                                                                                                                                            Segmented by

                                                                                                                                                                                                                                                                                                                                                            Minimum Agent Version

                                                                                                                                                                                                                                                                                                                                                            security.evts.k8s_audit

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a Kubernetes audit policy.

                                                                                                                                                                                                                                                                                                                                                            Gauge

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            host.hostname

                                                                                                                                                                                                                                                                                                                                                            0.86.0

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.syscall

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a syscall policy.

                                                                                                                                                                                                                                                                                                                                                            security.policies.enabled

                                                                                                                                                                                                                                                                                                                                                            The number of security policies enabled for a user.

                                                                                                                                                                                                                                                                                                                                                            security.policies.total

                                                                                                                                                                                                                                                                                                                                                            The number of security policies that exist for a user.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.container

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a container policy.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.falco

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a Falco policy.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.filesystem

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a filesystem policy.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.high

                                                                                                                                                                                                                                                                                                                                                            The number of policy events from a policy with high severity.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.low

                                                                                                                                                                                                                                                                                                                                                            The number of policy events from a policy with low severity.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.medium

                                                                                                                                                                                                                                                                                                                                                            The number of policy events from a policy with medium severity.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.network

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a network policy.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.process

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events from a process policy.

                                                                                                                                                                                                                                                                                                                                                            security.policy_evts.total

                                                                                                                                                                                                                                                                                                                                                            The total number of policy events across all policy types.

                                                                                                                                                                                                                                                                                                                                                            security_policy_evts.by_name

                                                                                                                                                                                                                                                                                                                                                            The number of events triggered with segment name available.

                                                                                                                                                                                                                                                                                                                                                            name

                                                                                                                                                                                                                                                                                                                                                            host.mac

                                                                                                                                                                                                                                                                                                                                                            host.hostname

                                                                                                                                                                                                                                                                                                                                                            5.16 -

                                                                                                                                                                                                                                                                                                                                                            System

                                                                                                                                                                                                                                                                                                                                                            capacity.estimated.request.stolen.count (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The number of requests the node cannot serve due to CPU steal time. This metric is calculated by measuring the current number of requests the machine is serving, and calculating how many more requests could be served if there was no steal time.

                                                                                                                                                                                                                                                                                                                                                            This metric can be used to understand how steal time impacts the ability to serve user requests.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Process
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            capacity.estimated.request.total.count (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The estimated number of requests the node serves at full capacity. This metric is calculated by measuring the number of requests that a machine is serving, and the resources each request is using, and combining the values to project how many requests the machine can serve.

                                                                                                                                                                                                                                                                                                                                                            This metric can help users determine if/when the infrastructure capacity should be increased.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Process
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            capacity.stolen.percent (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The lost service request capacity due to stolen CPU. This metric reflects the impact on other resource usage capabilities, including disk I/O and network I/O.

                                                                                                                                                                                                                                                                                                                                                            capacity.stolen.percent is non-zero only if cpu.stolen.percent is also non-zero.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Process
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            capacity.total.percent (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The estimated current capacity usage, based on CPU and disk/network utilization, with CPU stolen time added back in.

                                                                                                                                                                                                                                                                                                                                                            capacity.total.percent can be used to show how the system would perform with dedicated CPU usage.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Process
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            capacity.used.percent (deprecated)

                                                                                                                                                                                                                                                                                                                                                            The estimated current capacity usage, based on CPU and disk/network utilization. This metric is calculated by adding the value of how many resources each request coming to the machine is using, creating a score that indicates how saturates the machine resources are.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Process
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.idle.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of time that the CPU/s were idle and the system did not have an outstanding disk I/O request. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.iowait.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of time that the CPU/s were idle during which the system had an outstanding disk I/O request. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.nice.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU utilization that occurred while executing at the user level with Nice priority. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.stolen.percent

                                                                                                                                                                                                                                                                                                                                                            Measures the percentage of time that a virtual machine’s CPU is in a state of involuntary wait due to the fact that the physical CPU is shared among virtual machines. In calculating steal time, the operating system kernel detects when it has work available but does not have access to the physical CPU to perform that work.

                                                                                                                                                                                                                                                                                                                                                            If the percent of steal time is consistently high, you may want to stop and restart the instance (since it will most likely start on different physical hardware) or upgrade to a virtual machine with more CPU power. Also see capacity.total.percent to see how steal time directly impacts the number of server requests that could not be handled. On AWS EC2, steal time does not depend on the activity of other virtual machine neighbors. EC2 is simply making sure your instance is not using more CPU cycles than paid for.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.system.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU utilization that occurred while executing at the system level (kernel). By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.cores.used

                                                                                                                                                                                                                                                                                                                                                            The CPU core usage of each container is obtained from cgroups, and is equal to the number of cores used by the container. For example, if a container uses two of an available four cores, the value of cpu.cores.used will be two.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeNumber
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max, RateofChange
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.cores.used.percent

                                                                                                                                                                                                                                                                                                                                                            The CPU core usage percent for each container is obtained from cgroups, and is equal to the number of cores multiplied by 100. For example, if a container uses three cores, the value of cpu.cores.used.percent would be 300%.

                                                                                                                                                                                                                                                                                                                                                            This metric is comparable to the CPU usage metric in docker stats.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAverage, Rate, Sum, Min, Max, rateOfChange
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAverage, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.used.percent

                                                                                                                                                                                                                                                                                                                                                            Containers

                                                                                                                                                                                                                                                                                                                                                            The CPU usage for each container is obtained from cgroups, and normalized by dividing by the number of cores to determine an overall percentage.

                                                                                                                                                                                                                                                                                                                                                            For example, if the environment contains six cores on a host, and the container or processes are assigned two cores, Sysdig will report CPU usage as:

                                                                                                                                                                                                                                                                                                                                                            2/6 * 100% = 33.33%
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            By comparison, the docker stats command would report the CPU usage as 200%, as each individual core is assigned a value of 100%.

                                                                                                                                                                                                                                                                                                                                                            For service or orchestrator constructs, the container CPU is aggregated based on container labels.

                                                                                                                                                                                                                                                                                                                                                            Hosts

                                                                                                                                                                                                                                                                                                                                                            The CPU usage for each host is obtained from /proc, and measured as the sum of the CPU usage of all cores, normalized by dividing by the number of cores.

                                                                                                                                                                                                                                                                                                                                                            The CPU usage for each host is the sum of cpu.user.percent, cpu.nice.percent, cpu.stolen.percent, and cpu.system.percent.

                                                                                                                                                                                                                                                                                                                                                            The Linux command top can be used to review these values as well.

                                                                                                                                                                                                                                                                                                                                                            Processes

                                                                                                                                                                                                                                                                                                                                                            The CPU usage for each process is obtained from /proc, and normalized by dividing by the number of cores.

                                                                                                                                                                                                                                                                                                                                                            When cpu.used.percent is segmented by process at the host level, the sum of the CPU usage of each process may not always add up to the CPU usage of the host. The most common reasons for this are:

                                                                                                                                                                                                                                                                                                                                                            • There are short-lived processes that spike for less than two seconds.

                                                                                                                                                                                                                                                                                                                                                            • Granular data is retained for the highest CPU usage processes, rather than all processes.

                                                                                                                                                                                                                                                                                                                                                            • The kernel thread CPU usage is not reported as a process.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            cpu.user.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of CPU utilization that occurred while executing at the user level (application). By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.free

                                                                                                                                                                                                                                                                                                                                                            Available filesystem space.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.total

                                                                                                                                                                                                                                                                                                                                                            Total filesystem size.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.bytes.used

                                                                                                                                                                                                                                                                                                                                                            Used filesystem space.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.device

                                                                                                                                                                                                                                                                                                                                                            Filesystem device.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            fs.free.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of free filesystem space.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.total.count

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.used.count

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.inodes.used.percent

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.largest.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of filesystem space used by the largest filesystem.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.mountDir

                                                                                                                                                                                                                                                                                                                                                            The filesystem mount directory.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            fs.root.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of root filesystem space used.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            fs.type

                                                                                                                                                                                                                                                                                                                                                            Filesystem type.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeString
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsN/A
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationN/A
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsN/A

                                                                                                                                                                                                                                                                                                                                                            fs.used.percent

                                                                                                                                                                                                                                                                                                                                                            The amount of space written by a single container instance. This value is provided by the container engine and is not supported for some versions of CRIO. For example, CRIO-1.15 which is used in Openshift 4.2. crictl stats not showing the size indicates that this feature is not supported.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            host.error.count

                                                                                                                                                                                                                                                                                                                                                            The number of system call errors. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            load.average.15m

                                                                                                                                                                                                                                                                                                                                                            The 15 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 15 minutes for all cores. The value should correspond to the third (and last) load average value displayed by the ‘uptime’ command.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            load.average.1m

                                                                                                                                                                                                                                                                                                                                                            The 1 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 1 minute for all cores. The value should correspond to the third (and last) load average value displayed by the ‘uptime’ command.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            load.average.5m

                                                                                                                                                                                                                                                                                                                                                            The 5 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 5 minutes for all cores. The value should correspond to the third (and last) load average value displayed by the ‘uptime’ command.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            load.average.percpu.15m

                                                                                                                                                                                                                                                                                                                                                            The 15 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 15 minutes, divided by number of system CPUs.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            load.average.percpu.1m

                                                                                                                                                                                                                                                                                                                                                            The 1 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 1 minute, divided by number of system CPUs.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            load.average.percpu.5m

                                                                                                                                                                                                                                                                                                                                                            The 5 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 5 minutes, divided by number of system CPUs.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.available

                                                                                                                                                                                                                                                                                                                                                            The amount of available memory. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            An estimate of how much memory is available for starting new applications, without swapping.

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.available may not be directly available on older systems using kernel versions older than 3.14. In these instances, the metric is an approximate value, determined by adding the free and cached memory values.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.total

                                                                                                                                                                                                                                                                                                                                                            The total memory of a host, in bytes. This value is obtained from /proc. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.used

                                                                                                                                                                                                                                                                                                                                                            The amount of physical memory currently in use. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            The formula for determining memory.bytes.used is slightly different depending on whether you are examining processes or containers. For containers, the formula is rss+cache-inactive_file. This means that the total amount of page cache memory (inactive_file) is subtracted from the total number of bytes of page cache memory, and the total number of bytes of anonymous and swap cache memory, combined.

                                                                                                                                                                                                                                                                                                                                                            This is different to the docker stats approach, and may result in different results.

                                                                                                                                                                                                                                                                                                                                                            For processes, the formula is the total value of the size of the resident anonymous memory, the size of the resident file mappings, and the size of the resident shared memory.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.bytes.virtual

                                                                                                                                                                                                                                                                                                                                                            The virtual memory size of the process, in bytes. This value is obtained from Sysdig events. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.pageFault.major

                                                                                                                                                                                                                                                                                                                                                            A count of the condition that occurs when a program accesses a memory page that is mapped in the virtual address space, but not loaded in physical memory. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            A major or ‘hard’ page fault is handled by using a disk I/O operation (e.g., memory mapped file or page replacement causing a page swapping). For instance, when starting an application, the Linux kernel will search physical memory and the CPU cache, and, if data does not exist, a major page fault occurs. Generally, adjusting application source code or making more physical memory available reduces major page faults.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.pageFault.minor

                                                                                                                                                                                                                                                                                                                                                            A count of the condition in which a memory page had been loaded in memory at the time the page fault was generated, but was not marked in the memory management unit as being loaded in memory. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            If the page is loaded in memory at the time the fault is generated, but is not marked in the memory management unit as being loaded in memory, then it is called a minor or ‘soft’ page fault. A minor page fault is handled without using a disk I/O operation (e.g., allocated by malloc().). The effect of minor page faults depends on system load and other factors, but are typically short and have very little impact.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeCounter
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationSum
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.available

                                                                                                                                                                                                                                                                                                                                                            The swap memory available. This metric is determined by the sum of the free and cached swap memory. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.total

                                                                                                                                                                                                                                                                                                                                                            The total amount of swap memory. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.swap.bytes.used

                                                                                                                                                                                                                                                                                                                                                            The amount of swap memory used. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeByte
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.swap.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of swap memory used. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            memory.used.percent

                                                                                                                                                                                                                                                                                                                                                            The percentage of physical memory in use. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

                                                                                                                                                                                                                                                                                                                                                            Refer to memory.bytes.used for information on the calculation formulas.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value Type%
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            system.uptime

                                                                                                                                                                                                                                                                                                                                                            The system uptime.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            thread.count

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            uptime

                                                                                                                                                                                                                                                                                                                                                            The percentage of time the selected entity or entities was down over the defined time window.

                                                                                                                                                                                                                                                                                                                                                            While this metric is a percentage value, the value is presented as an integer between 0 and 1, rather than a percentage between 0% and 100%.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Metric TypeGauge
                                                                                                                                                                                                                                                                                                                                                            Value TypeInteger
                                                                                                                                                                                                                                                                                                                                                            Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
                                                                                                                                                                                                                                                                                                                                                            Default Time AggregationRate
                                                                                                                                                                                                                                                                                                                                                            Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
                                                                                                                                                                                                                                                                                                                                                            Default Group AggregationAverage
                                                                                                                                                                                                                                                                                                                                                            Available Group Aggregation FormatsAvg, Sum, Min, Max

                                                                                                                                                                                                                                                                                                                                                            6 -

                                                                                                                                                                                                                                                                                                                                                            Dashboards

                                                                                                                                                                                                                                                                                                                                                            Sysdig users can create customized dashboards to display the most useful or relevant views and metrics for the infrastructure in a single location. This feature-rich dashboards support both form-based and PromQL-based queries and offer several user experience enhancements:

                                                                                                                                                                                                                                                                                                                                                            • Multiple data queries per panel

                                                                                                                                                                                                                                                                                                                                                            • Basic (form-based) and advanced (PromQL) data queries

                                                                                                                                                                                                                                                                                                                                                            • Compare basic query result against historical data

                                                                                                                                                                                                                                                                                                                                                            • Query support:

                                                                                                                                                                                                                                                                                                                                                              • Allows to query multiple metrics

                                                                                                                                                                                                                                                                                                                                                              • Render the results of a query (time series) as line, bars, stacked area, stairs, text, and so on.

                                                                                                                                                                                                                                                                                                                                                              • Ability to scope and segment each query separately

                                                                                                                                                                                                                                                                                                                                                              • Inherit, augment or override dashboard scope

                                                                                                                                                                                                                                                                                                                                                              • Metric descriptor based units with the ability to override

                                                                                                                                                                                                                                                                                                                                                              • Assign Y-axis automatically based on query unit type with the ability to override

                                                                                                                                                                                                                                                                                                                                                            Each dashboard is composed of a series of panels configured to display specific data in a number of different formats. Learn more about how dashboards and panels are created, organized, and managed in the following sections:

                                                                                                                                                                                                                                                                                                                                                            6.1 -

                                                                                                                                                                                                                                                                                                                                                            About the Dashboard UI

                                                                                                                                                                                                                                                                                                                                                            The main components of the Dashboard UI include widgets, time navigation, and panels.

                                                                                                                                                                                                                                                                                                                                                            Widgets

                                                                                                                                                                                                                                                                                                                                                            Dashboards support time series (Timechart), Histogram, Number graphs, Table, Text, and Toplist.

                                                                                                                                                                                                                                                                                                                                                            Timechart and Number graph support both form-based and advanced (PromQL) queries, whereas Histogram, Table, and Toplist panels support building only form-based queries.

                                                                                                                                                                                                                                                                                                                                                            Time Navigation

                                                                                                                                                                                                                                                                                                                                                            Dashboard is designed around time. After a query has been executed, Sysdig Monitor polls the infrastructure data every 10 seconds and refreshes the metrics on the Dashboard panel. You select how to view this gathered data by choosing a Preset interval and a time Range.

                                                                                                                                                                                                                                                                                                                                                            Presets

                                                                                                                                                                                                                                                                                                                                                            Presets are a way of visualizing data that Sysdig Monitor gathers every 10 minutes. Select a preset to determine the data sample to be displayed. Overview supports the following presets:

                                                                                                                                                                                                                                                                                                                                                            • 10 Seconds

                                                                                                                                                                                                                                                                                                                                                            • 1 Minutes

                                                                                                                                                                                                                                                                                                                                                            • 10 Minutes

                                                                                                                                                                                                                                                                                                                                                            • 1 Hour

                                                                                                                                                                                                                                                                                                                                                            • 6 Hour

                                                                                                                                                                                                                                                                                                                                                            • 1 Day

                                                                                                                                                                                                                                                                                                                                                            • 2 Weeks

                                                                                                                                                                                                                                                                                                                                                            A preset that is 10 minutes or less is refreshed every 30 seconds. A preset that is greater than 10 minutes is refreshed every 1 minute.

                                                                                                                                                                                                                                                                                                                                                            Presets work in conjunction with Range selections. Selecting a particular preset interval refreshes Range selection and reloads the data subsequently. For example:

                                                                                                                                                                                                                                                                                                                                                            • 10 Minutes: Resets the Range to December 9, 2.20 pm - December 9, 2.30 pm.

                                                                                                                                                                                                                                                                                                                                                            • 6 Hour: Resets the Range to December 9, 8.30 am - December 9, 2.30 pm.

                                                                                                                                                                                                                                                                                                                                                            • 1 Day: Resets the Range to December 8, 2.30 pm - December 9, 2.30 pm.

                                                                                                                                                                                                                                                                                                                                                            Range

                                                                                                                                                                                                                                                                                                                                                            Range shows both date and time interval as well as the selected Presets in parenthesis. The Range indicated on the UI is determined by Presets. The time given is the closest time interval and by default, it is the current date and time preset by 1 hour.

                                                                                                                                                                                                                                                                                                                                                            Click on the Range tab to open a calendar to select a range.

                                                                                                                                                                                                                                                                                                                                                            See Presets to understand how Range works with Presets.

                                                                                                                                                                                                                                                                                                                                                            Live

                                                                                                                                                                                                                                                                                                                                                            The Live badge shows if the data shown is Live or Paused.

                                                                                                                                                                                                                                                                                                                                                            • Live: the data is continuously updating based on the 10-minute polling of the Sysdig back end. The Overview feed is normally always Live.

                                                                                                                                                                                                                                                                                                                                                            • Paused: When a specific row is selected, the data refresh pauses and the rows will not be updated with new data coming in.

                                                                                                                                                                                                                                                                                                                                                            Time Format

                                                                                                                                                                                                                                                                                                                                                            Dashboards support UTC and PDT time formats. Use the toggle button next to Range to change the time format for the slot shown in Range. The default is PDT.

                                                                                                                                                                                                                                                                                                                                                            Panel Properties

                                                                                                                                                                                                                                                                                                                                                            Query

                                                                                                                                                                                                                                                                                                                                                            With the Dashboard, you can construct queries in two ways: Form-Based and Advanced. As you construct your query and type in a keyword in the Metrics field, auto-complete offers suggestions for the metrics in the query.

                                                                                                                                                                                                                                                                                                                                                            Form-Based Query

                                                                                                                                                                                                                                                                                                                                                            Use the UI fields to construct queries. Form-based data queries consist of one or multiple metrics with time and group aggregation, Segmentation, Display, Unit for both incoming data as well as displaying data on the Y-Axis, and Scope. You can choose to inherit the Dashboard scope.

                                                                                                                                                                                                                                                                                                                                                            Form-based queries support both Sysdig dot notation and Prometheus-compatible underscore notation.

                                                                                                                                                                                                                                                                                                                                                            PromQL Query

                                                                                                                                                                                                                                                                                                                                                            The PromQL field supports only PromQL queries. Manually enter a PromQL query as follows:

                                                                                                                                                                                                                                                                                                                                                            Each query starts with a group aggregator, followed by a time aggregator, then the metrics and segmentation. For example:

                                                                                                                                                                                                                                                                                                                                                            sum(rate(promraw_go_memstats_heap_released_bytes_total[5m])) by(kubernetes_cluster_name)
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            Intelligent $__interval

                                                                                                                                                                                                                                                                                                                                                            You can use $__interval within a PromQL query to use the most appropriate sampling depending on the time range you have selected. This configuration ensures that the most granular data is accessible while downsampling when you select a long time range to panels load as fast as possible.

                                                                                                                                                                                                                                                                                                                                                            Scope variables

                                                                                                                                                                                                                                                                                                                                                            You can configure scope variables at the dashboard level to quickly filter metrics based on Cluster, Namespace, Workload, and more.

                                                                                                                                                                                                                                                                                                                                                            When using PromQL queries, you can select the scope by using dynamic variables. This configuration is significant when troubleshooting as it allows you to switch context quickly without reconfiguring queries.

                                                                                                                                                                                                                                                                                                                                                            Smart Autocompletion and Syntax Highlighting

                                                                                                                                                                                                                                                                                                                                                            Autocomplete suggests metrics, operators, and functions, while syntax highlighting helps highlight problems within a PromQL query. This is invaluable in dynamic environments and allows you to craft the right queries faster.

                                                                                                                                                                                                                                                                                                                                                            Define Axes

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor provides the flexibility to add two Y-axes on the graph. You can also determine whether you want to use them at all. Having the option to add an extra Y-axis help when you decide to add an extra query.

                                                                                                                                                                                                                                                                                                                                                            Specify the following for both Y-Axis and Y-Axis Right:

                                                                                                                                                                                                                                                                                                                                                            • Show: Select to show the Y-Axis on the graph.

                                                                                                                                                                                                                                                                                                                                                            • Scale: Specify the scale in which you want the data to be shown on the graph.

                                                                                                                                                                                                                                                                                                                                                            • Unit: Specify the unit of scale for the incoming data.

                                                                                                                                                                                                                                                                                                                                                            • Display Format: Specify the unit of scale for the data to be displayed on the Y-Axis.

                                                                                                                                                                                                                                                                                                                                                            • Y-Max: Specify the highest value to be displayed on the Y-Axis. Consider this as the highest point on the range. You can specify the limits as numeric values. However, the type of values that you specify must match the type of values along the axis. Y-Max should be always greater than Y-Min.

                                                                                                                                                                                                                                                                                                                                                            • Y-Min: Specify the lowest value to be displayed on the Y-Axis. Consider this as the lowest point on the range. You can specify both limits or you can specify one limit and let the axes automatically calculate the other.

                                                                                                                                                                                                                                                                                                                                                            Define Legend

                                                                                                                                                                                                                                                                                                                                                            Determine whether you want a legend with a descriptive label for each plotted time series. Specify the location and layout. Determine the value to be displayed should be the most recently calculated data.

                                                                                                                                                                                                                                                                                                                                                            For the labels, the legend uses the text you have specified in the Query Display Name and Timeseries Name fields.

                                                                                                                                                                                                                                                                                                                                                            Enable Show to show the legend or create a legend if one does not exist.

                                                                                                                                                                                                                                                                                                                                                            Right positions the legend in the upper right corner of the panel. Bottom positions the legend in the lower-left corner of the panel.

                                                                                                                                                                                                                                                                                                                                                            Define Panel

                                                                                                                                                                                                                                                                                                                                                            Specify the Panel heading and description by using the Panel tab. The description you enter appears as the panel information as follows:

                                                                                                                                                                                                                                                                                                                                                            6.2 -

                                                                                                                                                                                                                                                                                                                                                            Using PromQL

                                                                                                                                                                                                                                                                                                                                                            PromQL is available only in Sysdig SaaS editions. The feature is not yet supported by Sysdig on-premises installations.

                                                                                                                                                                                                                                                                                                                                                            The Prometheus Query Language (PromQL) is the defacto standard for querying Prometheus metric data. PromQL is designed to allow the user to select and aggregate time-series data.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor’s PromQL support includes all of the features, functions, and aggregations in standard open-source PromQL. The PromQL language is documented at Prometheus Query Basics.

                                                                                                                                                                                                                                                                                                                                                            For new functionalities released as part of agent v10.0.0, see Collect Prometheus Metrics.

                                                                                                                                                                                                                                                                                                                                                            Construct a PromQL Query

                                                                                                                                                                                                                                                                                                                                                            In the Dashboard Panel, select the PromQL type to query data using PromQL.

                                                                                                                                                                                                                                                                                                                                                            • Display: Specify a meaningful display name for the legend. The text you enter replaces the metric name displayed in the legend. The default legend title is the metric name.

                                                                                                                                                                                                                                                                                                                                                            • Query: Enter a PromQL query directly. For example:

                                                                                                                                                                                                                                                                                                                                                              sum(rate(promraw_go_memstats_heap_released_bytes_total[5m])) by(kubernetes_cluster_name)
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                              Specify the following:

                                                                                                                                                                                                                                                                                                                                                              • Metrics: Search the desired metric. The field supports auto-complete. Enter the text and the rest of the text you type is predicted so you can filter the metric easily. In the example: promraw_go_memstats_heap_released_bytes_total.

                                                                                                                                                                                                                                                                                                                                                              • Segmentation: This is the process of categorizing aggregated data with labels to provide precise control over the data. Choose an appropriate value for segmenting the aggregated PromQL data. In this example, kubernetes_cluster_name.

                                                                                                                                                                                                                                                                                                                                                              The PromQL query field supports the following variables. Both the variables are replaced in the UI in real-time.

                                                                                                                                                                                                                                                                                                                                                              • $__range: A built-in variable. It represents the time range currently selected in the time navigation. In the Live mode, the value is constantly updated to reflect the new time range.

                                                                                                                                                                                                                                                                                                                                                              • $__interval: A built-in variable. It represents a time interval and is automatically configured based on the time range.

                                                                                                                                                                                                                                                                                                                                                            • Unit and Y-Axes: Specify the unit of scale and display format.

                                                                                                                                                                                                                                                                                                                                                            • Compare To: (Form-Based) Allows you to compare the current value to historical values. Not supported if Segmentation is specified.

                                                                                                                                                                                                                                                                                                                                                            • Axes: Determine scale, unit, display format, and gauge for the Y-axes.

                                                                                                                                                                                                                                                                                                                                                            • Legend: Determine the position of the legend in the Dashboard.

                                                                                                                                                                                                                                                                                                                                                            • Panel: Specify a name and add details about the panel.

                                                                                                                                                                                                                                                                                                                                                              See Create a New Panel for details.

                                                                                                                                                                                                                                                                                                                                                            Applying a Dashboard Scope to a PromQL Query

                                                                                                                                                                                                                                                                                                                                                            The dashboard scope is automatically applied only to form-based panels. To scope a panel built from a PromQL query, you must use a scope variable within the query. The variable will take the value of the referenced scope parameter, and the PromQL panel will change accordingly.

                                                                                                                                                                                                                                                                                                                                                            There are two predefined variables available:

                                                                                                                                                                                                                                                                                                                                                            • $__interval represents the time interval defined based on the time range. This will help to adapt the time range for different operations, such as rate and avg_over_time, and prevent displaying empty graphs due to the change in the granularity of the data.

                                                                                                                                                                                                                                                                                                                                                            • $__range represents the time interval defined for the dashboard. This is used to adapt operations like calculating average for a time frame selected.

                                                                                                                                                                                                                                                                                                                                                            The following examples show how to use scope variables within PromQL queries.

                                                                                                                                                                                                                                                                                                                                                            CPU Used Percent

                                                                                                                                                                                                                                                                                                                                                            The following query returns the CPU used percent for all the hosts, regardless of the scope configured at the dashboard level, with a mobile average depending on the time span defined.

                                                                                                                                                                                                                                                                                                                                                            avg_over_time(sysdig_host_cpu_used_percent[$__interval])
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            To scope this query, you must set up an appropriate scope variable. A key step is to provide a variable name that is referenced as part of the query.

                                                                                                                                                                                                                                                                                                                                                            In this example, hostname is used as the variable name. The host can then be referenced using $hostname as follows:

                                                                                                                                                                                                                                                                                                                                                            avg_over_time(sysdig_host_cpu_used_percent{host_name=$hostname}[$__interval])
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Depending on the operator specified while configuring scope values, you might need to use a different operator within the query. If you are not using the correct operator for the scope type, the system will perform the query but will show a warning as the results may not be the expected ones.

                                                                                                                                                                                                                                                                                                                                                            Scope Operator

                                                                                                                                                                                                                                                                                                                                                            PromQL Filter Operator

                                                                                                                                                                                                                                                                                                                                                            Example

                                                                                                                                                                                                                                                                                                                                                            • is foo

                                                                                                                                                                                                                                                                                                                                                            • is not foo

                                                                                                                                                                                                                                                                                                                                                            • = : Select labels that are exactly equal to the provided string.

                                                                                                                                                                                                                                                                                                                                                            • != : Select labels that are not equal to the provided string.

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_used_percent{host_name=$hostname}
                                                                                                                                                                                                                                                                                                                                                            • in foo,bar

                                                                                                                                                                                                                                                                                                                                                            • not in foo,bar

                                                                                                                                                                                                                                                                                                                                                            • =~: Select labels that regex-match the provided string.

                                                                                                                                                                                                                                                                                                                                                            • !~ : Select labels that do not regex-match the provided string.

                                                                                                                                                                                                                                                                                                                                                            sysdig_host_cpu_used_percent{host_name=~$hostname}

                                                                                                                                                                                                                                                                                                                                                            Enrich Metrics with Labels

                                                                                                                                                                                                                                                                                                                                                            Running PromQL queries in Sysdig Monitor by default returns only a minimum set of labels. To enrich the return results of PromQL queries with additional labels, such as Kubernetes cluster name, you need to use a vector matching operation. The vector matching operation in Prometheus is similar to the SQL-like join operation.

                                                                                                                                                                                                                                                                                                                                                            Info Metrics

                                                                                                                                                                                                                                                                                                                                                            Prometheus returns different information metrics that have a value of 1 with several labels. The information that the info metrics return might not be useful as it is. However, joining the labels of an info metric with a non-info metric can provide useful information, such as the value of metric X across an infrastructure/application/deployment.

                                                                                                                                                                                                                                                                                                                                                            Vector Matching Operation

                                                                                                                                                                                                                                                                                                                                                            The vector matching operation is similar to an SQL join. You use a vector matching operation to build a PromQL query that can return metrics with information from your infrastructure. Vector matching helps filter and enrich labels, usually adding information labels to the metrics you are trying to visualize.

                                                                                                                                                                                                                                                                                                                                                            See Mapping Between Classic Metrics and PromQL Metrics for a list of info metrics.

                                                                                                                                                                                                                                                                                                                                                            Example 1: Return a Metric Filtered by Cluster

                                                                                                                                                                                                                                                                                                                                                            This example shows a metric returned by an application, say myapp_guage, running on Kubernetes. The query attempts at getting an aggregated value of a cluster, by having one cluster selected in the scope. We assume that previously you have set a $cluster variable in your scope.

                                                                                                                                                                                                                                                                                                                                                            To do so, run the following query to return the myapp_guage metrics:

                                                                                                                                                                                                                                                                                                                                                            sum (myapp_gauge * on (container_id) kube_pod_container_info{cluster=$cluster})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The query performs the following operations, not necessarily in this order:

                                                                                                                                                                                                                                                                                                                                                            • The kube_pod_container_info info metrics is filtered, selecting only those timeseries and the associated cluster values you want to see. The selection is based on the cluster label.

                                                                                                                                                                                                                                                                                                                                                            • The myapp_gauge metric is matched with the kube_pod_container_info metric where the container_id label has the same value, multiplying both the values. Because the info metric has the value 1, multiplying the values doesn’t change the result. As the info metric has already been filtered by a cluster, only those values associated with the cluster will be kept.

                                                                                                                                                                                                                                                                                                                                                            • The resultant timeseries with the value of myapp_gauge are then aggregated with the sum function and the result is returned.

                                                                                                                                                                                                                                                                                                                                                            Example 2: Calculate the GC Latency

                                                                                                                                                                                                                                                                                                                                                            This example shows calculating the GC latency in a go application deployed on a specific Kubernetes namespace.

                                                                                                                                                                                                                                                                                                                                                            To calculate the GC latency, run the following query:

                                                                                                                                                                                                                                                                                                                                                            go_gc_duration_seconds * on (container_id,host_mac) group_left(pod,namespace) kube_pod_container_info{namespace=~$namespace}
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The query is performing the following operations:

                                                                                                                                                                                                                                                                                                                                                            • The kube_pod_container_info info metrics are filtered based on the namespace variable.

                                                                                                                                                                                                                                                                                                                                                            • The metrics associated with go_gc_duration_seconds is matched in a many-to-one way with the filtered kube_pod_container_info .

                                                                                                                                                                                                                                                                                                                                                              The pod and namespace labels are added from the kube_pod_container_info metric to the result. The query keeps only those metrics that have the matching container_id and host_mac labels on both sides.

                                                                                                                                                                                                                                                                                                                                                            • The values are multiplied and the resulting metrics are returned. The new metrics will only have the values associated with go_gc_duration_seconds because the info metric value is always 1.

                                                                                                                                                                                                                                                                                                                                                            You can use any Prometheus metric in the query. For example, the query above can be rewritten for a sample Apache metric as follows:

                                                                                                                                                                                                                                                                                                                                                            appinfo_apache_net_bytes * on (container_id) group_left(pod, namespace) kube_pod_container_info{namespace=~$namespace}
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Example 3: Calculate Average CPU Used Percent in AWS Hosts

                                                                                                                                                                                                                                                                                                                                                            This example shows calculating the average CPU used percent per AWS account and region, having the hosts filtered by account and region.

                                                                                                                                                                                                                                                                                                                                                            avg by(region,account_id) (sysdig_host_cpu_used_percent  * on (host_mac) group_left(region,account_id) sysdig_cloud_provider_info{account_id=~$AWS_account, region=~$AWS_region})
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The query performs the following operations:

                                                                                                                                                                                                                                                                                                                                                            • Filters the sysdig_cloud_provider_info metric based on the account_id and region labels that come from the dashboard scope as variables.

                                                                                                                                                                                                                                                                                                                                                            • Matches the sysdig_host_cpu_used_percent metrics with sysdig_cloud_provider_info. Only those metrics with the same host_mac label on both sides are extracted, adding region and account_id labels to the resulting metrics.

                                                                                                                                                                                                                                                                                                                                                            • Calculates the average of the new metrics by account_id and region.

                                                                                                                                                                                                                                                                                                                                                            Example 4: Calculate Total CPU Usage in Deployments

                                                                                                                                                                                                                                                                                                                                                            This example shows calculating the total CPU usage per deployment. The value can also be filtered by cluster, namespace, and deployment by using the dashboard scope.

                                                                                                                                                                                                                                                                                                                                                            sum by(cluster,namespace,owner_name) ((sysdig_container_cpu_cores_used * on(container_id) group_left(pod,namespace,cluster) kube_pod_container_info) * on(pod,namespace,cluster) group_left(owner_name) kube_pod_owner{owner_kind="Deployment",owner_name=~$deployment,cluster=~$cluster,namespace=~$namespace})
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            • sysdig_container_cpu_cores_used can be replaced by any metric that has the container_id label.

                                                                                                                                                                                                                                                                                                                                                            • To connect the sysdig_container_cpu_cores_used  metric with the pod, use kube``_pod_container_info and then, use kube``_pod_owner to connect the pod to other kubernetes objects.

                                                                                                                                                                                                                                                                                                                                                            The query performs the following:

                                                                                                                                                                                                                                                                                                                                                            • sysdig_container_cpu_cores_used * on(container_id) group_left(pod,namespace,cluster) kube_pod_container_info:

                                                                                                                                                                                                                                                                                                                                                              • The sysdig_container_cpu_cores_used metric value is multiplied with kube_pod_container_info (which has the value of 1), by matching container_id and by keeping the pod, namespace and cluster labels as it is.

                                                                                                                                                                                                                                                                                                                                                                _name_='sysdig_container_cpu_cores_used',container='<label>', container_id='<label>',container_type='DOCKER`,host_mac='<label>'
                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                              • The new metrics will be

                                                                                                                                                                                                                                                                                                                                                                cluster='<label>',container='<label>', container_id='<label>',container_type='DOCKER`,host_mac='<label>',namespace='<label>, pod='<label>'
                                                                                                                                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                                                                                            • The value extracted from the previous result is multiplied with kube``_pod_owner (which has the value of 1) by matching on the pod, namespace, and cluster labels and keeping the owner name from the value of kube``_pod_owner . The owner can be deployment, replicaset, service, daemonset, or statefulset object.

                                                                                                                                                                                                                                                                                                                                                              • The name of the deployment to filter upon is extracted from the kube``_pod_owner metrics.

                                                                                                                                                                                                                                                                                                                                                              • The pod, namespace, and cluster names are extracted from the kube_pod_container_info metrics.

                                                                                                                                                                                                                                                                                                                                                            • The new metrics will be:

                                                                                                                                                                                                                                                                                                                                                              cluster='<matched_label>',container='<matched_container_label>', container_id='<label>',container_type='DOCKER`,host_mac='<label>',namespace='<label>, owner_name ='<label>', pod='<label>'
                                                                                                                                                                                                                                                                                                                                                              
                                                                                                                                                                                                                                                                                                                                                            • The kube_pod_owner will have a label owner_name that is the name of the object that owns the pod. This value is extracted by filtering:

                                                                                                                                                                                                                                                                                                                                                              kube_pod_owner{owner_kind="Deployment",owner_name=~$deployment,cluster=~$cluster,namespace=~$namespace}
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                              The owner_kind provides the deployment name and the origin of owner_name , that is the dashboard scope.

                                                                                                                                                                                                                                                                                                                                                            • The sum aggregation is applied and the time series are aggregated by cluster, namespace, and deployment.

                                                                                                                                                                                                                                                                                                                                                            The following table helps understand the labels applied in each step of the query:

                                                                                                                                                                                                                                                                                                                                                            __name__

                                                                                                                                                                                                                                                                                                                                                            container_id

                                                                                                                                                                                                                                                                                                                                                            container

                                                                                                                                                                                                                                                                                                                                                            container_type

                                                                                                                                                                                                                                                                                                                                                            host_mac

                                                                                                                                                                                                                                                                                                                                                            pod

                                                                                                                                                                                                                                                                                                                                                            namespace

                                                                                                                                                                                                                                                                                                                                                            cluster

                                                                                                                                                                                                                                                                                                                                                            owner_name

                                                                                                                                                                                                                                                                                                                                                            sysdig_container_cpu_cores_used * on(container_id) group_left(pod,namespace,cluster) kube_pod_container_info)

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            (sysdig_container_cpu_cores_used * on(container_id) group_left(pod,namespace,cluster) kube_pod_container_info) * on(pod,namespace,cluster) group_left(owner_name) kube_pod_owner{owner_kind="Deployment",owner_name=~$deployment,cluster=~$cluster,namespace=~$namespace}

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            sum by(cluster,namespace,owner_name) ((sysdig_container_cpu_cores_used * on(container_id) group_left(pod,namespace,cluster) kube_pod_container_info) * on(pod,namespace,cluster) group_left(owner_name) kube_pod_owner{owner_kind="Deployment",owner_name=~$deployment,cluster=~$cluster,namespace=~$namespace})

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            No

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Yes

                                                                                                                                                                                                                                                                                                                                                            Formatting

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor supports percentages only as 0-100 values. In calculated ratios, you can skip multiplying the whole query times 100 by selecting percentage as a 0-1 value.

                                                                                                                                                                                                                                                                                                                                                            Learn More

                                                                                                                                                                                                                                                                                                                                                            6.3 -

                                                                                                                                                                                                                                                                                                                                                            Dashboard Scope

                                                                                                                                                                                                                                                                                                                                                            Dashboard and panel scope defines what data is valid for aggregation and display within the dashboard. The scope can be set at a dashboard-wide level, or overridden for individual panels, by any user type except for View Only users.

                                                                                                                                                                                                                                                                                                                                                            The current scope is displayed in the top left-hand corner of the module screen:

                                                                                                                                                                                                                                                                                                                                                            For more information on how scopes work, refer to the Grouping, Scoping, and Segmenting Metrics documentation.

                                                                                                                                                                                                                                                                                                                                                            Configure Dashboard Scope

                                                                                                                                                                                                                                                                                                                                                            To configure the scope of an existing dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. From the Dashboard module, select the relevant dashboard from the dashboard list.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Edit Scope link in the top right of the module screen:

                                                                                                                                                                                                                                                                                                                                                            3. Open the first level drop-down menu.

                                                                                                                                                                                                                                                                                                                                                            4. Select the first level label either by clicking the desired label, or searching for the label, then clicking it.

                                                                                                                                                                                                                                                                                                                                                            5. Define how the value relates to the label, and define the value.

                                                                                                                                                                                                                                                                                                                                                              Scope editor restricts the scope of the selection for subsequent filters by rendering values that are specific to the selected labels. For example, if the value of the kubernetes.namespace.name label is kube-system, the values of the subsequent label, container.name will be filtered by kube-system. This means the containers rendered for filtering are only those that are part of the kube-system namespace.

                                                                                                                                                                                                                                                                                                                                                            6. Optional: Dashboard Templating.

                                                                                                                                                                                                                                                                                                                                                              Dashboard scope values can be defined as variables, allowing users to create a template, and use one dashboard for multiple outputs. For example, in the image below, the kubernetes.namespace.name label is set as a variable:

                                                                                                                                                                                                                                                                                                                                                              Once saved, the dashboard has multiple values that can be displayed:

                                                                                                                                                                                                                                                                                                                                                            7. Optional: Add additional label/value combinations to further refine the scope.

                                                                                                                                                                                                                                                                                                                                                            8. Click the Save button to save the new scope, or click the Cancel button to revert the changes.

                                                                                                                                                                                                                                                                                                                                                              To reset the dashboard scope to the entire infrastructure, or to update an existing dashboard’s scope to the entire infrastructure, open the first scope drop-down menu and select everywhere.

                                                                                                                                                                                                                                                                                                                                                            Configure Panel Scope

                                                                                                                                                                                                                                                                                                                                                            To configure the scope of an existing dashboard panel:

                                                                                                                                                                                                                                                                                                                                                            1. From the Dashboard module, select the relevant dashboard from the dashboard list.

                                                                                                                                                                                                                                                                                                                                                            2. Hover the cursor over the desired panel, and select the Edit (pencil) icon:

                                                                                                                                                                                                                                                                                                                                                            3. Click the Override Dashboard Scope link to enable a custom panel scope.

                                                                                                                                                                                                                                                                                                                                                              To return an individual panel scope to the default dashboard scope, click the Default to Dashboard Scope link, and save the changes.

                                                                                                                                                                                                                                                                                                                                                            4. Open the Scope drop-down menu.

                                                                                                                                                                                                                                                                                                                                                            5. Either select the new scope, or search for the desired scope, and then select it.

                                                                                                                                                                                                                                                                                                                                                            6. Click the Save button to confirm the changes.

                                                                                                                                                                                                                                                                                                                                                            Panels that have a custom scope (a different scope to the overall dashboard) are marked with a shaded corner:

                                                                                                                                                                                                                                                                                                                                                            6.4 -

                                                                                                                                                                                                                                                                                                                                                            Configure Dashboards

                                                                                                                                                                                                                                                                                                                                                            There are two parts to creating a dashboard - creating the dashboard itself, and creating the panels that display the information.

                                                                                                                                                                                                                                                                                                                                                            6.4.1 -

                                                                                                                                                                                                                                                                                                                                                            Create a New Dashboard

                                                                                                                                                                                                                                                                                                                                                            To create a dashboard with the following:

                                                                                                                                                                                                                                                                                                                                                            • Using the Get Started Wizard.

                                                                                                                                                                                                                                                                                                                                                            • Using a dashboard template.

                                                                                                                                                                                                                                                                                                                                                              Dashboard templates are essentially immutable dashboards that can’t be edited, and the scope is fixed. You can copy them and customize as desired. See Dashboard Templates.

                                                                                                                                                                                                                                                                                                                                                            • Using directly the Dashboard tab. This section helps you navigate to the default Panel editor screen.

                                                                                                                                                                                                                                                                                                                                                            Get Started Wizard

                                                                                                                                                                                                                                                                                                                                                            Clicking the Create Dashboard takes you to the default panel editor screen.

                                                                                                                                                                                                                                                                                                                                                            Dashboard Tab

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboards tab, click the Add Dashboard button.

                                                                                                                                                                                                                                                                                                                                                            2. Select one of the following:

                                                                                                                                                                                                                                                                                                                                                              • From Dashboard Template: Copy from a dashboard template.

                                                                                                                                                                                                                                                                                                                                                              • Blank Dashboard: When you create a new dashboard, you are dropped into the panel editor. It is the default dashboard for the avg(avg(cpu.used.percent)) metrics.

                                                                                                                                                                                                                                                                                                                                                            3. Specify a name for the dashboard, build a query, and save.

                                                                                                                                                                                                                                                                                                                                                              For information on running queries, see the following:

                                                                                                                                                                                                                                                                                                                                                              The new dashboard will now be added to the side panel under My Dashboards and is ready for configuration.

                                                                                                                                                                                                                                                                                                                                                            6.4.2 -

                                                                                                                                                                                                                                                                                                                                                            Dashboard Templates

                                                                                                                                                                                                                                                                                                                                                            Sysdig provides a number of pre-built dashboards, designed around various supported applications, network topologies, infrastructure layouts, and services. These can be used to jump-start the dashboard building process, as templates for further configuration.

                                                                                                                                                                                                                                                                                                                                                            Templates come with a series of panels already configured, based on the information most relevant users. The example below uses the Container dashboard template:

                                                                                                                                                                                                                                                                                                                                                            The default dashboard includes number panels for CPU and Memory usage, total, in the network, and out of network bytes, and line graphs comparing in the network and out of network bytes, as well as byte usage by application/port, process, and by the host.

                                                                                                                                                                                                                                                                                                                                                            To learn more, see Dashboard Templates.

                                                                                                                                                                                                                                                                                                                                                            6.4.3 -

                                                                                                                                                                                                                                                                                                                                                            Configure Dashboard Layout

                                                                                                                                                                                                                                                                                                                                                            Configure Full Screen

                                                                                                                                                                                                                                                                                                                                                            To view the current dashboard in full-screen mode:

                                                                                                                                                                                                                                                                                                                                                            Click the Settings (three dots) icon for the dashboard, and select the Full Screen option:

                                                                                                                                                                                                                                                                                                                                                            Dashboards cannot be configured in full-screen mode. They are read-only until the full-screen mode is exited.

                                                                                                                                                                                                                                                                                                                                                            To exit full-screen mode, either press the ESC keyboard key or click the Exit (cross) icon.

                                                                                                                                                                                                                                                                                                                                                            Configure Panel Size

                                                                                                                                                                                                                                                                                                                                                            Configure Individual Panels

                                                                                                                                                                                                                                                                                                                                                            The size of individual panels can be altered by moving the mouse cursor over the bottom right corner of a panel, until the diagonal resize cursor appears, pressing and holding the left mouse button, and increasing or decreasing the size of the panel by moving the cursor while pressed. The changes can be saved by clicking the Save Layout link, or reverted by clicking the Revert Changes link.

                                                                                                                                                                                                                                                                                                                                                            Configure All Panels

                                                                                                                                                                                                                                                                                                                                                            To configure the size of every panel in the dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboards tab, select the relevant dashboard from the left-hand panel.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Settings (three dots) icon for the dashboard.

                                                                                                                                                                                                                                                                                                                                                            3. Select Layout to open the drop-down menu.

                                                                                                                                                                                                                                                                                                                                                            4. Select the desired panel size.

                                                                                                                                                                                                                                                                                                                                                            5. If the new size is correct, click the Save Layout link. Otherwise, select Revert Changes.

                                                                                                                                                                                                                                                                                                                                                              Configuring this setting overrides all custom panel sizes.

                                                                                                                                                                                                                                                                                                                                                            Move Panels

                                                                                                                                                                                                                                                                                                                                                            To move a panel to a new position in the dashboard, move the mouse cursor over the top of the panel, until the hand cursor appears. Press and hold the left mouse button, and move the panel by moving the cursor while pressing the button. The changes can be saved by clicking the Save Layout link, or reverted by clicking the Revert Changes link.

                                                                                                                                                                                                                                                                                                                                                            6.4.4 -

                                                                                                                                                                                                                                                                                                                                                            Delete a Dashboard

                                                                                                                                                                                                                                                                                                                                                            The owner or the administrator of a shared dashboard can delete it. If users duplicate that dashboard, they become the owner of the new one and are allowed to freely delete it.

                                                                                                                                                                                                                                                                                                                                                            For information on access rights, see Access Levels in Dashboard.

                                                                                                                                                                                                                                                                                                                                                            To delete an existing dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboard tab, select the relevant dashboard from the left-hand panel.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Settings (three dots) icon for the dashboard.

                                                                                                                                                                                                                                                                                                                                                            3. Select Delete Dashboard.

                                                                                                                                                                                                                                                                                                                                                            4. Click the Yes, Delete the Dashboard button to confirm the change.

                                                                                                                                                                                                                                                                                                                                                            6.5 -

                                                                                                                                                                                                                                                                                                                                                            Configure Panels

                                                                                                                                                                                                                                                                                                                                                            Learn more about types, creating, and managing panels in the following sections:

                                                                                                                                                                                                                                                                                                                                                            6.5.1 -

                                                                                                                                                                                                                                                                                                                                                            Create a New Panel

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor supports both form-based and PromQL-based queries. You simply run a query and Sysdig Monitor builds a Dashboard that you can customize according to your preferences.

                                                                                                                                                                                                                                                                                                                                                            To create a new panel, you can do one of the following:

                                                                                                                                                                                                                                                                                                                                                            • Create a new dashboard.

                                                                                                                                                                                                                                                                                                                                                              When you create a new dashboard, it opens to a pre-built panel. You can run a new query and build the dashboard.

                                                                                                                                                                                                                                                                                                                                                            • Use a dashboard template.

                                                                                                                                                                                                                                                                                                                                                              Dashboard templates are essentially immutable dashboards that can’t be edited, and the scope is fixed. You can copy them and customize as desired. See Dashboard Templates.

                                                                                                                                                                                                                                                                                                                                                            • Add a new panel to an existing dashboard.

                                                                                                                                                                                                                                                                                                                                                            To create a new panel:

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboard tab, select the relevant dashboard from the drop-down.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Add Panel icon.

                                                                                                                                                                                                                                                                                                                                                              The default panel editor opens up.

                                                                                                                                                                                                                                                                                                                                                            3. Set up the panel:

                                                                                                                                                                                                                                                                                                                                                              1. Build either a form-based query or a PromQL-based query.

                                                                                                                                                                                                                                                                                                                                                              2. Define right and left Y-axes.

                                                                                                                                                                                                                                                                                                                                                              3. Define the legend.

                                                                                                                                                                                                                                                                                                                                                              4. Set up Panel title and description.

                                                                                                                                                                                                                                                                                                                                                            4. Click Save to save the changes.

                                                                                                                                                                                                                                                                                                                                                            Building a Form-Based Query

                                                                                                                                                                                                                                                                                                                                                            Each type of visualization has different settings and the query fields are determined by the type. For demonstration purposes, this topic explains the steps to create a Line chart.

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboards tab, click Add dashboard.

                                                                                                                                                                                                                                                                                                                                                              Clicking the (+) icon opens a default panel editor.

                                                                                                                                                                                                                                                                                                                                                            2. Select a visualization type.

                                                                                                                                                                                                                                                                                                                                                              1. Click the Timechart tab and open the Select Visualization page.

                                                                                                                                                                                                                                                                                                                                                              2. Click the desired visualization type.

                                                                                                                                                                                                                                                                                                                                                              For more information on types of visualization, see Types of Panels.

                                                                                                                                                                                                                                                                                                                                                            3. Select the appropriate time presets from the time navigation.

                                                                                                                                                                                                                                                                                                                                                            4. Select a metric from the drop-down as follows:

                                                                                                                                                                                                                                                                                                                                                              You can either scroll down or type the first few letters of the metrics. As you enter the first few letters the drop-down lists the matching entries.

                                                                                                                                                                                                                                                                                                                                                            5. Specify Time Aggregation and Group Rollup.

                                                                                                                                                                                                                                                                                                                                                            6. Specify the display text in the Display field.

                                                                                                                                                                                                                                                                                                                                                              The text appears as a title for the legend:

                                                                                                                                                                                                                                                                                                                                                            7. Specify an appropriate segmentation:

                                                                                                                                                                                                                                                                                                                                                              You can enter the number of entities and the order in which they are displayed in the legend.

                                                                                                                                                                                                                                                                                                                                                              Not applicable to Number panels.

                                                                                                                                                                                                                                                                                                                                                            8. Specify the scope for the panel you are creating.

                                                                                                                                                                                                                                                                                                                                                              You can either choose to inherit the dashboard scope as it is or apply the scope to one or all the queries.

                                                                                                                                                                                                                                                                                                                                                            9. Specify the unit of scale and the display format for Y-Axis.

                                                                                                                                                                                                                                                                                                                                                              Applicable only to Timeseries panels.

                                                                                                                                                                                                                                                                                                                                                            10. Determine how to display null data on the dashboard.

                                                                                                                                                                                                                                                                                                                                                              You can display no data as a gap, a zero value, a dotted line, or a solid line in the graph. See Display Missing Data.

                                                                                                                                                                                                                                                                                                                                                            11. Optionally, compare the data against historical data.

                                                                                                                                                                                                                                                                                                                                                              When segmentation is applied, comparing metrics against historical data is not supported.

                                                                                                                                                                                                                                                                                                                                                            Building a PromQL Query

                                                                                                                                                                                                                                                                                                                                                            To run a PromQL query:

                                                                                                                                                                                                                                                                                                                                                            1. Do one of the following:

                                                                                                                                                                                                                                                                                                                                                              • Click Add Dashboard if you are creating a new dashboard.

                                                                                                                                                                                                                                                                                                                                                              • Click Add Panel if you are adding a new panel to an existing Dashboard.

                                                                                                                                                                                                                                                                                                                                                            2. Click the PromQL button.

                                                                                                                                                                                                                                                                                                                                                              The PromQL panel appears.

                                                                                                                                                                                                                                                                                                                                                            3. Enter the query in the PromQL field as follows:

                                                                                                                                                                                                                                                                                                                                                              In this example, the rate of memory heaps released in bytes in an interval of 5 minutes is calculated and then the total rate is calculated in each Kubernetes cluster.

                                                                                                                                                                                                                                                                                                                                                            4. Select the desired time window.

                                                                                                                                                                                                                                                                                                                                                            5. Specify a descriptive title for the legend and a name for the time series.

                                                                                                                                                                                                                                                                                                                                                              You can specify a variable as shown in the image. The variable name is replaced with the Kubernetes cluster names in the legend.

                                                                                                                                                                                                                                                                                                                                                            6. Specify the unit for incoming data and how it should be displayed.

                                                                                                                                                                                                                                                                                                                                                              For example, you can specify the incoming data to be gathered in kilobytes and displayed as megabytes.

                                                                                                                                                                                                                                                                                                                                                              Also, determine the location of the Y-Axis on the graph. When you have additional queries, the flexibility to place an additional Y-axis on the graph comes in handy.

                                                                                                                                                                                                                                                                                                                                                            7. Determine how to display null data on the dashboard.

                                                                                                                                                                                                                                                                                                                                                              You can display no data as a gap, a zero value, a dotted line, or a solid line in the graph. See Display Missing Data.

                                                                                                                                                                                                                                                                                                                                                            8. Click Save to save the changes.

                                                                                                                                                                                                                                                                                                                                                            6.5.2 -

                                                                                                                                                                                                                                                                                                                                                            Types of Panels

                                                                                                                                                                                                                                                                                                                                                            This topic introduces you to the types of panels in the New Dashboard.

                                                                                                                                                                                                                                                                                                                                                            6.5.2.1 -

                                                                                                                                                                                                                                                                                                                                                            Timechart Panel

                                                                                                                                                                                                                                                                                                                                                            A Timechart is a graph produced by applying statistical aggregation to a label over an interval. The X-axis of a timechart will always be time.

                                                                                                                                                                                                                                                                                                                                                            Timecharts allow you to see the change in metric value over time. The amount of data visualized on a graph is dependent on the time selection selected within the Dashboard. You can aggregate metrics from multiple sources into a single line, or graph a line per combination of segment labels.

                                                                                                                                                                                                                                                                                                                                                            Time aggregation: For example, the average value of cpu.used.percent metric is computed for each entity over 1 hour at 1-minute intervals.

                                                                                                                                                                                                                                                                                                                                                            Group Rollup: For each host.hostName the values from time aggregation are averaged over the scope and the top 10 segments are shown on the chart.

                                                                                                                                                                                                                                                                                                                                                            The only supported panel type now in time series is the Line chart.

                                                                                                                                                                                                                                                                                                                                                            Line Chart

                                                                                                                                                                                                                                                                                                                                                            The Line panel show change over time in a selected window. Time is plotted on the horizontal axis and the change that is measured is plotted on the vertical axis.

                                                                                                                                                                                                                                                                                                                                                            The image below shows the trend of resource consumption of top resource-hogging hosts in the last one hour.

                                                                                                                                                                                                                                                                                                                                                            Configure Line Chart

                                                                                                                                                                                                                                                                                                                                                            For information on configuring a chart, see Create a New Panel.

                                                                                                                                                                                                                                                                                                                                                            Stacked Area

                                                                                                                                                                                                                                                                                                                                                            An area chart is distinguished from a line chart by the addition of shading between lines.

                                                                                                                                                                                                                                                                                                                                                            For information on configuring a chart, see Create a New Panel.

                                                                                                                                                                                                                                                                                                                                                            6.5.2.2 -

                                                                                                                                                                                                                                                                                                                                                            Number Panel

                                                                                                                                                                                                                                                                                                                                                            Number panels allow you to view a single value for a given entity, along with optionally comparing the current value to historical values. Use the Number panel when the number is the most important aspect of the metric you’re trying to display, such as unique visitors to a website.

                                                                                                                                                                                                                                                                                                                                                            Do not use this panel to see a trend, rather use it when you need to see the average of a value over the given time range. This is also useful for counting entities, such as the number of nodes in a cluster.

                                                                                                                                                                                                                                                                                                                                                            For information on configuring a panel, see Create a New Panel.

                                                                                                                                                                                                                                                                                                                                                            Major Features

                                                                                                                                                                                                                                                                                                                                                            • The default preset for the Number visualization is 1 hour.

                                                                                                                                                                                                                                                                                                                                                            • The global default values for the threshold are overridable. The new value can be reset back to the global default.

                                                                                                                                                                                                                                                                                                                                                            • A comparison between two threshold values determines color-coding directions.

                                                                                                                                                                                                                                                                                                                                                            • Use the No Data Display text box under the Options menu to set a custom no data text.

                                                                                                                                                                                                                                                                                                                                                            • The Compare To functionality can be toggled between enabled and disabled.

                                                                                                                                                                                                                                                                                                                                                            • When the Compare To value is set, the preview is updated accordingly showing the comparison value and an arrow denoting the metric has increased or decreased.

                                                                                                                                                                                                                                                                                                                                                            • The unit displayed for Thresholds is determined by the query.

                                                                                                                                                                                                                                                                                                                                                            6.5.2.3 -

                                                                                                                                                                                                                                                                                                                                                            Table Panel

                                                                                                                                                                                                                                                                                                                                                            The Table panel displays metric data in tabular form. In this view, you can review metric values and their associated labels in a single view. Use Table panels for such quantitative analysis where you can see actual values instead of visual representations. Similar to a spreadsheet, you can look at a combination of metric values and their segments. This is useful when you don’t necessarily care about the change in metric over time, or want to run reports to download as CSV/JSON for offline analysis.

                                                                                                                                                                                                                                                                                                                                                            The panel displays the value returned by the metric query specified in the Query tab. The value is determined by the data source and the query. Each datapoint will have an associated raw and an option to add columns for additional metric values.

                                                                                                                                                                                                                                                                                                                                                            Configuring Table Panel

                                                                                                                                                                                                                                                                                                                                                            Major features include but not limited to :

                                                                                                                                                                                                                                                                                                                                                            • Queries

                                                                                                                                                                                                                                                                                                                                                              • The first query you build cannot be removed.

                                                                                                                                                                                                                                                                                                                                                              • With subsequent queries are built, you cannot remove all the queries except the first one.

                                                                                                                                                                                                                                                                                                                                                              • Changing the unit of the query changes the unit in the table as well.

                                                                                                                                                                                                                                                                                                                                                              • Changing the display format on the query reflects on the row values.

                                                                                                                                                                                                                                                                                                                                                            • Segmentation

                                                                                                                                                                                                                                                                                                                                                              • The segmentation label determines the column name.

                                                                                                                                                                                                                                                                                                                                                              • The segmentation in conjunction with metric values determines the values displayed on the rows.

                                                                                                                                                                                                                                                                                                                                                            • Scope

                                                                                                                                                                                                                                                                                                                                                              • The selected scope determines the values displayed on the table.
                                                                                                                                                                                                                                                                                                                                                            • Metric / Labels Columns

                                                                                                                                                                                                                                                                                                                                                              • Adding a new query insert a new column with the name of the metric as the column heading.

                                                                                                                                                                                                                                                                                                                                                              • Metric values in conjunction with segmentation determine the values displayed on the rows.

                                                                                                                                                                                                                                                                                                                                                            • Sorting

                                                                                                                                                                                                                                                                                                                                                              • Column sorting is based on the selected column header and the type of sorting (ascending and descending).

                                                                                                                                                                                                                                                                                                                                                              • When another column is sorted, the table is resorted by that column, resetting the previous sorting.

                                                                                                                                                                                                                                                                                                                                                            • Resizing

                                                                                                                                                                                                                                                                                                                                                              • Grab the header column by the borderline to resize the columns.

                                                                                                                                                                                                                                                                                                                                                              • Browser window resizes shouldn’t reset the resize of the columns if you have resized any columns.

                                                                                                                                                                                                                                                                                                                                                              • When resizing the browser window, table columns are resized to cover the full width. An exception is when you have already resized columns. In such cases, other columns that you have not resized are resized on browser window resize.

                                                                                                                                                                                                                                                                                                                                                              • The last column in the table is not resizable.

                                                                                                                                                                                                                                                                                                                                                            • Export

                                                                                                                                                                                                                                                                                                                                                              • The table by default shows a maximum of 50 rows.

                                                                                                                                                                                                                                                                                                                                                              • Clicking on Export all results… below the table opens the Export Data window.

                                                                                                                                                                                                                                                                                                                                                              • Export data in either JSON or CVS format to a file. The default name of the file is the panel name. Renaming the default filename is permissible.

                                                                                                                                                                                                                                                                                                                                                            For information on configuring a chart, see Create a New Panel.

                                                                                                                                                                                                                                                                                                                                                            6.5.2.4 -

                                                                                                                                                                                                                                                                                                                                                            Text

                                                                                                                                                                                                                                                                                                                                                            The example below uses a text panel as a reminder list of the testing steps for a procedure.

                                                                                                                                                                                                                                                                                                                                                            Text Panel Markdown

                                                                                                                                                                                                                                                                                                                                                            Headers

                                                                                                                                                                                                                                                                                                                                                            # H1
                                                                                                                                                                                                                                                                                                                                                            ## H2
                                                                                                                                                                                                                                                                                                                                                            ### H3
                                                                                                                                                                                                                                                                                                                                                            #### H4
                                                                                                                                                                                                                                                                                                                                                            ##### H5
                                                                                                                                                                                                                                                                                                                                                            ###### H6
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            H1
                                                                                                                                                                                                                                                                                                                                                            ======
                                                                                                                                                                                                                                                                                                                                                            H2
                                                                                                                                                                                                                                                                                                                                                            ------
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Emphasis

                                                                                                                                                                                                                                                                                                                                                            *italics* or _italics_
                                                                                                                                                                                                                                                                                                                                                            **bold** or __bold__
                                                                                                                                                                                                                                                                                                                                                            **combined _emphasis_**
                                                                                                                                                                                                                                                                                                                                                            ~~strikethrough~~
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Lists

                                                                                                                                                                                                                                                                                                                                                            1. First ordered list item
                                                                                                                                                                                                                                                                                                                                                            2. Second item
                                                                                                                                                                                                                                                                                                                                                              * Unordered sub-list.
                                                                                                                                                                                                                                                                                                                                                                Sub-paragraph within the list item.
                                                                                                                                                                                                                                                                                                                                                            1. Third item
                                                                                                                                                                                                                                                                                                                                                              8. First ordered sub-list item.
                                                                                                                                                                                                                                                                                                                                                            103. Fourth item
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            General guidelines:

                                                                                                                                                                                                                                                                                                                                                            • The list item number does not matter. As shown in the example below, the formatting defines the lists.

                                                                                                                                                                                                                                                                                                                                                            • List items can contain properly indented paragraphs, using white space.

                                                                                                                                                                                                                                                                                                                                                            • Unordered list can use: *, -, or +.

                                                                                                                                                                                                                                                                                                                                                            Linebreaks

                                                                                                                                                                                                                                                                                                                                                            This is the first sentence.
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            This line is separated from the one above by two newlines, so it will be a *separate paragraph*.
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            This line is also a separate paragraph.
                                                                                                                                                                                                                                                                                                                                                            This line is only separated by a single newline, so it's a separate line in the *same paragraph*.
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Trailing spaces can be used for line-breaks without creating a new paragraph. This behavior is contrary to the typical GFM line break behavior, where trailing spaces are not required.

                                                                                                                                                                                                                                                                                                                                                            6.5.2.5 -

                                                                                                                                                                                                                                                                                                                                                            Toplist

                                                                                                                                                                                                                                                                                                                                                            A Toplist chart displays the specified number of entities, such as containers, with the most or least of any metric value. This is useful for “ranking” metric values in order, for example, considering hosts that have the highest amount of pods running or the highest consumers of CPU or memory in your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            The following types of metric segmentation are supported:

                                                                                                                                                                                                                                                                                                                                                            Major Features

                                                                                                                                                                                                                                                                                                                                                            • Toplist supports executing only one query, however, adding a new one and deleting the existing one is allowed.

                                                                                                                                                                                                                                                                                                                                                            • Segmentation is not supported for multiple metrics.

                                                                                                                                                                                                                                                                                                                                                            • Text displayed on the bars in the chart is based on metrics and segmentation.

                                                                                                                                                                                                                                                                                                                                                              • If a single metric is selected, segmentation text is displayed.

                                                                                                                                                                                                                                                                                                                                                              • If multiple segmentations are selected, segmentation texts separated by > sign are displayed.

                                                                                                                                                                                                                                                                                                                                                              • If multiple metrics are selected, the metric name is displayed.

                                                                                                                                                                                                                                                                                                                                                            Segmentation

                                                                                                                                                                                                                                                                                                                                                            The following types of metric segmentation are supported:

                                                                                                                                                                                                                                                                                                                                                            Multi-Segmenting Single Metrics

                                                                                                                                                                                                                                                                                                                                                            You can use multiple objects to simultaneously segment a single metric. For example, cpu.used.percent segmented by kubernetes.cluster.name, kubernetes.namespace.name, and kubernetes.deployment.name.

                                                                                                                                                                                                                                                                                                                                                            In this example, deployments are sequentially listed in the order of resource consumption. Use Display to toggle between descending (Top) and ascending order (Bottom).

                                                                                                                                                                                                                                                                                                                                                            Multi-Metrics Without Segmentation

                                                                                                                                                                                                                                                                                                                                                            You can add multiple metrics to view the Toplist chart. However, segmentation is disabled for multi-metric selection.

                                                                                                                                                                                                                                                                                                                                                            For information on configuring a chart, see Create a New Panel.

                                                                                                                                                                                                                                                                                                                                                            6.5.2.6 -

                                                                                                                                                                                                                                                                                                                                                            Histogram

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor handles three types of Histograms:

                                                                                                                                                                                                                                                                                                                                                            • Histogram panel type on the Dashboard: Histogram panels allow you to visualize the distribution of metric values for large data collection. You should select a segmentation, and optionally, the number of buckets.

                                                                                                                                                                                                                                                                                                                                                              Use Histogram for any metric, Sysdig native or custom, counter or gauge, segmented by a dimension/label. The histogram panel helps understand value across different segments. For example, CPU usage percent by pods across your cluster gives you the aggregated value across the selected time.

                                                                                                                                                                                                                                                                                                                                                            • Legacy Prometheus histogram collection: This implementation of legacy Prometheus Histograms is deprecated in SaaS 3.2.6 release.

                                                                                                                                                                                                                                                                                                                                                              To create a Histogram, use the Prometheus integration to collect histogram metrics and use the PromQL panel with the histogram_quantile function.

                                                                                                                                                                                                                                                                                                                                                            • Prometheus histograms (collected as raw metrics): The legacy Prometheus histogram collection is replaced by the new Prometheus histogram. You can natively collect histogram metrics, and for visualization, use timechart:

                                                                                                                                                                                                                                                                                                                                                              For example, run the following query to build a timechart:

                                                                                                                                                                                                                                                                                                                                                              sum(histogram_metrics_bucket{kubernetes_cluster_name="prod"}) by (le)
                                                                                                                                                                                                                                                                                                                                                              
                                                                                                                                                                                                                                                                                                                                                              

                                                                                                                                                                                                                                                                                                                                                            6.5.3.1 -

                                                                                                                                                                                                                                                                                                                                                            Create Panel Alerts

                                                                                                                                                                                                                                                                                                                                                            Alerts can be created directly from a form-based panel in a New Dashboard. If the panel has more than one query, you must select the query to use as the base for the alert.

                                                                                                                                                                                                                                                                                                                                                            To create an alert:

                                                                                                                                                                                                                                                                                                                                                            1. Click the More Options (three dots) icon.

                                                                                                                                                                                                                                                                                                                                                            2. Select Create Alert.

                                                                                                                                                                                                                                                                                                                                                            3. Configure the alert, and click the **Create **button.

                                                                                                                                                                                                                                                                                                                                                            6.5.3.2 -

                                                                                                                                                                                                                                                                                                                                                            Export Panel Data

                                                                                                                                                                                                                                                                                                                                                            Table and Timechart panels in New Dashboard allow exporting data to a CSV or JSON file. This file could serve as a backup of your data or for programmatical use.

                                                                                                                                                                                                                                                                                                                                                            You can export data using the following:

                                                                                                                                                                                                                                                                                                                                                            • Panel menu in the New Dashboard

                                                                                                                                                                                                                                                                                                                                                            • Table panel

                                                                                                                                                                                                                                                                                                                                                            To export while creating or editing a Table panel:

                                                                                                                                                                                                                                                                                                                                                            1. Select Table from the Visualization type.

                                                                                                                                                                                                                                                                                                                                                              The panel opens to the Columns tab.

                                                                                                                                                                                                                                                                                                                                                            2. Below the table, click Export all results….

                                                                                                                                                                                                                                                                                                                                                              The Export Data window is displayed.

                                                                                                                                                                                                                                                                                                                                                            3. Select the format.

                                                                                                                                                                                                                                                                                                                                                            4. Specify a filename.

                                                                                                                                                                                                                                                                                                                                                              The default name of the file is the panel name. You can rename the file that you are about to download.

                                                                                                                                                                                                                                                                                                                                                            5. Click Export to save the data into the file.

                                                                                                                                                                                                                                                                                                                                                              Exporting might take several minutes to complete.

                                                                                                                                                                                                                                                                                                                                                            6.5.3.3 -

                                                                                                                                                                                                                                                                                                                                                            Copy Panels to a Different Dashboards

                                                                                                                                                                                                                                                                                                                                                            Copy a Single Panel

                                                                                                                                                                                                                                                                                                                                                            To copy a single panel to a different dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. From the Explore tab, select the desired drill-down view.

                                                                                                                                                                                                                                                                                                                                                            2. Hover over the desired panel, select the Settings (ellipsis) icon, and select Copy Panel.

                                                                                                                                                                                                                                                                                                                                                            3. Open the drop-down menu and select the desired dashboard, or use the text-field to search through existing dashboards.

                                                                                                                                                                                                                                                                                                                                                              To copy the panel to a new dashboard, enter a name for the new dashboard in the text-field instead.

                                                                                                                                                                                                                                                                                                                                                            4. Click the Copy and Open button to save the changes and navigate to the configured dashboard.

                                                                                                                                                                                                                                                                                                                                                            Copy All Panels

                                                                                                                                                                                                                                                                                                                                                            To copy all panels in a drill-down view to a dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. From the Explore tab, select the desired drill-down view.

                                                                                                                                                                                                                                                                                                                                                            2. Select the More Options (three dots) icon.

                                                                                                                                                                                                                                                                                                                                                            3. Select Copy to Dashboard:

                                                                                                                                                                                                                                                                                                                                                            4. Open the drop-down menu and select the desired dashboard, or use the text-field to search through existing dashboards.

                                                                                                                                                                                                                                                                                                                                                              To copy the panel to a new dashboard, enter a name for the new dashboard in the text-field instead.

                                                                                                                                                                                                                                                                                                                                                            5. Click the Copy and Open button to save the changes and navigate to the configured dashboard.

                                                                                                                                                                                                                                                                                                                                                            Create a Panel Alert

                                                                                                                                                                                                                                                                                                                                                            Alerts can be created directly from a dashboard panel:

                                                                                                                                                                                                                                                                                                                                                            1. Click the More Options (three dots) icon.

                                                                                                                                                                                                                                                                                                                                                            2. Select CreateAlert.

                                                                                                                                                                                                                                                                                                                                                            3. Configure the alert, and click the Create button.

                                                                                                                                                                                                                                                                                                                                                            6.5.3.4 -

                                                                                                                                                                                                                                                                                                                                                            Duplicate a Panel

                                                                                                                                                                                                                                                                                                                                                            Hover over the desired panel, click the Settings (ellipsis) icon, and select Duplicate Panel.

                                                                                                                                                                                                                                                                                                                                                            6.5.3.5 -

                                                                                                                                                                                                                                                                                                                                                            Delete an Existing Panel

                                                                                                                                                                                                                                                                                                                                                            To delete a panel from a dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. Hover over the desired panel, click the Settings (ellipsis) icon, and select Delete Panel.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Yes, delete panel button to confirm, or the Cancel button to keep the panel.

                                                                                                                                                                                                                                                                                                                                                            6.6 -

                                                                                                                                                                                                                                                                                                                                                            Managing Dashboards

                                                                                                                                                                                                                                                                                                                                                            This section helps you effectively use dashboards and share them with your team.

                                                                                                                                                                                                                                                                                                                                                            6.6.1 -

                                                                                                                                                                                                                                                                                                                                                            Dashboards Types

                                                                                                                                                                                                                                                                                                                                                            Dashboards are organized into the following main categories

                                                                                                                                                                                                                                                                                                                                                            • My Favourites: The dashboards marked as favorites by the current user.

                                                                                                                                                                                                                                                                                                                                                            • Shared By My Team: Dashboards created by other users in the team and shared with the current user.

                                                                                                                                                                                                                                                                                                                                                            • My Dashboards: Dashboards created by the current user.

                                                                                                                                                                                                                                                                                                                                                            • Dashboard Templates: Out-of-the-box templates that you can copy and use. A dashboard created from a template inherits the template name.

                                                                                                                                                                                                                                                                                                                                                            6.6.2 -

                                                                                                                                                                                                                                                                                                                                                            Set a Default Dashboard

                                                                                                                                                                                                                                                                                                                                                            A default dashboard can be configured by setting the default entry point for a team, unifying a team’s Sysdig Monitor experience, and allowing users to focus their immediate attention on the most relevant information for them. For more information on configuring a default entry point, refer to the Configure an Entry Page or Dashboard for a Team section of the Sysdig Platform documentation.

                                                                                                                                                                                                                                                                                                                                                            6.6.3 -

                                                                                                                                                                                                                                                                                                                                                            Display Dashboard Specific Events

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor allows users to configure dashboards to display infrastructure events relevant to a dashboard’s panels within the panels themselves. This allows users an even more in-depth view of the status of their environment. To configure how events are displayed:

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboard tab, select the relevant dashboard from the dashboard list.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Dashboard Settings (three dots) icon and select Events Display:

                                                                                                                                                                                                                                                                                                                                                            3. Enable the Show Events slider to show events in the dashboard panels.

                                                                                                                                                                                                                                                                                                                                                            4. Configure the available parameters, and click the Close button.

                                                                                                                                                                                                                                                                                                                                                            OptionDescription
                                                                                                                                                                                                                                                                                                                                                            FilterDefines specific events, or a scope of events, to display.
                                                                                                                                                                                                                                                                                                                                                            ScopeDetermines whether the range of events displayed includes those for dashboard scope or team scope.
                                                                                                                                                                                                                                                                                                                                                            SeverityDetermines whether only high severity events or all events are displayed.
                                                                                                                                                                                                                                                                                                                                                            Event TypeDetermines what types of events to be displayed. The supported events types are alert, custom events, containers, or Kubernetes.
                                                                                                                                                                                                                                                                                                                                                            StatusDetermines the state of events displayed. The supported status are Triggered, Resolved, Acknowledged, Un-acknowledged.

                                                                                                                                                                                                                                                                                                                                                            6.6.4 -

                                                                                                                                                                                                                                                                                                                                                            Sharing New Dashboards

                                                                                                                                                                                                                                                                                                                                                            Dashboards can be shared internally among team members, with other teams, within the wider organization, or publicly, by configuring a public URL for the dashboard.

                                                                                                                                                                                                                                                                                                                                                            As an owner of a dashboard, you can share the dashboard with any team and provide the Viewer or Collaborator access permission.

                                                                                                                                                                                                                                                                                                                                                            Access Levels in Dashboard

                                                                                                                                                                                                                                                                                                                                                            The RBAC-based permissions determine how users can interact with Dashboards. They establish what capabilities are allowed or denied for a user or a team. For more information on RBAC rules, see RBAC Rules for Dashboards.

                                                                                                                                                                                                                                                                                                                                                            The table below summarizes the various ways a dashboard can be shared and effective permissions for users.

                                                                                                                                                                                                                                                                                                                                                            Who can share/copyDashboard InstanceTeam/User who has accessCan ReadCan Edit
                                                                                                                                                                                                                                                                                                                                                            Share with current TeamDashboard CreatorSame dashboard instanceCurrent team members onlyAll members of the teamEdit users of the team
                                                                                                                                                                                                                                                                                                                                                            Share publicly as URLAny Edit User of the teamSame dashboard instanceAnyone with URL (does not have to by Sysdig user)AnyoneAnyone with URL (does not have to by Sysdig user) with Scope variables
                                                                                                                                                                                                                                                                                                                                                            Copy to My TeamsAny Edit User of the teamDuplicate Copy of the dashboardCurrent team members onlyAll members of the teamEdit users of the team

                                                                                                                                                                                                                                                                                                                                                            Share a Dashboard with Teams

                                                                                                                                                                                                                                                                                                                                                            Dashboards can be shared across a user’s current team or a selected set of teams, allowing other team members to view the dashboard, as well as edit the panels if they have edit permissions within the team.

                                                                                                                                                                                                                                                                                                                                                            If a dashboard has been shared with another team, a user within that team can then copy it to make it their own if they wish.

                                                                                                                                                                                                                                                                                                                                                            To share a dashboard:

                                                                                                                                                                                                                                                                                                                                                            1. Select the dashboard you want to share.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Dashboard Settings (three dots) icon and select Dashboard Settings.

                                                                                                                                                                                                                                                                                                                                                            3. In the Dashboard Settings page, use the Shared With drop-down.

                                                                                                                                                                                                                                                                                                                                                            4. Select one of the three options:

                                                                                                                                                                                                                                                                                                                                                              • Not Shared: If selected, the specified Dashboard cannot be shared with a team or selected team the owner is a member of.

                                                                                                                                                                                                                                                                                                                                                              • All Teams: If selected, the owners of the Dashboard can share with all the teams that they are part of.

                                                                                                                                                                                                                                                                                                                                                              • Selected Teams: If selected, the owner of the Dashboard can share with a selected list of teams. You can select one of the available teams in the drop-down, and select member permission:

                                                                                                                                                                                                                                                                                                                                                                • View Only: This permission allows members to view the Dashboard.

                                                                                                                                                                                                                                                                                                                                                                • Collaborator: A collaborator can edit the Dashboard.

                                                                                                                                                                                                                                                                                                                                                            Enable Public Sharing

                                                                                                                                                                                                                                                                                                                                                            Dashboards can be shared outside of the internal team by using public URLs. This allows external users to review the dashboard metrics while restricting access to changing panels and configurations.

                                                                                                                                                                                                                                                                                                                                                            The scope parameters, including scope variables, are included in the Dashboard URL. External users with a valid link can change the scope parameters without having to sign in. They can edit either on the UI or in the URL. The scope parameters are passed to the standard request header, consisting of a question mark, followed by the parameter name, an equal sign, and the parameter value. To edit a parameter in the URL, simply replace it with the desired one.

                                                                                                                                                                                                                                                                                                                                                            1. Select the dashboard you want to share.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Dashboard Settings (three dots) icon and select Dashboard Settings.

                                                                                                                                                                                                                                                                                                                                                            3. In the Dashboard Settings page, enable the Public Sharing slider.

                                                                                                                                                                                                                                                                                                                                                              When enabled, the dashboard is visible with scope parameters to anyone with the link. If this setting is disabled, the link will no longer work, and the setting will need to be re-enabled and shared again in order for the dashboard to be accessed.

                                                                                                                                                                                                                                                                                                                                                            4. Copy the public sharing URL for sharing.

                                                                                                                                                                                                                                                                                                                                                            6.6.4.1 -

                                                                                                                                                                                                                                                                                                                                                            RBAC Rules for Dashboards

                                                                                                                                                                                                                                                                                                                                                            The table below summarizes the role-based permissions.

                                                                                                                                                                                                                                                                                                                                                            Owner Permissions

                                                                                                                                                                                                                                                                                                                                                            Roles

                                                                                                                                                                                                                                                                                                                                                            Owner Permissions

                                                                                                                                                                                                                                                                                                                                                            User Roles

                                                                                                                                                                                                                                                                                                                                                            Administrator

                                                                                                                                                                                                                                                                                                                                                            A user owning a dashboard will now have three different team sharing options:

                                                                                                                                                                                                                                                                                                                                                            • Not Shared

                                                                                                                                                                                                                                                                                                                                                            • Share with all the teams that the owner is part of

                                                                                                                                                                                                                                                                                                                                                            • Share with a selected list of teams

                                                                                                                                                                                                                                                                                                                                                            For the last two options, the owner can pick the type of access: Collaborator (with edit rights) or View only.

                                                                                                                                                                                                                                                                                                                                                            Regular User (non-administrator user)

                                                                                                                                                                                                                                                                                                                                                            Team Roles

                                                                                                                                                                                                                                                                                                                                                            Advanced user

                                                                                                                                                                                                                                                                                                                                                            Standard user

                                                                                                                                                                                                                                                                                                                                                            Team manager

                                                                                                                                                                                                                                                                                                                                                            View-only user

                                                                                                                                                                                                                                                                                                                                                            Not applicable.

                                                                                                                                                                                                                                                                                                                                                            Owner Permissions

                                                                                                                                                                                                                                                                                                                                                            When a user decides to share a dashboard with a set of teams, they’ll only be able to pick teams that they are members of.

                                                                                                                                                                                                                                                                                                                                                            The table below summarizes what you can do with a shared dashboard.

                                                                                                                                                                                                                                                                                                                                                            User Permissions

                                                                                                                                                                                                                                                                                                                                                            User Permissions

                                                                                                                                                                                                                                                                                                                                                            View Only

                                                                                                                                                                                                                                                                                                                                                            Collaborator

                                                                                                                                                                                                                                                                                                                                                            User Role

                                                                                                                                                                                                                                                                                                                                                            Administrator

                                                                                                                                                                                                                                                                                                                                                            Edit

                                                                                                                                                                                                                                                                                                                                                            An admin can still edit a shared dashboard even if it's shared in view-only mode.

                                                                                                                                                                                                                                                                                                                                                            Edit

                                                                                                                                                                                                                                                                                                                                                            Regular User (non-administrator user)

                                                                                                                                                                                                                                                                                                                                                            View Only

                                                                                                                                                                                                                                                                                                                                                            Team Role

                                                                                                                                                                                                                                                                                                                                                            Advanced user

                                                                                                                                                                                                                                                                                                                                                            Advanced user

                                                                                                                                                                                                                                                                                                                                                            Team manager

                                                                                                                                                                                                                                                                                                                                                            View-only user

                                                                                                                                                                                                                                                                                                                                                            View Only

                                                                                                                                                                                                                                                                                                                                                            User Permissions

                                                                                                                                                                                                                                                                                                                                                            6.6.4.2 -

                                                                                                                                                                                                                                                                                                                                                            Transfer Dashboard Ownership

                                                                                                                                                                                                                                                                                                                                                            Dashboards have a single owner. Sysdig Monitor allows administrators and dashboard owners with administrator permissions to transfer the ownership of a dashboard within the UI.

                                                                                                                                                                                                                                                                                                                                                            There are several reasons for assigning a new owner to dashboards.

                                                                                                                                                                                                                                                                                                                                                            • The dashboard owners are no longer in control of the dashboard data.

                                                                                                                                                                                                                                                                                                                                                            • Administrators require to update the dashboard settings or fix how data is displayed.

                                                                                                                                                                                                                                                                                                                                                            General Guidelines

                                                                                                                                                                                                                                                                                                                                                            • When a user is deleted, any shared dashboards they own or have created will be preserved by default.

                                                                                                                                                                                                                                                                                                                                                            • The administrator can transfer only the dashboards that are shared by other users. Private dashboards cannot be seen and therefore cannot be transferred.

                                                                                                                                                                                                                                                                                                                                                            • Transferring ownership can only happen one dashboard at a time.

                                                                                                                                                                                                                                                                                                                                                            • When editing a user, the administrator can specify to transfer dashboards to a new owner.

                                                                                                                                                                                                                                                                                                                                                            • Before changing the dashboard ownership,

                                                                                                                                                                                                                                                                                                                                                              • It is a good practice to ensure that the new owner is part of the team the previous owner is part of. The administrator can preview the teams that will no longer be part of before confirming the transfer.

                                                                                                                                                                                                                                                                                                                                                                The new owner need not be part of any teams the previous owner was part of. In this case, the dashboard will be transferred to the new owner but will no longer be shared with any team. The dashboard will become a private dashboard.

                                                                                                                                                                                                                                                                                                                                                              • A shared dashboard will be visible only to the teams that the new owner is not part of.

                                                                                                                                                                                                                                                                                                                                                            Transfer Ownership as an Admin

                                                                                                                                                                                                                                                                                                                                                            1. Log in to the Monitor UI.

                                                                                                                                                                                                                                                                                                                                                            2. Select Settings > Users.

                                                                                                                                                                                                                                                                                                                                                            3. Select the user you want to change the ownership.

                                                                                                                                                                                                                                                                                                                                                            4. Select one or multiple Dashboards that you want to assign a new owner.

                                                                                                                                                                                                                                                                                                                                                            5. Click Transfer Ownership.

                                                                                                                                                                                                                                                                                                                                                              The Transfer Dashboard Ownership page is displayed.

                                                                                                                                                                                                                                                                                                                                                            6. Select a new user from the drop-down.

                                                                                                                                                                                                                                                                                                                                                              If the user that you selected is not part of the teams that the Dashboard is shared with, you will see a prompt stating the Dashboard will be unshared with the teams that the new owner is not part of.

                                                                                                                                                                                                                                                                                                                                                            7. If you are satisfied with the changes, click Transfer.

                                                                                                                                                                                                                                                                                                                                                            Transfer Ownership as a User

                                                                                                                                                                                                                                                                                                                                                            1. On the Dashboards tab, select the relevant dashboard from the left-hand panel.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Settings (three dots) icon for the dashboard.

                                                                                                                                                                                                                                                                                                                                                            3. Select Transfer Ownership.

                                                                                                                                                                                                                                                                                                                                                              The Transfer Dashboard Ownership page is displayed.

                                                                                                                                                                                                                                                                                                                                                            4. Select a new user from the drop-down.

                                                                                                                                                                                                                                                                                                                                                            5. If everything looks ok, click Transfer.

                                                                                                                                                                                                                                                                                                                                                              The teams indicated with cross-out text are the ones that had access to the dashboard earlier and will lose access to it after the transfer.

                                                                                                                                                                                                                                                                                                                                                              The dashboard will also be visible to all the teams that the new owner is part of. If you are not part of the teams that the new owner is a member of, you will no longer have the visibility to the dashboard.

                                                                                                                                                                                                                                                                                                                                                            6.7 -

                                                                                                                                                                                                                                                                                                                                                            Dashboard Templates

                                                                                                                                                                                                                                                                                                                                                            Sysdig provides a number of pre-defined dashboards to assist users in monitoring their environments and applications. Dashboard templates are essentially immutable dashboards that can’t be edited, and the scope is fixed. They are useful as is to get a quick overview of infrastructure, but you can use them as a template and can copy them to customize.

                                                                                                                                                                                                                                                                                                                                                            This section outlines the main dashboards that are available out-of-the-box.

                                                                                                                                                                                                                                                                                                                                                            6.7.1 -

                                                                                                                                                                                                                                                                                                                                                            Application Dashboards

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            Elasticsearch

                                                                                                                                                                                                                                                                                                                                                            This view lists eight important metrics for node and document counts, shards, indexing time and query latency.

                                                                                                                                                                                                                                                                                                                                                            • Track the node count, as this can impact query times.

                                                                                                                                                                                                                                                                                                                                                            HAProxy

                                                                                                                                                                                                                                                                                                                                                            This view reports metrics for host CPU use and proxy throughput.

                                                                                                                                                                                                                                                                                                                                                            Redis

                                                                                                                                                                                                                                                                                                                                                            This view reports seven metrics for host resource usage and application performance.

                                                                                                                                                                                                                                                                                                                                                            Cassandra By Node

                                                                                                                                                                                                                                                                                                                                                            This view shows how every node in a Cassandra cluster is performing, by mixing key system metrics with Cassandra-specific metrics such as requests volume and compactions.

                                                                                                                                                                                                                                                                                                                                                            • Use this view on a group containing the entire Cassandra cluster when you have already identified that there is a problem with a metric (using the "Cassandra Overview" view), and you need to see which node is causing the problem.

                                                                                                                                                                                                                                                                                                                                                            • Spot issues such as imbalances between the size of data held in each node, nodes going down and generating a lot of hinted handoffs, or disk bottlenecks by looking at the pending compactions.

                                                                                                                                                                                                                                                                                                                                                            Cassandra Overview

                                                                                                                                                                                                                                                                                                                                                            This view shows how a Cassandra cluster is performing, by mixing key system metrics with Cassandra-specific metrics such as requests volume and compactions.

                                                                                                                                                                                                                                                                                                                                                            • Use this view on a group containing the entire Cassandra cluster as a first starting point to troubleshoot the overall health of your database.

                                                                                                                                                                                                                                                                                                                                                            • Inspect typical system metrics to make sure the cluster is not being overloaded

                                                                                                                                                                                                                                                                                                                                                            • Correlate the information displayed with important advanced Cassandra metrics such as pending compactions or JVM metrics to identify critical problems.

                                                                                                                                                                                                                                                                                                                                                            HTTP Top Requests

                                                                                                                                                                                                                                                                                                                                                            This view details the top requested URLs to your web server, including the total number of requests, average and maximum times to service the requests, and the amount of traffic contained in the requests and responses.

                                                                                                                                                                                                                                                                                                                                                            MongoDB

                                                                                                                                                                                                                                                                                                                                                            This view shows how busy the MongoDB service is, which collections are in highest demand and which have the slowest performance.

                                                                                                                                                                                                                                                                                                                                                            • Use to spot which collections may benefit from query and index performance tuning.

                                                                                                                                                                                                                                                                                                                                                            HTTP

                                                                                                                                                                                                                                                                                                                                                            This view provides a basic understanding of the health of your web server by showing the load being put on it and the server's ability to service requests in a timely manner.

                                                                                                                                                                                                                                                                                                                                                            • Gauge the overall busyness of the server.

                                                                                                                                                                                                                                                                                                                                                            • Identify correlations between the Top URLs and Slowest URLs panels to find opportunities to increase performance.

                                                                                                                                                                                                                                                                                                                                                            MySQL/PostgreSQL

                                                                                                                                                                                                                                                                                                                                                            This view shows the overall load and performance status of your SQL database transactions with metrics for the number of requests and how quickly they are handled.

                                                                                                                                                                                                                                                                                                                                                            • Determine whether performance can be improved.

                                                                                                                                                                                                                                                                                                                                                            MySQL/PostgreSQL Top

                                                                                                                                                                                                                                                                                                                                                            This view shows the top SQL queries by displaying metrics for the number of queries received and the amount of traffic sent and received for the query.

                                                                                                                                                                                                                                                                                                                                                            • Identify the most requested, highest traffic producing or slowest processing queries.

                                                                                                                                                                                                                                                                                                                                                            6.7.2 -

                                                                                                                                                                                                                                                                                                                                                            AWS CloudWatch Dashboards

                                                                                                                                                                                                                                                                                                                                                            DashboardDescription
                                                                                                                                                                                                                                                                                                                                                            ALB OverviewDisplays information such as unhealthy host count, response time, HTTP response count, active and new connection, and so on.
                                                                                                                                                                                                                                                                                                                                                            DynamoDB OverviewProvides information such as user errors and consumed Read and Write capacity units.
                                                                                                                                                                                                                                                                                                                                                            DynamoDB Overview By OperationShows the count of HTTP operations performed on the DynamoDB.
                                                                                                                                                                                                                                                                                                                                                            EC2 OverviewDisplays CPU, disk, network operations in a selected window.
                                                                                                                                                                                                                                                                                                                                                            ECS ProjectsProvides the resource count and usage percentage in each cluster.
                                                                                                                                                                                                                                                                                                                                                            ECS OverviewHighlights the containers and services per host, request count, and highest resource consumption in containers.
                                                                                                                                                                                                                                                                                                                                                            ECS ServicesDisplay information including container and request count per services and resource usage.
                                                                                                                                                                                                                                                                                                                                                            ECS Task FamiliesDisplays container and request count per task family and resource usage.
                                                                                                                                                                                                                                                                                                                                                            ElastiCache OverviewHighlights resource usage in ElastiCache.
                                                                                                                                                                                                                                                                                                                                                            ELB OverviewHighlights resource usage in ELB.
                                                                                                                                                                                                                                                                                                                                                            RDS OverviewHighlights resource usage in RDS.
                                                                                                                                                                                                                                                                                                                                                            SQS OverviewDisplays information such as number of messages sent, received, deleted in SQS.

                                                                                                                                                                                                                                                                                                                                                            6.7.3 -

                                                                                                                                                                                                                                                                                                                                                            Capacity and Resource Management Dashboards

                                                                                                                                                                                                                                                                                                                                                            DashboardDescription
                                                                                                                                                                                                                                                                                                                                                            Available Resources CalculatorEnsure there is sufficient capacity in a cluster to deploy a new application.
                                                                                                                                                                                                                                                                                                                                                            Cluster Capacity PlanningMonitor the capacity of Kubernetes clusters ensuring they’re correctly sized to support new applications when they’re deployed.
                                                                                                                                                                                                                                                                                                                                                            Pod Scheduling TroubleshootingIf a pod cannot be scheduled due to insufficient resources, use this dashboard to identify where the resource bottleneck is.
                                                                                                                                                                                                                                                                                                                                                            Pod Rightsizing & Capacity Optimization. Optimize your infrastructure and better control cluster spend by ensuring pods are sized correctly. Understand if you can free up resources by reducing memory and/or CPU requests.

                                                                                                                                                                                                                                                                                                                                                            6.7.4 -

                                                                                                                                                                                                                                                                                                                                                            Compliance & Security Dashboards

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            Compliance (Docker)

                                                                                                                                                                                                                                                                                                                                                            Provides an overview of the available compliance metrics for Docker.

                                                                                                                                                                                                                                                                                                                                                            • Review the Docker configuration after running CIS Docker benchmark tests.

                                                                                                                                                                                                                                                                                                                                                            Compliance (Kubernetes)

                                                                                                                                                                                                                                                                                                                                                            Provides an overview of the available compliance metrics for Kubernetes

                                                                                                                                                                                                                                                                                                                                                            • Review the Kubernetes Cluster configuration after running CIS Kubernetes benchmark tests.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Secure Summary

                                                                                                                                                                                                                                                                                                                                                            The summary dashboard provides a complete overview of the Sysdig Secure environment, including the number of active agents, the number of defined policies and how many have been enabled, and summary policy event information.

                                                                                                                                                                                                                                                                                                                                                            6.7.5 -

                                                                                                                                                                                                                                                                                                                                                            Containers Dashboards

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            Container Resource Usage

                                                                                                                                                                                                                                                                                                                                                            Displays resource usage statistics, including CPU, file bytes, memory and network bytes, for containers running within the defined scope.

                                                                                                                                                                                                                                                                                                                                                            • Monitor this view to identify which containers are using disproportionate amounts of resources.

                                                                                                                                                                                                                                                                                                                                                            • Helpful in determining if an application should be moved to a more capable host.

                                                                                                                                                                                                                                                                                                                                                            • Identify which file systems are filling up or being underutilized.

                                                                                                                                                                                                                                                                                                                                                            Container File System Usage

                                                                                                                                                                                                                                                                                                                                                            This table view shows directory mount points, file system devices, and capacity and usage information for the file systems mounted on the instance. When groups are selected, metrics are averages for similar filesystem mount points.

                                                                                                                                                                                                                                                                                                                                                            Container CPU & Memory Limits

                                                                                                                                                                                                                                                                                                                                                            Shows CPU and memory limits across the environment, and the percentages currently used.

                                                                                                                                                                                                                                                                                                                                                            Container Network Traffic & Bandwidth

                                                                                                                                                                                                                                                                                                                                                            Highlights network bytes usage, connection count, errors, and queue length.

                                                                                                                                                                                                                                                                                                                                                            6.7.6 -

                                                                                                                                                                                                                                                                                                                                                            Hosts Infrastructure Dashboards

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            Host Resource Usage

                                                                                                                                                                                                                                                                                                                                                            Displays resource usage statistics, including CPU, file bytes, memory and network bytes, for hosts running within the defined scope.

                                                                                                                                                                                                                                                                                                                                                            • Use this view to identify when a host is being over or under utilized within a group of hosts with similar job functions.

                                                                                                                                                                                                                                                                                                                                                            • Identify which file systems are filling up or being underutilized.

                                                                                                                                                                                                                                                                                                                                                            Disk and File System

                                                                                                                                                                                                                                                                                                                                                            This table view shows directory mount points, file system devices, and capacity and usage information for the file systems mounted on the instance. When groups are selected, metrics are averages for similar filesystem mount points.

                                                                                                                                                                                                                                                                                                                                                            Remotely mounted file systems are not listed by default. To enable, add the remotefs = true entry to the /opt/draios/bin/dragent.properties file on each instance.

                                                                                                                                                                                                                                                                                                                                                            Memory Usage

                                                                                                                                                                                                                                                                                                                                                            Displays the memory and swap usage and page faults.

                                                                                                                                                                                                                                                                                                                                                            Network Traffic & Bandwidth

                                                                                                                                                                                                                                                                                                                                                            Provides an overview of network traffic in the host, including throughput, queue length, and errors

                                                                                                                                                                                                                                                                                                                                                            Sysdig Agent Health and Status

                                                                                                                                                                                                                                                                                                                                                            This view reports the number of Sysdig agents deployed in your environment and their versions.

                                                                                                                                                                                                                                                                                                                                                            6.7.7 -

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Dashboards

                                                                                                                                                                                                                                                                                                                                                            The Kubernetes * Health dashboards break down resource and performance metrics by various logical entities to allow for an in-depth analysis, and for critical issues to be identified and isolated. Each dashboard is built around the Golden Signals approach to monitoring: Latency, Traffic, Errors, and Saturation. Resource utilization metrics are oriented toward health and performance. These are aspects like CPU, memory, network, and storage usage by Kubernetes object. kube-state-metrics is about the status or count. Pairing kube-state-metrics with resource utilization metrics, each dashboard provides a comprehensive picture of what’s happening in your Kubernetes environment.

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Horizontal Pod Autoscaler

                                                                                                                                                                                                                                                                                                                                                            Highlights minimum, maximum, current, and desired replicas.

                                                                                                                                                                                                                                                                                                                                                            • Identify performance bottlenecks.

                                                                                                                                                                                                                                                                                                                                                            • Identify whether there are enough available pods compared to the desired pods.

                                                                                                                                                                                                                                                                                                                                                            • Use usage percentages over time to better estimate expansion capacity.

                                                                                                                                                                                                                                                                                                                                                            • Locate logical entities that are consuming too many cluster resources, or that are rapidly trending upwards towards unsustainable levels.

                                                                                                                                                                                                                                                                                                                                                            • Dive deeper into specific entities to identify the root cause of problems.

                                                                                                                                                                                                                                                                                                                                                            • Use usage percentages over time to better estimate expansion capacity.

                                                                                                                                                                                                                                                                                                                                                            For example:

                                                                                                                                                                                                                                                                                                                                                            • A deployment with no available pods indicates that the corresponding app is not serving requests. Getting a dashboard on this condition means you can visualize the metrics and spring into action to find and resolve the issue quickly.

                                                                                                                                                                                                                                                                                                                                                            • Dropping the number of pods available and remaining below the desired number indicate that your application performance is degraded or not running at the redundancy required. With these metrics represented on the dashboard, you get a quick glance of the severity of the impact on your app's user experience.

                                                                                                                                                                                                                                                                                                                                                            • A lower number of replicas running during an extended period of time than the number of replicas desired indicates a symptom of entities not working properly, such as nodes or resources unavailability, Kubernetes or Docker Engine failure, broken Docker images, and so on. No replicas for a deployment object could potentially mean that the app is down.

                                                                                                                                                                                                                                                                                                                                                            • A continuous loop of pod restart (CrashLoopBackOff) might be associated with missing dependencies or unmet requirements, or insufficient resources. In CrashLoopBackOff, pods never get into ready status and therefore are counted as unavailable and down.

                                                                                                                                                                                                                                                                                                                                                            • Use these three dashboards to provide a high-level overview of all aspects of the Kubernetes environment's performance and resource saturation status.

                                                                                                                                                                                                                                                                                                                                                            • Set high-level alerts to narrow down areas of concern, before moving to the more in-depth dashboards.

                                                                                                                                                                                                                                                                                                                                                            • Quickly identify major performance issues within each type of entity.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Resource Quota

                                                                                                                                                                                                                                                                                                                                                            Provides an overview of resource limit and request, and the number of replication controllers, services, service ports, service load balancers, configMap, and secrets.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Memory Allocation Optimization

                                                                                                                                                                                                                                                                                                                                                            Highlights Memory allocation optimization.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes CPU Allocation Optimization

                                                                                                                                                                                                                                                                                                                                                            Displays CPU utilization of your Kubernetes environment.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Cluster Overview

                                                                                                                                                                                                                                                                                                                                                            Provides an overview of your Kubernetes cluster.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes DaemonSet Overview

                                                                                                                                                                                                                                                                                                                                                            Overview of DaemonSet objects.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Deployment Overview

                                                                                                                                                                                                                                                                                                                                                            Highlights whether each deployment has a sufficient number of available pods and resources, and indicates the number of pods running, desired, or have been updated.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Job Overview

                                                                                                                                                                                                                                                                                                                                                            Overview of all the jobs and the performance information.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Namespace Overview

                                                                                                                                                                                                                                                                                                                                                            Displays metrics such as resource requests and resource limits at the namespace level; identifies the performance of the Kubernetes entities such as pods, deployments, DaemonSet, Statefulset, and jobs, and compliance with replicaSets specs. Highlights the number of services, deployments, replicaSets, and jobs per namespace.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Node Overview

                                                                                                                                                                                                                                                                                                                                                            Highlights the number of nodes that are ready, unavailable, or out of disk; the number of nodes that are under the memory, disk, or network pressure; compares allocatable capacity with requested capacity on the node; provides the number of pod resources of a node that are available for scheduling and the available capacity to serve the pods running on the nodes.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Pod Overview

                                                                                                                                                                                                                                                                                                                                                            Helps identify potential bottlenecks by graphing the number of container restarts, the number of pods waiting to be scheduled, resource utilization of containers within each pod and available capacity to serve pod requests, the number of available pods compared to the desired pods, and the number of pods in available state and ready to serve requests.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes ReplicaSet Overview

                                                                                                                                                                                                                                                                                                                                                            Provides details such as the number of pods per replicaSet, the desired number of pods per replicaSet, and pods per replicaSet that are in a ready state.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes StatefulSet Overview

                                                                                                                                                                                                                                                                                                                                                            Overview of the StatefulSet objects in your environment.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Cluster and Node Capacity

                                                                                                                                                                                                                                                                                                                                                            Highlights a comprehensive overview of the performance of the hosts or nodes that form the Kubernetes cluster, including CPU, memory, and file system usage, and network traffic.

                                                                                                                                                                                                                                                                                                                                                            Before analyzing the Dashboard, consider the following guidelines related to resource usage:

                                                                                                                                                                                                                                                                                                                                                            • If Resource Limits is undefined for a container, Kubernetes does not default to a value.

                                                                                                                                                                                                                                                                                                                                                            • if Resource Requests is unspecified for a container, Kubernetes defaults to Limits if that is explicitly specified, otherwise to an implementation-defined value. Limits do not default to any value.

                                                                                                                                                                                                                                                                                                                                                            • If both Resource Limits and Resource Requests are not specified, no matter which value had been defaulted by Kubernetes, kube-state-metric (and hence Sysdig Monitor) reports zero. Therefore, only user-defined requested are reported by the kubernetes.pod.resourceRequests.memByte metric.

                                                                                                                                                                                                                                                                                                                                                            • The memory used by a container (the value returned by memory.used.bytes) can be greater than the memory requested by a pod (the value returned by kubernetes.pod.resourceRequests.memByte). This is permissible in Kubernetes because Requests value determines the minimum amount of resources required.

                                                                                                                                                                                                                                                                                                                                                            For these reasons, it can be deduced that

                                                                                                                                                                                                                                                                                                                                                            • In some cases, the value of Used Resources will be more than that of Resource Requests and Resource Limits, and the value of Resource RequestS could be more than that of Resource Limit.

                                                                                                                                                                                                                                                                                                                                                            • The value of kubernetes.pod.resourceRequests.memByte<=memory.used.bytes<=kubernetes.pod.resourceLimits.memByte

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Health Overview

                                                                                                                                                                                                                                                                                                                                                            Provides a comprehensive overview of the performance of the entire Kubernetes environment, broken down by various logical entities and underlying resource availability and usage. This dashboard breaks down resource and performance kube-state-metrics by the logical Kubernetes entities, such as pods, namespaces, deployments, and replicaSets, containers, and so on.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Service Health

                                                                                                                                                                                                                                                                                                                                                            Displays the count, resource usage, performance, and limitations of services running in the Kuberenetes environment. The dashboard provides and overview of what resources each service is using, their response times, the container and request counts, and how the response times measure up against the resource utilization.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Workloads CPU Usage and Allocation

                                                                                                                                                                                                                                                                                                                                                            Displays resource utilization of your workloads. This dashboard helps you review the CPU usage of your workloads, making sure that the CPU is properly allocated in the Kubernetes environment. All the numbers in this dashboard are expressed in CPU cores.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Workloads Memory Usage and Allocation

                                                                                                                                                                                                                                                                                                                                                            Helps you review the memory usage of your workloads, making sure that the memory is properly allocated in the Kubernetes environment.

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Service Golden Signals

                                                                                                                                                                                                                                                                                                                                                            Highlights the latency, traffic, errors, and saturation in your Kubernetes environment.

                                                                                                                                                                                                                                                                                                                                                            6.7.8 -

                                                                                                                                                                                                                                                                                                                                                            Marathon Dashboards

                                                                                                                                                                                                                                                                                                                                                            DashboardDescription
                                                                                                                                                                                                                                                                                                                                                            ApplicationsDisplays the container count and resource usage.
                                                                                                                                                                                                                                                                                                                                                            OverviewHighlights the overall performance of marathon application. The dashboard provides container count, top resource-consuming containers and file system, request count by application, and so on.
                                                                                                                                                                                                                                                                                                                                                            GroupsDisplays the container count and resource usage in each group.

                                                                                                                                                                                                                                                                                                                                                            6.7.9 -

                                                                                                                                                                                                                                                                                                                                                            Mesos Dashboards

                                                                                                                                                                                                                                                                                                                                                            DashboardDescription
                                                                                                                                                                                                                                                                                                                                                            FrameworksHighlights container count and resource consumption.
                                                                                                                                                                                                                                                                                                                                                            OverviewProvides container count, top resource-consuming containers and file system, request count within defined scope.
                                                                                                                                                                                                                                                                                                                                                            TasksShows the resource usage and performance of Mesos tasks.

                                                                                                                                                                                                                                                                                                                                                            6.7.10 -

                                                                                                                                                                                                                                                                                                                                                            Platform Application & Troubleshooting Dashboards

                                                                                                                                                                                                                                                                                                                                                            DashboardDescription
                                                                                                                                                                                                                                                                                                                                                            Application Status&OverviewUnderstand the status of applications (workloads) running in a cluster by monitoring performance, pod health, and resource usage.
                                                                                                                                                                                                                                                                                                                                                            Pod Status&OverviewMonitor the health, resource usage, and network statistics for pods running as part of workloads.
                                                                                                                                                                                                                                                                                                                                                            Container Resource Usage&Troubleshooting Understand the performance of the different containers running in pods across your infrastructure and identify any that are behaving anomalously.
                                                                                                                                                                                                                                                                                                                                                            Node Status&OverviewMonitor the health, resource usage, and network statistics for nodes running in clusters.

                                                                                                                                                                                                                                                                                                                                                            6.7.11 -

                                                                                                                                                                                                                                                                                                                                                            Troubleshooting Dashboards

                                                                                                                                                                                                                                                                                                                                                            Dashboard

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            Process Resource

                                                                                                                                                                                                                                                                                                                                                            Highlights the resource consumption for processes (for example, httpd, java, and ntpd).

                                                                                                                                                                                                                                                                                                                                                            • Identify the top consuming processes in an environment where the same process is spawned multiple times.

                                                                                                                                                                                                                                                                                                                                                            • Monitor this view to identify which processes are using disproportionate amounts of resources.

                                                                                                                                                                                                                                                                                                                                                            • Use to spot which collections may benefit from query and index performance tuning.

                                                                                                                                                                                                                                                                                                                                                            • Identify the most requested, highest traffic producing or slowest processing queries.

                                                                                                                                                                                                                                                                                                                                                            • Determine whether performance can be improved.

                                                                                                                                                                                                                                                                                                                                                            MongoDB Troubleshooting

                                                                                                                                                                                                                                                                                                                                                            Displays the performance of the MongoDB cluster. This view shows how busy the MongoDB service is, which collections are in highest demand and which have the slowest performance

                                                                                                                                                                                                                                                                                                                                                            Network Connections Table

                                                                                                                                                                                                                                                                                                                                                            Displays a full list of the environment’s local and remote endpoints, and all network traffic resource statistics relevant to those endpoints.

                                                                                                                                                                                                                                                                                                                                                            SQL Troubleshooting

                                                                                                                                                                                                                                                                                                                                                            Shows the top SQL queries by displaying metrics for the number of queries received and the amount of traffic sent and received for the query.

                                                                                                                                                                                                                                                                                                                                                            Top Processes

                                                                                                                                                                                                                                                                                                                                                            Lists the top processes running on the Kubernetes environment. Displays resource usage statistics, including CPU, file bytes, memory, and network bytes, for the top processes running within the defined scope.

                                                                                                                                                                                                                                                                                                                                                            7 -

                                                                                                                                                                                                                                                                                                                                                            Events

                                                                                                                                                                                                                                                                                                                                                            The Sysdig Monitor Events module displays a comprehensive and unified list of events, both monitoring and security, that have occurred within the environment, as a live events feed. The feed displays events created by triggered alerts, pulled from infrastructure services, initiated by Sysdig Security such as policy and image scanning, or defined by users, and allows users to review, track, and resolve issues. Each event is enriched with rich metadata and the entire relationship within the system under purview is built when searched for events. With a unified Event stream, Sysdig Monitor eliminates the need for standalone tools for security and monitoring alerts.

                                                                                                                                                                                                                                                                                                                                                            Learn more about Sysdig Monitor Events in the following sections:

                                                                                                                                                                                                                                                                                                                                                            7.1 -

                                                                                                                                                                                                                                                                                                                                                            Event Types

                                                                                                                                                                                                                                                                                                                                                            There are three primary types of events displayed in the Sysdig Secure Events feed: alert events, infrastructure events, and custom events. Note that image scanning and security events are displayed in the Sysdig Secure interface.

                                                                                                                                                                                                                                                                                                                                                            Alert Events

                                                                                                                                                                                                                                                                                                                                                            Alert events are triggered by user-configured alerts. For more information on configuring alerts, refer to the Sysdig Monitor Alerts documentation.

                                                                                                                                                                                                                                                                                                                                                            Infrastructure Events

                                                                                                                                                                                                                                                                                                                                                            Events can be collected from supported services within the production environment. The Sysdig agent automatically discovers these services and is configured to collect event data for a select group of events by default. Additional events can be added to the list by configuring the dragent.yaml file.

                                                                                                                                                                                                                                                                                                                                                            Sysdig currently supports event monitoring for the following infrastructure services:

                                                                                                                                                                                                                                                                                                                                                            Events marked with * are enabled by default. For more information on configuring additional infrastructure events, refer to the Enable/Disable Event Data.

                                                                                                                                                                                                                                                                                                                                                            Docker Events

                                                                                                                                                                                                                                                                                                                                                            The following Docker events are supported.

                                                                                                                                                                                                                                                                                                                                                            docker:
                                                                                                                                                                                                                                                                                                                                                                container:
                                                                                                                                                                                                                                                                                                                                                                  - attach       # Container Attached      (information)
                                                                                                                                                                                                                                                                                                                                                                  - commit       # Container Committed     (information)
                                                                                                                                                                                                                                                                                                                                                                  - copy         # Container Copied        (information)
                                                                                                                                                                                                                                                                                                                                                                  - create       # Container Created       (information)
                                                                                                                                                                                                                                                                                                                                                                  - destroy      # Container Destroyed     (warning)
                                                                                                                                                                                                                                                                                                                                                                  - die          # Container Died          (warning)
                                                                                                                                                                                                                                                                                                                                                                  - exec_create  # Container Exec Created  (information)
                                                                                                                                                                                                                                                                                                                                                                  - exec_start   # Container Exec Started  (information)
                                                                                                                                                                                                                                                                                                                                                                  - export       # Container Exported      (information)
                                                                                                                                                                                                                                                                                                                                                                  - kill         # Container Killed        (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - oom          # Container Out of Memory (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - pause        # Container Paused        (information)
                                                                                                                                                                                                                                                                                                                                                                  - rename       # Container Renamed       (information)
                                                                                                                                                                                                                                                                                                                                                                  - resize       # Container Resized       (information)
                                                                                                                                                                                                                                                                                                                                                                  - restart      # Container Restarted     (warning)
                                                                                                                                                                                                                                                                                                                                                                  - start        # Container Started       (information)
                                                                                                                                                                                                                                                                                                                                                                  - stop         # Container Stopped       (information)
                                                                                                                                                                                                                                                                                                                                                                  - top          # Container Top           (information)
                                                                                                                                                                                                                                                                                                                                                                  - unpause      # Container Unpaused      (information)
                                                                                                                                                                                                                                                                                                                                                                  - update       # Container Updated       (information)
                                                                                                                                                                                                                                                                                                                                                                image:
                                                                                                                                                                                                                                                                                                                                                                  - delete # Image Deleted  (information)
                                                                                                                                                                                                                                                                                                                                                                  - import # Image Imported (information)
                                                                                                                                                                                                                                                                                                                                                                  - pull   # Image Pulled   (information)
                                                                                                                                                                                                                                                                                                                                                                  - push   # Image Pushed   (information)
                                                                                                                                                                                                                                                                                                                                                                  - tag    # Image Tagged   (information)
                                                                                                                                                                                                                                                                                                                                                                  - untag  # Image Untaged  (information)
                                                                                                                                                                                                                                                                                                                                                                volume:
                                                                                                                                                                                                                                                                                                                                                                  - create  # Volume Created    (information)
                                                                                                                                                                                                                                                                                                                                                                  - mount   # Volume Mounted    (information)
                                                                                                                                                                                                                                                                                                                                                                  - unmount # Volume Unmounted  (information)
                                                                                                                                                                                                                                                                                                                                                                  - destroy # Volume Destroyed  (information)
                                                                                                                                                                                                                                                                                                                                                                network:
                                                                                                                                                                                                                                                                                                                                                                  - create     # Network Created       (information)
                                                                                                                                                                                                                                                                                                                                                                  - connect    # Network Connected     (information)
                                                                                                                                                                                                                                                                                                                                                                  - disconnect # Network Disconnected  (information)
                                                                                                                                                                                                                                                                                                                                                                  - destroy    # Network Destroyed     (information)
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Kubernetes Events

                                                                                                                                                                                                                                                                                                                                                            The following Kubernetes events are supported.

                                                                                                                                                                                                                                                                                                                                                            kubernetes:
                                                                                                                                                                                                                                                                                                                                                                node:
                                                                                                                                                                                                                                                                                                                                                                  - TerminatedAllPods       # Terminated All Pods      (information)
                                                                                                                                                                                                                                                                                                                                                                  - RegisteredNode          # Node Registered          (information)*
                                                                                                                                                                                                                                                                                                                                                                  - RemovingNode            # Removing Node            (information)*
                                                                                                                                                                                                                                                                                                                                                                  - DeletingNode            # Deleting Node            (information)*
                                                                                                                                                                                                                                                                                                                                                                  - DeletingAllPods         # Deleting All Pods        (information)
                                                                                                                                                                                                                                                                                                                                                                  - TerminatingEvictedPod   # Terminating Evicted Pod  (information)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeReady               # Node Ready               (information)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeNotReady            # Node not Ready           (information)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeSchedulable         # Node is Schedulable      (information)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeNotSchedulable      # Node is not Schedulable  (information)*
                                                                                                                                                                                                                                                                                                                                                                  - CIDRNotAvailable        # CIDR not Available       (information)*
                                                                                                                                                                                                                                                                                                                                                                  - CIDRAssignmentFailed    # CIDR Assignment Failed   (information)*
                                                                                                                                                                                                                                                                                                                                                                  - Starting                # Starting Kubelet         (information)*
                                                                                                                                                                                                                                                                                                                                                                  - KubeletSetupFailed      # Kubelet Setup Failed     (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - FailedMount             # Volume Mount Failed      (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeSelectorMismatching # Node Selector Mismatch   (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - InsufficientFreeCPU     # Insufficient Free CPU    (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - InsufficientFreeMemory  # Insufficient Free Mem    (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - OutOfDisk               # Out of Disk              (information)*
                                                                                                                                                                                                                                                                                                                                                                  - HostNetworkNotSupported # Host Ntw not Supported   (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - NilShaper               # Undefined Shaper         (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - Rebooted                # Node Rebooted            (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeHasSufficientDisk   # Node Has Sufficient Disk (information)*
                                                                                                                                                                                                                                                                                                                                                                  - NodeOutOfDisk           # Node Out of Disk Space   (information)*
                                                                                                                                                                                                                                                                                                                                                                  - InvalidDiskCapacity     # Invalid Disk Capacity    (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - FreeDiskSpaceFailed     # Free Disk Space Failed   (warning)*
                                                                                                                                                                                                                                                                                                                                                                pod:
                                                                                                                                                                                                                                                                                                                                                                  - Pulling           # Pulling Container Image          (information)
                                                                                                                                                                                                                                                                                                                                                                  - Pulled            # Ctr Img Pulled                   (information)
                                                                                                                                                                                                                                                                                                                                                                  - Failed            # Ctr Img Pull/Create/Start Fail   (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - InspectFailed     # Ctr Img Inspect Failed           (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - ErrImageNeverPull # Ctr Img NeverPull Policy Violate (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - BackOff           # Back Off Ctr Start, Image Pull   (warning)
                                                                                                                                                                                                                                                                                                                                                                  - Created           # Container Created                (information)
                                                                                                                                                                                                                                                                                                                                                                  - Started           # Container Started                (information)
                                                                                                                                                                                                                                                                                                                                                                  - Killing           # Killing Container                (information)*
                                                                                                                                                                                                                                                                                                                                                                  - Unhealthy         # Container Unhealthy              (warning)
                                                                                                                                                                                                                                                                                                                                                                  - FailedSync        # Pod Sync Failed                  (warning)
                                                                                                                                                                                                                                                                                                                                                                  - FailedValidation  # Failed Pod Config Validation     (warning)
                                                                                                                                                                                                                                                                                                                                                                  - OutOfDisk         # Out of Disk                      (information)*
                                                                                                                                                                                                                                                                                                                                                                  - HostPortConflict  # Host/Port Conflict               (warning)*
                                                                                                                                                                                                                                                                                                                                                                replicationController:
                                                                                                                                                                                                                                                                                                                                                                  - SuccessfulCreate    # Pod Created        (information)*
                                                                                                                                                                                                                                                                                                                                                                  - FailedCreate        # Pod Create Failed  (warning)*
                                                                                                                                                                                                                                                                                                                                                                  - SuccessfulDelete    # Pod Deleted        (information)*
                                                                                                                                                                                                                                                                                                                                                                  - FailedDelete        # Pod Delete Failed  (warning)*
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Custom Events

                                                                                                                                                                                                                                                                                                                                                            Additional events can be collected by the Sysdig agent and displayed in the Events module, but require more comprehensive configuration steps. These custom events can be integrated via:

                                                                                                                                                                                                                                                                                                                                                            • The Sysdig Monitor Slackbot

                                                                                                                                                                                                                                                                                                                                                            • Python scripts (either pre-built by Sysdig or user-created)

                                                                                                                                                                                                                                                                                                                                                            • A CURL request

                                                                                                                                                                                                                                                                                                                                                            For brief sample scripts regarding configuring other custom events, refer to the Custom Events. For more information, contact Sysdig Support.

                                                                                                                                                                                                                                                                                                                                                            LogDNA Events

                                                                                                                                                                                                                                                                                                                                                            Sysdig provides the ability to view LogDNA alerts as Sysdig events.

                                                                                                                                                                                                                                                                                                                                                            If you are both a LogDNA and Sysdig Monitor user, you can send alerts from the LogDNA platform to Sysdig Monitor as Sysdig events. These events will provide a link redirecting you to the LogDNA for further investigation. Similar to other types of Sysdig Events, you can create alerts based on the LogDNA events.

                                                                                                                                                                                                                                                                                                                                                            The log data provided by LogDNA carries additional details about system health. The ability to view relevant LogDNA events in Sysdig helps you debug and monitor the health of a system efficiently.

                                                                                                                                                                                                                                                                                                                                                            For example, if the number of logs generated during a deployment is higher than expected, you get notified with your Sysdig Events feed.

                                                                                                                                                                                                                                                                                                                                                            There is no configuration required on the Sysdig Monitor side. For information on configuring LogDNA to send alerts to Sysdig Monitor, see Sysdig Alert Integration.

                                                                                                                                                                                                                                                                                                                                                            7.2 -

                                                                                                                                                                                                                                                                                                                                                            Custom Events

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor can ingest any custom event created, including code deploys, auto-scaling activities, and business level actions. These events will be automatically overlayed on charts and graphs for easy correlation of all performance data. The sections below outline the different ways custom events can be sent to Sysdig Monitor.

                                                                                                                                                                                                                                                                                                                                                            Application Integrations

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor supports event integrations with certain applications by default. The Sysdig agent will automatically discover these services and begin collecting event data from them. For more information, refer to the Events documentation.

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor Slackbot

                                                                                                                                                                                                                                                                                                                                                            Sysdigbot, the Sysdig Monitor Slackbot, allows users to post custom events directly to the Sysdig Cloud through chats with a Slack bot.

                                                                                                                                                                                                                                                                                                                                                            Prebuilt Python Script

                                                                                                                                                                                                                                                                                                                                                            The Sysdig python script provides a way to send events to Sysdig Monitor directly from the command line, using the following command structure:

                                                                                                                                                                                                                                                                                                                                                            python post_event.py SYSDIG_TOKEN NAME [-d DESCRIPTION] [-s SEVERITY] [-c SCOPE] [-t TAGS] [-h]
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            For more information, refer to the Sysdig Github repository.

                                                                                                                                                                                                                                                                                                                                                            Python Sample Client

                                                                                                                                                                                                                                                                                                                                                            The Sysdig Monitor python client acts as a wrapper around the Sysdig Monitor REST API, exposing most of the REST API functionality to provide an easy to use and install python interface. The post_event() function can be used to send events to Sysdig Monitor from any custom script. An example script is shown below:

                                                                                                                                                                                                                                                                                                                                                            import os
                                                                                                                                                                                                                                                                                                                                                            import sys
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            sys.path.insert(0, os.path.join(os.path.dirname(os.path.realpath(sys.argv[0])), '..'))
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            from sdcclient import SdcClient
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            # Parse arguments
                                                                                                                                                                                                                                                                                                                                                            sdc_token = sys.argv[1]
                                                                                                                                                                                                                                                                                                                                                            name = sys.argv[2]
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            # Instantiate the SDC client
                                                                                                                                                                                                                                                                                                                                                            sdclient = SdcClient(SDC_TOKEN)
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            # Post the event using post_event(self, name, description=None, severity=None, event_filter=None, tags=None)
                                                                                                                                                                                                                                                                                                                                                            res = sdclient.post_event(NAME)
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Curl Sample Client

                                                                                                                                                                                                                                                                                                                                                            The Sysdig Monitor REST API offers the full functionality of the Sysdig Monitor app over API, allowing custom events to be sent directly to the Sysdig Cloud over the REST API. The example below is a curl request:

                                                                                                                                                                                                                                                                                                                                                            #!/bin/bash
                                                                                                                                                                                                                                                                                                                                                            SDC_ACCESS_TOKEN='626abc7-YOUR-TOKEN-HERE-3a3ghj432'
                                                                                                                                                                                                                                                                                                                                                            ENDPOINT='app.sysdigcloud.com'
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            curl -X POST -s https://'"${ENDPOINT}"'/api/v2/events -H 'Content-Type: application/json; charset=UTF-8' -H 'Accept: application/json, text/javascript, */*; q=0.01' -H "Authorization: Bearer ${SDC_ACCESS_TOKEN}" --data-binary '{"event": {"name": "Jenkins - start wordpress deploy", "description": "deploy", "severity": "MEDIUM", "scope": "host.hostName = \"ip-10-1-1-1\" and build = \"89\""}}}'
                                                                                                                                                                                                                                                                                                                                                            sleep 5s
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            See also Enable/Disable Event Data.

                                                                                                                                                                                                                                                                                                                                                            7.3 -

                                                                                                                                                                                                                                                                                                                                                            Severity and Status

                                                                                                                                                                                                                                                                                                                                                            Event Severity

                                                                                                                                                                                                                                                                                                                                                            Event severity is broken down into four categories in the Sysdig Monitor UI, to better visualize issue priority, and allow for easier filtering practices.

                                                                                                                                                                                                                                                                                                                                                            Scripts that used the former severity values (0-7) will continue to work as expected, as the new categories are simplified groupings of those values.

                                                                                                                                                                                                                                                                                                                                                            The image below outlines the severity value breakdown:

                                                                                                                                                                                                                                                                                                                                                            Event Status

                                                                                                                                                                                                                                                                                                                                                            There are two primary event states: triggered, and resolved. In addition, there are two additional statuses available to improve filtering practices.

                                                                                                                                                                                                                                                                                                                                                            Event Status

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Triggered

                                                                                                                                                                                                                                                                                                                                                            The circumstances that triggered the event remain in place (for example, the node remains down).

                                                                                                                                                                                                                                                                                                                                                            Resolved

                                                                                                                                                                                                                                                                                                                                                            The circumstances that triggered the event are no longer in place (for example, the metric value has returned to within a normal range).

                                                                                                                                                                                                                                                                                                                                                            Acknowledged

                                                                                                                                                                                                                                                                                                                                                            Manual label to assist in further filtering the events feed.

                                                                                                                                                                                                                                                                                                                                                            The acknowledged label is a purely visual marker. It does not reflect the current state (triggered/resolved) of the event.

                                                                                                                                                                                                                                                                                                                                                            Custom events cannot be marked as acknowledged.

                                                                                                                                                                                                                                                                                                                                                            Unacknowledged

                                                                                                                                                                                                                                                                                                                                                            Manual label to assist in further filtering the events feed.

                                                                                                                                                                                                                                                                                                                                                            All events are marked as unacknowledged by default.

                                                                                                                                                                                                                                                                                                                                                            Silenced

                                                                                                                                                                                                                                                                                                                                                            List of silenced event alerts. For more information, see Silence Alert Notifications.

                                                                                                                                                                                                                                                                                                                                                            For more information on filtering the Events feed, refer to Filtering and Searching Events.

                                                                                                                                                                                                                                                                                                                                                            7.4 -

                                                                                                                                                                                                                                                                                                                                                            Event Scope

                                                                                                                                                                                                                                                                                                                                                            By default, Events feed displays events from the entire environment. However, the feed can be configured to only show events from a particular scope within that environment. The scope of the event feeds can be configured by labels.

                                                                                                                                                                                                                                                                                                                                                            Labels refer to a set of meaningful key-value pair (whitelist) that is defined by Sysdig Monitor. As a user, you have the ability to configure the whitelist. For example, if you are using ECS and have custom container labels you have defined, you have the ability to configure the whitelist and add the labels you need. Once done, all the infrastructure events related to containers are enriched with these labels and the event scope will display associated metadata.

                                                                                                                                                                                                                                                                                                                                                            For more information on scoping, refer to the Grouping, Scoping, and Segmenting Metrics documentation.

                                                                                                                                                                                                                                                                                                                                                            Configure Event Scope

                                                                                                                                                                                                                                                                                                                                                            To configure the events feed scope:

                                                                                                                                                                                                                                                                                                                                                            1. From the Events module, click the Edit Scope link.

                                                                                                                                                                                                                                                                                                                                                            2. Open the top-level drop-down menu.

                                                                                                                                                                                                                                                                                                                                                            3. Select the desired label, either by scrolling through the list, or by typing the name/partial name into the search bar, and selecting it.

                                                                                                                                                                                                                                                                                                                                                            4. Open the Operator drop-down menu, and select the relevant option.

                                                                                                                                                                                                                                                                                                                                                            5. Open the Value drop-down menu, and select the relevant options.

                                                                                                                                                                                                                                                                                                                                                            6. Optional: Open the next level drop-down menu, and repeat steps 3-5.

                                                                                                                                                                                                                                                                                                                                                            7. Optional: Repeat step 6 for each additional layer of scope required.

                                                                                                                                                                                                                                                                                                                                                              Individual layers of the scope can be removed if necessary, by clicking the Delete (x) icon beside the relevant layer.

                                                                                                                                                                                                                                                                                                                                                            8. Click the Apply button to save the new scope.

                                                                                                                                                                                                                                                                                                                                                            Filter Events by Scope

                                                                                                                                                                                                                                                                                                                                                            Events are by default filtered by scope in Dashboards and Explore to show the most relevant events associated with the selected scope. This capability enables you to quickly narrow down the potential problems in the area under purview. However, you can turn the filtering off and see Events from the complete scope. To do so in Explore:

                                                                                                                                                                                                                                                                                                                                                            1. On the Explore module, click the Options (three dots) icon and select Events.

                                                                                                                                                                                                                                                                                                                                                              Event Scope Editor

                                                                                                                                                                                                                                                                                                                                                              The Events panel appears. you can do the following:

                                                                                                                                                                                                                                                                                                                                                              • Determine whether to show events or not.

                                                                                                                                                                                                                                                                                                                                                              • Determine the maximum number of events to be displayed in the Explore table.

                                                                                                                                                                                                                                                                                                                                                              • Filter events by

                                                                                                                                                                                                                                                                                                                                                                • Type: The types of events supported are custom events and alerts. See Event Types for more information.

                                                                                                                                                                                                                                                                                                                                                                • State: The types of events supported are triggered and resolved. See Severity and Status for more information.

                                                                                                                                                                                                                                                                                                                                                                • Severity: The supported severity levels are all severity types, high severity, and both high and medium levels. See Severity and Status for more information.

                                                                                                                                                                                                                                                                                                                                                                • Resolution: The supported resolutions are both acknowledged and unacknowledged, acknowledged only, and unacknowledged only. See Severity and Status for more information.

                                                                                                                                                                                                                                                                                                                                                              • Determine whether to show events by scope. Use the toggle button to turn off filtering by scope.

                                                                                                                                                                                                                                                                                                                                                                If you disable this option, the Explore table will show feed for all the events in the infrastructure, including those that are irrelevant to the selected scope. Leave the Filter events by selected scope option enabled to see only the relevant events.

                                                                                                                                                                                                                                                                                                                                                            2. Click Save.

                                                                                                                                                                                                                                                                                                                                                              Similarly, you can turn off filtering events by scope in Dashboards.

                                                                                                                                                                                                                                                                                                                                                            Reset the Environment Scope

                                                                                                                                                                                                                                                                                                                                                            To reset the scope to the entire environment:

                                                                                                                                                                                                                                                                                                                                                            1. From the Events module, click the Edit Scope link.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Clear All link.

                                                                                                                                                                                                                                                                                                                                                            3. Click the Apply button to save the changes.

                                                                                                                                                                                                                                                                                                                                                            7.5 -

                                                                                                                                                                                                                                                                                                                                                            Configure Event Alerts

                                                                                                                                                                                                                                                                                                                                                            Event alerts can be created (for custom events) and configured (for alert events, and custom events with a previously created alert) from the Event Details panel:

                                                                                                                                                                                                                                                                                                                                                            1. From the Events module, select the event from the feed to open the Event Details panel.

                                                                                                                                                                                                                                                                                                                                                            2. Open the Configure Alert panel:

                                                                                                                                                                                                                                                                                                                                                              1. For existing alerts, click the Edit Alert link.

                                                                                                                                                                                                                                                                                                                                                              2. For new alerts, click the Create Alert from Event button.

                                                                                                                                                                                                                                                                                                                                                            3. Configure the alert as necessary. For more information on configuring alerts, refer to the Alerts documentation.

                                                                                                                                                                                                                                                                                                                                                              New alerts will be auto-filled with information from the custom event.

                                                                                                                                                                                                                                                                                                                                                            4. Click the Create button for new alerts, or the Save button for existing alerts.

                                                                                                                                                                                                                                                                                                                                                            7.6 -

                                                                                                                                                                                                                                                                                                                                                            Filtering and Searching Events

                                                                                                                                                                                                                                                                                                                                                            Filter Events

                                                                                                                                                                                                                                                                                                                                                            The events feed can be filtered in multiple ways, to drill-down into the environment’s history and refine the events displayed. The feed can be filtered by severity, type, and/or status. Examples of each are shown below.

                                                                                                                                                                                                                                                                                                                                                            The example below shows only high and medium severity events:

                                                                                                                                                                                                                                                                                                                                                            The example below shows only Kubernetes events:

                                                                                                                                                                                                                                                                                                                                                            The example below shows only events that are Unacknowledged:

                                                                                                                                                                                                                                                                                                                                                            The Acknowledged label is a purely visual marker, and does not reflect the current state (triggered/resolved) of the event. By default, all events are Unacknowledged.

                                                                                                                                                                                                                                                                                                                                                            The example below shows medium severity Alert events that remain Triggered, but have been acknowledged:

                                                                                                                                                                                                                                                                                                                                                            Search for an Event

                                                                                                                                                                                                                                                                                                                                                            The event feeds can be searched by using the search icon in the top bar:

                                                                                                                                                                                                                                                                                                                                                            7.7 -

                                                                                                                                                                                                                                                                                                                                                            Review Events

                                                                                                                                                                                                                                                                                                                                                            Events can be reviewed in detail by clicking on the event listing in the feed:

                                                                                                                                                                                                                                                                                                                                                            To review the environment at the time of the event in detail, click the Explore button to navigate to the Explore module. The Explore module will automatically drill-down to the impacted environment objects.

                                                                                                                                                                                                                                                                                                                                                            The Event Details Panel

                                                                                                                                                                                                                                                                                                                                                            The Event Details panel contains detailed information about the event. This information is different, depending on whether the event is an Alert event or a Custom event.

                                                                                                                                                                                                                                                                                                                                                            Alert Events

                                                                                                                                                                                                                                                                                                                                                            The example below is of an Alert event:

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Event IDThe unique ID of the event.
                                                                                                                                                                                                                                                                                                                                                            SeverityThe severity of the event (High, Medium, Low, Info).
                                                                                                                                                                                                                                                                                                                                                            StateThe current state of the event (Triggered, Resolved)
                                                                                                                                                                                                                                                                                                                                                            DurationThe length of time the event lasted.
                                                                                                                                                                                                                                                                                                                                                            AcknowledgedWhether the event has been acknowledged or not.
                                                                                                                                                                                                                                                                                                                                                            TriggerThe cause of the event (for example, the metric that exceeded the defined range, and the value it reached).
                                                                                                                                                                                                                                                                                                                                                            EntityThe entity on which the event occurred.
                                                                                                                                                                                                                                                                                                                                                            Start TimeThe date and time the event started.
                                                                                                                                                                                                                                                                                                                                                            End TimeThe date and time the event ended.
                                                                                                                                                                                                                                                                                                                                                            Alert NameThe name of the alert that was triggered.
                                                                                                                                                                                                                                                                                                                                                            TypeThe type of alert.
                                                                                                                                                                                                                                                                                                                                                            MetricsThe metric/s that were affected.
                                                                                                                                                                                                                                                                                                                                                            Trigger ConditionThe condition that was met to trigger the alert.
                                                                                                                                                                                                                                                                                                                                                            ScopeThe scope of the alert.
                                                                                                                                                                                                                                                                                                                                                            SegmentThe segmentation applied to the alert.

                                                                                                                                                                                                                                                                                                                                                            To configure the alert that created the event, click the Edit Alert link in the Event Details panel. For more information about alerts, refer to the Alerts documentation.

                                                                                                                                                                                                                                                                                                                                                            Security Events

                                                                                                                                                                                                                                                                                                                                                            Policy

                                                                                                                                                                                                                                                                                                                                                            The example shows an event notifying a potentially unauthorized terminal shell in a container. For more information on Policy alerts, see Secure Events.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Event IDThe unique ID of the event.
                                                                                                                                                                                                                                                                                                                                                            SeverityThe severity of the event (High, Medium, Low, Info).
                                                                                                                                                                                                                                                                                                                                                            Date / TimeThe date and time the event occurred.
                                                                                                                                                                                                                                                                                                                                                            HostThe hostname and physical address (MAC)
                                                                                                                                                                                                                                                                                                                                                            ContainerThe container name, unique identifier, and image.
                                                                                                                                                                                                                                                                                                                                                            SummaryA detailed description of what occurred.

                                                                                                                                                                                                                                                                                                                                                            Scanning

                                                                                                                                                                                                                                                                                                                                                            The example is a high severity event alerting a change in the scan result of an elasticsearch image on Quay. For more information on Scanning, see Scanning Alerts.

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Event IDThe unique ID of the event.
                                                                                                                                                                                                                                                                                                                                                            SeverityThe severity of the event (High, Medium, Low, Info).
                                                                                                                                                                                                                                                                                                                                                            Date / TimeThe date and time the event occurred.
                                                                                                                                                                                                                                                                                                                                                            Image RegistryThe repository where the image resides (for example, Quay).
                                                                                                                                                                                                                                                                                                                                                            TagThe image name associated with the image.
                                                                                                                                                                                                                                                                                                                                                            Image IDThe unique identifier of the image.
                                                                                                                                                                                                                                                                                                                                                            DigestA content-addressable identifier which contains the SHA256 hash of the image’s JSON configuration object.

                                                                                                                                                                                                                                                                                                                                                            Infrastructure and Custom Events

                                                                                                                                                                                                                                                                                                                                                            Infrastructure and custom events display the same set of information in the Event Details panel. The example below is a Docker event:

                                                                                                                                                                                                                                                                                                                                                            MetadataDescription
                                                                                                                                                                                                                                                                                                                                                            Event IDThe unique ID of the event.
                                                                                                                                                                                                                                                                                                                                                            SeverityThe severity of the event (High, Medium, Low, Info).
                                                                                                                                                                                                                                                                                                                                                            Date / TimeThe date and time the event occurred.
                                                                                                                                                                                                                                                                                                                                                            SourceThe source of the event (for example, Docker).
                                                                                                                                                                                                                                                                                                                                                            ScopeThe scope of the event.
                                                                                                                                                                                                                                                                                                                                                            DescriptionA detailed description of what occurred.

                                                                                                                                                                                                                                                                                                                                                            8 -

                                                                                                                                                                                                                                                                                                                                                            Alerts

                                                                                                                                                                                                                                                                                                                                                            Alert is the responsive component of Sysdig Monitor. Alerts notify you when an event/issue occurs that requires attention. Events and issues are identified based on changes in the metric values collected by Sysdig Monitor. The Alerts module displays out-of-the-box alerts and a wizard for creating and editing alerts as needed.

                                                                                                                                                                                                                                                                                                                                                            About Sysdig Alert

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor can generate notifications based on certain conditions or events you configure. Using the alert feature, you can keep a tab on your infrastructure and find out about problems as they happen, or even before they happen with the alert conditions you define. In Sysdig Monitor, metrics serve as the central configuration artifact for alerts. A metric ties one or more conditions or events to the measures to take when the condition is met, or an event happens. Alerts work across Sysdig modules including Explore, Dashboard, Events, and Overview.

                                                                                                                                                                                                                                                                                                                                                            Alert Types

                                                                                                                                                                                                                                                                                                                                                            The types of alerts available in Sysdig Monitor:

                                                                                                                                                                                                                                                                                                                                                            • Downtime: Monitor any type of entity, such as a host, a container, or a process, and alert when the entity goes down.

                                                                                                                                                                                                                                                                                                                                                            • Metric: Monitor time-series metrics, and alert if they violate user-defined thresholds.

                                                                                                                                                                                                                                                                                                                                                            • PromQL: Monitor metrics through a PromQL query.

                                                                                                                                                                                                                                                                                                                                                            • Event: Monitor occurrences of specific events, and alert if the total number of occurrences violates a threshold. Useful for alerting on container, orchestration, and service events like restarts and unauthorized access.

                                                                                                                                                                                                                                                                                                                                                            • Anomaly Detection: Monitor hosts based on their historical behaviors, and alert when they deviate from the expected pattern.

                                                                                                                                                                                                                                                                                                                                                            • Group Outlier: Monitor a group of hosts and be notified when one acts differently from the rest. Group Outlier Alert is supported only on hosts.

                                                                                                                                                                                                                                                                                                                                                            • Alert Library: Sysdig Monitor provides a set of alerts by default. Use it as it is or as a template to create your own.

                                                                                                                                                                                                                                                                                                                                                            • Sysdig API: Use Sysdig’s Python client to create, list, delete, update and restore alerts. See examples.

                                                                                                                                                                                                                                                                                                                                                            Guidelines for Creating Alerts

                                                                                                                                                                                                                                                                                                                                                            Steps

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            Decide What to monitor

                                                                                                                                                                                                                                                                                                                                                            Determine what type of problem you want to be alerted on. See Alert Types to choose a type of problem.

                                                                                                                                                                                                                                                                                                                                                            Define how it will be monitored

                                                                                                                                                                                                                                                                                                                                                            Specify exactly what behavior triggers a violation. For example, Marathon App is down on the Kubernetes Cluster named Production for ten minutes.

                                                                                                                                                                                                                                                                                                                                                            Decide Where to monitor

                                                                                                                                                                                                                                                                                                                                                            Narrow down your environment to receive fine-tuned results. Use Scope to choose an entity that you want to keep a close watch on. Specify additional segments (entities) to give context to the problem. For example, in addition to specifying a Kubernetes cluster, add a namespace and deployment to refine your scope.

                                                                                                                                                                                                                                                                                                                                                            Define when to notify

                                                                                                                                                                                                                                                                                                                                                            Define the threshold and time window for assessing the alert condition.

                                                                                                                                                                                                                                                                                                                                                            Single Alert fires an alert for your entire scope, while Multiple Alert fires if any or every segment breach the threshold at once.

                                                                                                                                                                                                                                                                                                                                                            Multiple Alerts include all the segments you specified to uniquely identify the location and thus provides a full qualification of where the problem occurred. The higher the number of segments the easier to uniquely identify the affected entities.

                                                                                                                                                                                                                                                                                                                                                            A good analogy for multiple alerts is alerting on cities. For example, creating multiple alerts on San Francisco would trigger an alert which will include information such as the country that it is part of is the USA and the continent is North America.

                                                                                                                                                                                                                                                                                                                                                            Trigger gives you control over how notifications are created. For example, you may want to receive a notification for every violation, or want only a single notification for a series of consecutive violations.

                                                                                                                                                                                                                                                                                                                                                            Decide how notifications are sent

                                                                                                                                                                                                                                                                                                                                                            Alert supports customizable notification channels, including email, mobile push notifications, OpsGenie, Slack, and more. To see supported services, see Set Up Notification Channels.

                                                                                                                                                                                                                                                                                                                                                            To create alerts, simply:

                                                                                                                                                                                                                                                                                                                                                            1. Choose an alert type.

                                                                                                                                                                                                                                                                                                                                                            2. Configure alert parameters.

                                                                                                                                                                                                                                                                                                                                                            3. Configure the notification channels you want to use for alert notification.

                                                                                                                                                                                                                                                                                                                                                            Sysdig sometimes deprecates outdated metrics. Alerts that use these metrics will not be modified or disabled, but will no longer be updated. See Heuristic and Deprecated Metrics.

                                                                                                                                                                                                                                                                                                                                                            Configure Alerts

                                                                                                                                                                                                                                                                                                                                                            Use the Alert wizard to create or edit alerts.

                                                                                                                                                                                                                                                                                                                                                            Open the Alert Wizard

                                                                                                                                                                                                                                                                                                                                                            There are multiple ways to access the Alert wizard:

                                                                                                                                                                                                                                                                                                                                                            From Explore

                                                                                                                                                                                                                                                                                                                                                            Do one of the following:

                                                                                                                                                                                                                                                                                                                                                            • Select New Alertbeside an entity:

                                                                                                                                                                                                                                                                                                                                                            • Click More Options (three dots), and select Create a new alert.

                                                                                                                                                                                                                                                                                                                                                            From Dashboards

                                                                                                                                                                                                                                                                                                                                                            Click the More Options (three dots) icon for a panel, and select Create Alert.

                                                                                                                                                                                                                                                                                                                                                            From Alerts

                                                                                                                                                                                                                                                                                                                                                            Do one of the following:

                                                                                                                                                                                                                                                                                                                                                            • Click Add Alerts.

                                                                                                                                                                                                                                                                                                                                                            • Select an existing alert and click Edit.

                                                                                                                                                                                                                                                                                                                                                            From Overview

                                                                                                                                                                                                                                                                                                                                                            From the Events panel on the Overview screen, select a custom or an Infrastructure type event. From the event description screen, click Create Alert from Event.

                                                                                                                                                                                                                                                                                                                                                            Create an Alert

                                                                                                                                                                                                                                                                                                                                                            Configure notification channels before you begin, so the channels are available to assign to the alert. Optionally, you can add a custom subject and body information into individual alert notifications.

                                                                                                                                                                                                                                                                                                                                                            Enter Basic Alert Information

                                                                                                                                                                                                                                                                                                                                                            Configuration slightly defers for each Alert type. See respective pages to learn more. This section covers general instructions to help you acquainted with and navigate the Alerts user interface.

                                                                                                                                                                                                                                                                                                                                                            To configure an alert, open the Alert wizard and set the following parameters:

                                                                                                                                                                                                                                                                                                                                                            • Create the alert:

                                                                                                                                                                                                                                                                                                                                                              • Type: Select the desired Alert Types.

                                                                                                                                                                                                                                                                                                                                                                Each type has different parameters, but they follow the same pattern:

                                                                                                                                                                                                                                                                                                                                                                • Name: Specify a meaningful name that can uniquely represent the Alert that you are creating. For example, the entity that an alert targets, such as Production Cluster Failed Scheduling pods.

                                                                                                                                                                                                                                                                                                                                                                • Group (optional): Specify a meaningful group name for the alert you are creating. Group name helps you narrow down the problem area and focus on the infrastructure view that needs your attention. For example, you can enter Redis for alerts related to Redis services. When the alert triggers you will know which service in your workload requires inspection. Alerts that have no group name will be added to the Default Group. Group name is editable. Edit the alert to do so.

                                                                                                                                                                                                                                                                                                                                                                  An alert can belong to only one group. An alert created from an alert template will have the group already configured by the Monitor Integrations. You can see the existing alert groups on the Alerts details page.

                                                                                                                                                                                                                                                                                                                                                                  See Groupings for more information on how Sysdig handles infrastructure views.

                                                                                                                                                                                                                                                                                                                                                                • Description (optional): Briefly expand on the alert name or alert condition to give additional context for the recipient.

                                                                                                                                                                                                                                                                                                                                                                • Priority: Select a priority. High, Medium, Low, and Info. You can later sort by the severity by using the top navigation pane.

                                                                                                                                                                                                                                                                                                                                                                • Specify the parameters in the Define, Notify, and Act sections.

                                                                                                                                                                                                                                                                                                                                                            • Define:

                                                                                                                                                                                                                                                                                                                                                              Based on the alert type, define the parameters.

                                                                                                                                                                                                                                                                                                                                                              • Downtime: Select the entity to monitor. For more information, see Downtime Alert.

                                                                                                                                                                                                                                                                                                                                                              • Metric: Select a metric that this alert will monitor. You also define how the data is aggregated, such as average, maximum, minimum, or sum. Metrics are applied to a group of items (group aggregation). For more information, see Metric Alerts.

                                                                                                                                                                                                                                                                                                                                                              • PromQL: Enter the PromQL query and duration. For more information, see PromQL Alerts.

                                                                                                                                                                                                                                                                                                                                                              • Event: Filter the custom event to be alerted on by using the name, tag, description, and a source tag. For more information, see Event Alerts

                                                                                                                                                                                                                                                                                                                                                              • Anomaly Detection: Specify the metrics to be monitored for anomalies. For more information, see Anomaly Detection Alerts.

                                                                                                                                                                                                                                                                                                                                                              • Group Outlier: Specify the metrics to be monitored for outliers. For more information, see Group Outlier Alerts.

                                                                                                                                                                                                                                                                                                                                                            To alert on multiple metrics using boolean logic, click Create multi-condition alerts. See Multi-Condition Alerts.

                                                                                                                                                                                                                                                                                                                                                            • Scope: Everywhere, or a more limited scope to filter a specific component of the infrastructure monitored, such as a Kubernetes deployment, a Sysdig Agent, or a specific service.

                                                                                                                                                                                                                                                                                                                                                            • Trigger: Boundaries for assessing the alert condition, and whether to send a single alert or multiple alerts. Supported time scales are minute, hour, or day.

                                                                                                                                                                                                                                                                                                                                                              • Single alert: Single Alert fires an alert for your entire scope.

                                                                                                                                                                                                                                                                                                                                                              • Multiple alerts: Multiple Alert fires if any or every segment breaches the threshold at once.

                                                                                                                                                                                                                                                                                                                                                                Multiple alerts are triggered for each segment you specify. The specified segments will be represented in alerts. The higher the number of segments the easier to uniquely identify the affected entities.

                                                                                                                                                                                                                                                                                                                                                            For detailed description, see respective sections on Alert Types.

                                                                                                                                                                                                                                                                                                                                                            • (2) Notify

                                                                                                                                                                                                                                                                                                                                                              • Notification Channel: Select from the configured notification channels in the list. Supported channels are:

                                                                                                                                                                                                                                                                                                                                                                • Email

                                                                                                                                                                                                                                                                                                                                                                • Slack

                                                                                                                                                                                                                                                                                                                                                                • Amazon SNS Topic

                                                                                                                                                                                                                                                                                                                                                                • Opsgenie

                                                                                                                                                                                                                                                                                                                                                                • Pagerduty

                                                                                                                                                                                                                                                                                                                                                                • VictorOps

                                                                                                                                                                                                                                                                                                                                                                • Webhook

                                                                                                                                                                                                                                                                                                                                                                You can view the list of notification channels configured for each alert on the Alerts page.

                                                                                                                                                                                                                                                                                                                                                              • Notification Options: Set the time interval at which multiple alerts should be sent.

                                                                                                                                                                                                                                                                                                                                                              • Format Message: If applicable, add message format details. See Customize Notifications.

                                                                                                                                                                                                                                                                                                                                                            • (3) Act

                                                                                                                                                                                                                                                                                                                                                              • (Optional): Configure a Sysdig capture. See also Captures.

                                                                                                                                                                                                                                                                                                                                                                Sysdig capture files are not available for Event Alerts.

                                                                                                                                                                                                                                                                                                                                                            • Click Create.

                                                                                                                                                                                                                                                                                                                                                            Optional: Customize Notifications

                                                                                                                                                                                                                                                                                                                                                            You can optionally customize individual notifications to provide context for the errors that triggered the alert. All the notification channels support this added contextual information and customization flexibility.

                                                                                                                                                                                                                                                                                                                                                            Modify the subject, body, or both of the alert notification with the following:

                                                                                                                                                                                                                                                                                                                                                            • Plaintext: A custom message stating the problem. For example, Stalled Deployment.

                                                                                                                                                                                                                                                                                                                                                            • Hyperlink: For example, URL to a Dashboard.

                                                                                                                                                                                                                                                                                                                                                            • Dynamic Variable: For example, a hostname. Note the conventions:

                                                                                                                                                                                                                                                                                                                                                              • All variables that you insert must be enclosed in double curly braces, such as {{file_mount}}.

                                                                                                                                                                                                                                                                                                                                                              • Variables are case sensitive.

                                                                                                                                                                                                                                                                                                                                                              • The variables should correspond to the segment values you created the alert for. For example, if an alert is segmented byhost.hostName andcontainer.name, the corresponding variables will be{{host.hostName}}and {{container.name}} respectively. In addition to these segment variables, __alert_name__  and __alert_status__ are supported. No other segment variables are allowed in the notification subject and body.

                                                                                                                                                                                                                                                                                                                                                              • Notification subjects will not show up on the Event feed.

                                                                                                                                                                                                                                                                                                                                                              • Using a variable that is not a part of the segment will trigger an error.

                                                                                                                                                                                                                                                                                                                                                              • The segment variables used in an alert are turned to the current system values upon sending the alert.

                                                                                                                                                                                                                                                                                                                                                            The body of the notification message contains a Default Alert Template. It is the default alert notification generated by Sysdig Monitor. You may add free text, variables, or hyperlinks before and after the template.

                                                                                                                                                                                                                                                                                                                                                            You can send a customized alert notification to the following channels:

                                                                                                                                                                                                                                                                                                                                                            • Email

                                                                                                                                                                                                                                                                                                                                                            • Slack

                                                                                                                                                                                                                                                                                                                                                            • Amazon SNS Topic

                                                                                                                                                                                                                                                                                                                                                            • Opsgenie

                                                                                                                                                                                                                                                                                                                                                            • Pagerduty

                                                                                                                                                                                                                                                                                                                                                            • VictorOps

                                                                                                                                                                                                                                                                                                                                                            • Webhook

                                                                                                                                                                                                                                                                                                                                                            Multi-Condition Alerts

                                                                                                                                                                                                                                                                                                                                                            Multi-condition alerts are advanced alert threshold created on complex conditions. To do so, you define alert thresholds as custom boolean expressions that can involve multiple conditions. Click Create multi-condition alerts to enable adding conditions as boolean expressions.

                                                                                                                                                                                                                                                                                                                                                            These advanced alerts require specific syntax, as described in the examples below.

                                                                                                                                                                                                                                                                                                                                                            Format and Operations

                                                                                                                                                                                                                                                                                                                                                            Each condition has five parts:

                                                                                                                                                                                                                                                                                                                                                            • Metric Name : Use the exact metric names. To avoid typos, click the HELP link to access the drop-down list of available metrics. Selecting a metric from the list will automatically add the name to the threshold expression being edited.

                                                                                                                                                                                                                                                                                                                                                            • Group Aggregation (optional): If no group aggregation type is selected, the appropriate default for the metric will be applied (either sum or average). Group aggregation functions must be applied outside of time aggregation functions.

                                                                                                                                                                                                                                                                                                                                                            • Time aggregation : It’s the historical data rolled up over a selected period of time.

                                                                                                                                                                                                                                                                                                                                                            • Operator: Both logical and relational operators are supported.

                                                                                                                                                                                                                                                                                                                                                            • Value: A static numerical value against which a condition is evaluated.

                                                                                                                                                                                                                                                                                                                                                            The table below displays supported time aggregation functions, group aggregation functions, and relational operators:

                                                                                                                                                                                                                                                                                                                                                            Time Aggregation FunctionGroup Aggregation FunctionRelational Operator
                                                                                                                                                                                                                                                                                                                                                            timeAvg()avg()=
                                                                                                                                                                                                                                                                                                                                                            min()min()<
                                                                                                                                                                                                                                                                                                                                                            max()max()>
                                                                                                                                                                                                                                                                                                                                                            sum()sum()<=
                                                                                                                                                                                                                                                                                                                                                            >=
                                                                                                                                                                                                                                                                                                                                                            !=

                                                                                                                                                                                                                                                                                                                                                            The format is:

                                                                                                                                                                                                                                                                                                                                                            condition1 AND condition2
                                                                                                                                                                                                                                                                                                                                                            condition1 OR condition2
                                                                                                                                                                                                                                                                                                                                                            NOT condition1
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            The order of operations can also be altered via parenthesis:

                                                                                                                                                                                                                                                                                                                                                            NOT (condition1 AND (condition2 OR condition3))
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Conditions take the following form:

                                                                                                                                                                                                                                                                                                                                                            groupAggregation(timeAggregation(metric.name)) operator value
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Example Expressions

                                                                                                                                                                                                                                                                                                                                                            Several examples of advanced alerts are given below:

                                                                                                                                                                                                                                                                                                                                                            timeAvg(cpu.used.percent) > 50 AND timeAvg(memory.used.percent) > 75
                                                                                                                                                                                                                                                                                                                                                            timeAvg(cpu.used.percent) > 50 OR timeAvg(memory.used.percent) > 75
                                                                                                                                                                                                                                                                                                                                                            timeAvg(container.count) != 10
                                                                                                                                                                                                                                                                                                                                                            min(min(cpu.used.percent)) <= 30 OR max(max(cpu.used.percent)) >= 60
                                                                                                                                                                                                                                                                                                                                                            sum(file.bytes.total) > 0 OR sum(net.bytes.total) > 0
                                                                                                                                                                                                                                                                                                                                                            timeAvg(cpu.used.percent) > 50 AND (timeAvg(mysql.net.connections) > 20 OR timeAvg(memory.used.percent) > 75)
                                                                                                                                                                                                                                                                                                                                                            
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            8.1 -

                                                                                                                                                                                                                                                                                                                                                            Manage Alerts

                                                                                                                                                                                                                                                                                                                                                            Alerts can be managed individually, or as a group, by using the checkboxes on the left side of the Alert UI and the customization bar. The columns of the table can also be configured, to provide you with the necessary data for your use cases.

                                                                                                                                                                                                                                                                                                                                                            Select a group of alerts and perform several batch operations, such as filtering, deleting, enabling, disabling, or exporting to a JSON object. Select individual alerts to perform tasks such as creating a copy for a different team.

                                                                                                                                                                                                                                                                                                                                                            View Alert Details

                                                                                                                                                                                                                                                                                                                                                            The bell button next to an alert indicates that you have not resolved the corresponding events. The Activity Over Last Two Weeks column visually notifies you with an event chart showing the number of events that were triggered over the last two weeks. The color of the event chart represents what severity level they are.

                                                                                                                                                                                                                                                                                                                                                            To view alert details, click the corresponding alert row. The slider with the alert details will appear. Click an individual event to Take Action. You can do one of the following:

                                                                                                                                                                                                                                                                                                                                                            • Acknowledge: Mark that the event has been acknowledged by the intended recipient.

                                                                                                                                                                                                                                                                                                                                                            • Create Silence from Event: If you no longer want to be notified, use this option. You can choose the scope for alert silence. When silenced, alerts will still be triggered but will not send you any notifications.

                                                                                                                                                                                                                                                                                                                                                            • Explore: Use this option to troubleshoot by using the PromQL Query.

                                                                                                                                                                                                                                                                                                                                                            The event feed will be empty and The Activity Over Last Two Weeks column will have no event chart if no events are reported in the past two weeks.

                                                                                                                                                                                                                                                                                                                                                            Enable/Disable Alerts

                                                                                                                                                                                                                                                                                                                                                            Alerts can be enabled or disabled using the slider or the customization bar. You can perform these operations on a single alert or on multiple alerts as a batch operation.

                                                                                                                                                                                                                                                                                                                                                            1. From the Alerts module, check the boxes beside the relevant alerts.

                                                                                                                                                                                                                                                                                                                                                            2. Click Enable Selected or Disable Selected as necessary.

                                                                                                                                                                                                                                                                                                                                                            Use the slider beside the alert to disable or enable individual alerts.

                                                                                                                                                                                                                                                                                                                                                            Edit an Existing Alert

                                                                                                                                                                                                                                                                                                                                                            To edit an existing alert:

                                                                                                                                                                                                                                                                                                                                                            1. Do one of the following::

                                                                                                                                                                                                                                                                                                                                                              • Click the Edit button beside the alert.

                                                                                                                                                                                                                                                                                                                                                              • Click an alert to open the detail view, then click Edit on the top right corner

                                                                                                                                                                                                                                                                                                                                                            2. Edit the alert, and click Save to confirm the changes.

                                                                                                                                                                                                                                                                                                                                                            Copy an Alert

                                                                                                                                                                                                                                                                                                                                                            Alerts can be copied within the current team to allow for similar alerts to be created quickly, or copied to a different team to share alerts.

                                                                                                                                                                                                                                                                                                                                                            Copy an Alert to the Current Team

                                                                                                                                                                                                                                                                                                                                                            To copy an alert within the current team:

                                                                                                                                                                                                                                                                                                                                                            1. Highlight the alert to be copied.

                                                                                                                                                                                                                                                                                                                                                              The detail view is displayed.

                                                                                                                                                                                                                                                                                                                                                            2. Click Copy.

                                                                                                                                                                                                                                                                                                                                                              The Copy Alert screen is displayed.

                                                                                                                                                                                                                                                                                                                                                            3. Select Current from the drop-down.

                                                                                                                                                                                                                                                                                                                                                            4. Click Copy and Open.

                                                                                                                                                                                                                                                                                                                                                              The particular alert in the edit mode appears.

                                                                                                                                                                                                                                                                                                                                                            5. Make necessary changes and save the alert.

                                                                                                                                                                                                                                                                                                                                                            Copy an Alert to a Different Team

                                                                                                                                                                                                                                                                                                                                                            1. Highlight the alert to be copied.

                                                                                                                                                                                                                                                                                                                                                              The detail view is displayed.

                                                                                                                                                                                                                                                                                                                                                            2. Click Copy.

                                                                                                                                                                                                                                                                                                                                                              The Copy Alert screen is displayed.

                                                                                                                                                                                                                                                                                                                                                            3. Select the teams that the alert should be copied to.

                                                                                                                                                                                                                                                                                                                                                            4. Click Send Copy.

                                                                                                                                                                                                                                                                                                                                                            Search for an Alert

                                                                                                                                                                                                                                                                                                                                                            Search Using Strings

                                                                                                                                                                                                                                                                                                                                                            The Alerts table can be searched using partial or full strings. For example, the search below displays only events that contain kubernetes:

                                                                                                                                                                                                                                                                                                                                                            Filter Alerts

                                                                                                                                                                                                                                                                                                                                                            The alert feed can be filtered in multiple ways, to drill-down into the environment’s history and refine the alert displayed. The feed can be filtered by severity or status. Examples of each are shown below.

                                                                                                                                                                                                                                                                                                                                                            The example below shows only high and medium severity:

                                                                                                                                                                                                                                                                                                                                                            The example below shows the alerts that are invalid:

                                                                                                                                                                                                                                                                                                                                                            Export Alerts as JSON

                                                                                                                                                                                                                                                                                                                                                            A JSON file can be exported to a local machine, containing JSON snippets for each selected alert:

                                                                                                                                                                                                                                                                                                                                                            1. Click the checkboxes beside the relevant alerts to be exported.

                                                                                                                                                                                                                                                                                                                                                            2. Click Export JSON.

                                                                                                                                                                                                                                                                                                                                                            Delete Alerts

                                                                                                                                                                                                                                                                                                                                                            Open the Alert page and use one of the following methods to delete alerts :

                                                                                                                                                                                                                                                                                                                                                            • Hover on a specific alert and click Delete.

                                                                                                                                                                                                                                                                                                                                                            • Hover on one or more alerts, click the checkbox, then click Delete on the bulk-action toolbar.

                                                                                                                                                                                                                                                                                                                                                            • Click an alert to see the detailed view, then click Delete on the top right corner.

                                                                                                                                                                                                                                                                                                                                                            8.2 -

                                                                                                                                                                                                                                                                                                                                                            Silence Alert Notifications

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor allows you to silence alerts for a given scope for a predefined amount of time. When silenced, alerts will still be triggered but will not send any notifications. You can schedule silencing in advance. This helps administrators to temporarily mute notifications during planned downtime or maintenance and send downtime notifications to selected channels.

                                                                                                                                                                                                                                                                                                                                                            With an active silence, the only notifications you will receive are those indicating the start time and the end time of the silence. All other notifications for events from that scope will be silenced. When a silence is active, creating an alert triggers the alert but no notification will be sent. Additionally, a triggering event will be generated stating that the alert is silenced.

                                                                                                                                                                                                                                                                                                                                                            See Working with Alert APIs for programmatically silencing alert notifications.

                                                                                                                                                                                                                                                                                                                                                            Configure a Silence

                                                                                                                                                                                                                                                                                                                                                            When you create a new silence, it is by default enabled and scheduled. When the start time arrives for a scheduled silence, it becomes active and the list shows the time remaining. When the end time arrives, the silence becomes completed and cannot be enabled again.

                                                                                                                                                                                                                                                                                                                                                            To configure a silence:

                                                                                                                                                                                                                                                                                                                                                            1. Click Alerts on the left navigation on the Monitor UI.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Silence tab.

                                                                                                                                                                                                                                                                                                                                                              The page shows the list of all the existing silences.

                                                                                                                                                                                                                                                                                                                                                            3. Click Set a Silence.

                                                                                                                                                                                                                                                                                                                                                              The Silence for Scope window is displayed.

                                                                                                                                                                                                                                                                                                                                                            1. Specify the following:

                                                                                                                                                                                                                                                                                                                                                              • Scope: Specify the entity you want to apply the scope as. For example, a particular workload or namespace, from environments that may include thousands of entities.

                                                                                                                                                                                                                                                                                                                                                              • Begins: Specify one of the following: Today, Tomorrow, Pick Another Day. Select the time from the drop-down.

                                                                                                                                                                                                                                                                                                                                                              • Duration: Specify how long notifications should be suppressed.

                                                                                                                                                                                                                                                                                                                                                              • Name: Specify a name to identify the silence.

                                                                                                                                                                                                                                                                                                                                                              • Notify: Select a channel you want to notify about the silence.

                                                                                                                                                                                                                                                                                                                                                            2. Click Save.

                                                                                                                                                                                                                                                                                                                                                            Silence Alert Notifications from Event Feed

                                                                                                                                                                                                                                                                                                                                                            You can also create and edit silences and view silenced alert events on the Events feeds across the Monitor UI. When you create a silence, the alert will still be triggered and posted on the Events feed and in the graph overlays but will indicate that the alert has been silenced.

                                                                                                                                                                                                                                                                                                                                                            If you have an alert with no notification channel configured, events generated from that alert won’t be marked as silenced. They won’t be visually represented in the events feed as well with the crossed bell icon and the option to silence events.

                                                                                                                                                                                                                                                                                                                                                            To do so,

                                                                                                                                                                                                                                                                                                                                                            1. On the event feed, select the alert event that you want to silence.

                                                                                                                                                                                                                                                                                                                                                            2. On the event details slider, click Take Action.

                                                                                                                                                                                                                                                                                                                                                            3. Click Create Silence from Event.

                                                                                                                                                                                                                                                                                                                                                              The Silence for Scope window is displayed.

                                                                                                                                                                                                                                                                                                                                                            4. Continue configuring the silence as described in 4.

                                                                                                                                                                                                                                                                                                                                                            Manage Silences

                                                                                                                                                                                                                                                                                                                                                            Silences can be managed individually, or as a group, by using the checkboxes on the left side of the Silence UI and the customization bar. Select a group of silences and perform batch delete operations. Select individual silences to perform tasks such as enabling, disabling, duplicating, and editing.

                                                                                                                                                                                                                                                                                                                                                            Change States

                                                                                                                                                                                                                                                                                                                                                            You can enable or disable a silence by sliding the state bar next to the silences. There are two kinds of silences that will show as enabled: active (a running silence) and a scheduled silence (which will start in the future). Its starting date is back in time but the end date is yet to happen. A clock icon visually represents an active silence.

                                                                                                                                                                                                                                                                                                                                                            Completed silences cannot be re-enabled once a silenced period is finished. However, you can duplicate it with all the data but you need to set a new silencing period.

                                                                                                                                                                                                                                                                                                                                                            A silence can be disabled only when:

                                                                                                                                                                                                                                                                                                                                                            • The silence is not yet started

                                                                                                                                                                                                                                                                                                                                                            • The silence is in progress.

                                                                                                                                                                                                                                                                                                                                                            Filter Silences

                                                                                                                                                                                                                                                                                                                                                            Use the search bar to filter silences. You can either perform a simple auto-complete text search or use the categories. The feed can be filtered by the following categories: Active, Scheduled, Completed.

                                                                                                                                                                                                                                                                                                                                                            For example, the following shows the completed silences that start with “ag”.

                                                                                                                                                                                                                                                                                                                                                            Duplicate a Silence

                                                                                                                                                                                                                                                                                                                                                            Do one of the following to duplicate a silence:

                                                                                                                                                                                                                                                                                                                                                            • Click the Duplicate hover-the-row button on the menu.

                                                                                                                                                                                                                                                                                                                                                            • Click the row for the Silence for Scope window to open. On the window, make necessary changes if required and click Duplicate.

                                                                                                                                                                                                                                                                                                                                                            Edit Silence

                                                                                                                                                                                                                                                                                                                                                            You can edit scheduled silences. For the active ones, you can only extend the time. You cannot edit completed silences.

                                                                                                                                                                                                                                                                                                                                                            To edit a silence, do one of the following:

                                                                                                                                                                                                                                                                                                                                                            • Click the row for the Silence for Scope window to open. Make necessary changes and click Update.

                                                                                                                                                                                                                                                                                                                                                            • Click the Edit hover-the-row button on the menu. The Silence for Scope window will be displayed.

                                                                                                                                                                                                                                                                                                                                                              Make necessary changes and click Update.

                                                                                                                                                                                                                                                                                                                                                            Extend the Time Duration

                                                                                                                                                                                                                                                                                                                                                            For the active silences, you can extend the duration to one of the following:

                                                                                                                                                                                                                                                                                                                                                            • 1 Hour

                                                                                                                                                                                                                                                                                                                                                            • 2 Hours,

                                                                                                                                                                                                                                                                                                                                                            • 6 Hours,

                                                                                                                                                                                                                                                                                                                                                            • 12 Hours

                                                                                                                                                                                                                                                                                                                                                            • 24 Hours

                                                                                                                                                                                                                                                                                                                                                            To do so, click the extend the time duration button on the menu and choose the duration. You can extend the time of an active silence even from the Silence for Scope window.

                                                                                                                                                                                                                                                                                                                                                            Extending the time duration will notify the configured notification channels that the downtime is extended. You can also extend the time from a Slack notification of a silence by clicking the given link. It opens the Silence for Scope window of the running silence where you can make necessary adjustments.

                                                                                                                                                                                                                                                                                                                                                            You cannot extend the duration of completed silences.

                                                                                                                                                                                                                                                                                                                                                            8.3 -

                                                                                                                                                                                                                                                                                                                                                            Alerts Library

                                                                                                                                                                                                                                                                                                                                                            To help you get started quickly, Sysdig provides a set of curated alert templates called Alerts Library. Powered by Monitor Integrations , Sysdig automatically detects the applications and services running in your environment and recommends alerts that you can enable.

                                                                                                                                                                                                                                                                                                                                                            Two types of alert templates are included in Alerts Library:

                                                                                                                                                                                                                                                                                                                                                            • Recommended: Alert suggestions based on the services that are detected running in your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            • All templates: You can browse templates for all the services. For some templates, you might need to configure Monitor Integrations.

                                                                                                                                                                                                                                                                                                                                                            Access Alerts Library

                                                                                                                                                                                                                                                                                                                                                            1. Log in to Sysdig Monitor.

                                                                                                                                                                                                                                                                                                                                                            2. Click Alerts from the left navigation pane.

                                                                                                                                                                                                                                                                                                                                                            3. On the Alerts tab, click  Library.

                                                                                                                                                                                                                                                                                                                                                            Import an Alert

                                                                                                                                                                                                                                                                                                                                                            1. Locate the service that you want to configure an alert for.

                                                                                                                                                                                                                                                                                                                                                              To do so, either use the text search or identify from a list of services.

                                                                                                                                                                                                                                                                                                                                                            2. For example, click Redis.

                                                                                                                                                                                                                                                                                                                                                              Eight template suggestions are displayed for 14 Redis services running on the environment.

                                                                                                                                                                                                                                                                                                                                                            3. From a list of template suggestions, choose the desired template.

                                                                                                                                                                                                                                                                                                                                                              The Redis page shows the alerts that are already in use and that you can enable.

                                                                                                                                                                                                                                                                                                                                                            4. Enable one or more alert templates. To do so, you can do one of the following:

                                                                                                                                                                                                                                                                                                                                                              • Click Enable Alert.

                                                                                                                                                                                                                                                                                                                                                              • Bulk enable templates. Select the check box corresponding to the alert templates and click Enable Alert on the top-right corner.

                                                                                                                                                                                                                                                                                                                                                              • Click on the alert template to display the slider. Click the Enable Alert on the slider.

                                                                                                                                                                                                                                                                                                                                                            5. On the Configure Redis Alert page, specify the Scope and select the Notification channels.

                                                                                                                                                                                                                                                                                                                                                            6. Click Enable Alert.

                                                                                                                                                                                                                                                                                                                                                              You will see a message stating that the Redis Alert has been successfully created.

                                                                                                                                                                                                                                                                                                                                                            Use Alerts Library

                                                                                                                                                                                                                                                                                                                                                            In addition to importing an alert, you can also do the following with the Alerts Library:

                                                                                                                                                                                                                                                                                                                                                            • Identify Alert templates associated with the services running in your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            • Bulk import Alert templates. See Import an Alert.

                                                                                                                                                                                                                                                                                                                                                            • View alerts that are already configured.

                                                                                                                                                                                                                                                                                                                                                            • Filter Alert templates. Enter the search string to display the matching results.

                                                                                                                                                                                                                                                                                                                                                            • Discover the workloads where a service is running. To do so, click on the Alert template to display the slider. On the slider, click Workloads.

                                                                                                                                                                                                                                                                                                                                                            • View the alerts in use. To do so, click on an Alert template to display the slider. On the slider, click Alerts in use.

                                                                                                                                                                                                                                                                                                                                                            • Configure an alert.

                                                                                                                                                                                                                                                                                                                                                              Additional alert configuration, such as changing the alert name, description, and severity can be done after the import.

                                                                                                                                                                                                                                                                                                                                                            8.4 -

                                                                                                                                                                                                                                                                                                                                                            Downtime Alert

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor continuously surveils any type of entity in your infrastructure, such as a host, a container, a process, or a service, and sends notifications when the monitored entity is not available or responding. Downtime alert focuses mainly on unscheduled downtime of your infrastructure.

                                                                                                                                                                                                                                                                                                                                                            In this example, a Kubernetes cluster is monitored and the alert is segmented on both cluster and namespace. When a Kubernetes cluster in the selected availability zone goes down, notifications will be sent with necessary information on both cluster and affected namespace.

                                                                                                                                                                                                                                                                                                                                                            The lines shown in the preview chart represent the values for the segments selected to monitor. The popup is a color-coded legend to show which segment (or combination of segments if there is more than one) the lines represent. You can also deselect some segment lines to prevent them from showing in the chart. Note that there is a limit of 10 lines that Sysdig Monitor ever shows in the preview chart. For downtime alerts, segments are actually what you select for the “Select entity to monitor” option.

                                                                                                                                                                                                                                                                                                                                                            Define a Downtime Alert

                                                                                                                                                                                                                                                                                                                                                            Guidelines

                                                                                                                                                                                                                                                                                                                                                            • Set a unique name and description: Set a meaningful name and description that help recipients easily identify the alert.

                                                                                                                                                                                                                                                                                                                                                            • Severity: Set a severity level for your alert. The Priority—High, Medium, Low, and Info—are reflected in the Alert list, where you can sort by the severity of the Alert. You can use severity as a criterion when creating alerts, for example: if there are more than 10 high severity events, notify.

                                                                                                                                                                                                                                                                                                                                                            • Specify multiple segments: Selecting a single segment might not always supply enough information to troubleshoot. Enrich the selected entity with related information by adding additional related segments. Enter hierarchical entities so you have the bottom-down picture of what went wrong and where. For example, specifying a Kubernetes Cluster alone does not provide the context necessary to troubleshoot. In order to narrow down the issue, add further contextual information, such as Kubernetes Namespace, Kubernetes Deployment, and so on.

                                                                                                                                                                                                                                                                                                                                                            Specify Entity

                                                                                                                                                                                                                                                                                                                                                            1. Select an entity whose downtime you want to monitor for.

                                                                                                                                                                                                                                                                                                                                                              In this example, you are monitoring the unscheduled downtime of a host.

                                                                                                                                                                                                                                                                                                                                                            2. Specify additional segments:

                                                                                                                                                                                                                                                                                                                                                              The specified entities are segmented on and notified with the default notification template as well as on the Preview. In this example, data is segmented on Kubernetes cluster name and namespace name. When a cluster is affected, the notification will not only include the affected cluster details but also the associated namespaces.

                                                                                                                                                                                                                                                                                                                                                            Configure Scope

                                                                                                                                                                                                                                                                                                                                                            Filter the environment on which this alert will apply. An alert will fire when a host goes down in the availability zone, us-east-1b.

                                                                                                                                                                                                                                                                                                                                                            Use in or contain operators to match multiple different possible values to apply scope.

                                                                                                                                                                                                                                                                                                                                                            The contain and not contain operators help you retrieve values if you know part of the values. For example, us retrieves values that contain strings that start with “us”, such as “us-east-1b”, “us-west-2b”, and so on.

                                                                                                                                                                                                                                                                                                                                                            The in and not in operators help you filter multiple values.

                                                                                                                                                                                                                                                                                                                                                            You can also create alerts directly from Explore and Dashboards for automatically populating this scope.

                                                                                                                                                                                                                                                                                                                                                            Configure Trigger

                                                                                                                                                                                                                                                                                                                                                            Define the threshold and time window for assessing the alert condition. Supported time scales are minute, hour, or day.

                                                                                                                                                                                                                                                                                                                                                            If the monitored host or Kubernetes cluster is not available or not responding for the last 10 minutes, recipients will be notified.

                                                                                                                                                                                                                                                                                                                                                            You can set any value for % and a value greater than 1 for the time window. For example, If you choose 50% instead of 100%, a notification will be triggered when the entity is down for 5 minutes in the selected time window of 10 minutes.

                                                                                                                                                                                                                                                                                                                                                            Use Cases

                                                                                                                                                                                                                                                                                                                                                            • Your e-commerce website is down during the peak hours of Black Friday, Christmas, or New Year season.

                                                                                                                                                                                                                                                                                                                                                            • Production servers of your data center experience a critical outage

                                                                                                                                                                                                                                                                                                                                                            • MySQL database is unreachable

                                                                                                                                                                                                                                                                                                                                                            • File upload does not work on your marketing website.

                                                                                                                                                                                                                                                                                                                                                            8.5 -

                                                                                                                                                                                                                                                                                                                                                            PromQL Alerts

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor enables you to use PromQL to define metric expressions that you can alert on. You define the alert conditions using the PromQL-based metric expression. This way, you can combine different metrics and warn on cases like service-level agreement breach, running out of disk space in a day, and so on.

                                                                                                                                                                                                                                                                                                                                                            Examples

                                                                                                                                                                                                                                                                                                                                                            For PromQL alerts, you can use any metric that is available in PromQL, including Sysdig native metrics. For more details see the various integrations available on promcat.io.

                                                                                                                                                                                                                                                                                                                                                            Low Disk Space Alert

                                                                                                                                                                                                                                                                                                                                                            Warn if disk space falls below a specified quantity. For example disk space is below 10GB in the 24h hour:

                                                                                                                                                                                                                                                                                                                                                            predict_linear(sysdig_fs_free_bytes{fstype!~"tmpfs"}[1h], 24*3600) < 10000000000
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Slow Etcd Requests

                                                                                                                                                                                                                                                                                                                                                            Notify if etcd requests are slow. This example uses the promcat.io integration.

                                                                                                                                                                                                                                                                                                                                                            histogram_quantile(0.99, rate(etcd_http_successful_duration_seconds_bucket[5m]) > 0.15
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            High Heap Usage

                                                                                                                                                                                                                                                                                                                                                            Warn when the heap usage in ElasticSearch is more than 80%. This example uses the promcat.io integration.

                                                                                                                                                                                                                                                                                                                                                            (elasticsearch_jvm_memory_used_bytes{area="heap"} / elasticsearch_jvm_memory_max_bytes{area="heap"}) * 100 > 80
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Guidelines

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor does not currently support the following:

                                                                                                                                                                                                                                                                                                                                                            • Interact with the Prometheus alert manager or import alert manager configuration.

                                                                                                                                                                                                                                                                                                                                                            • Provide the ability to use, copy, paste, and import predefined alert rules.

                                                                                                                                                                                                                                                                                                                                                            • Convert the alert rules to map to the Sysdig alert editor.

                                                                                                                                                                                                                                                                                                                                                            Create a PromQL Alert

                                                                                                                                                                                                                                                                                                                                                            Set a meaningful name and description that help recipients easily identify the alert.

                                                                                                                                                                                                                                                                                                                                                            Set a Priority

                                                                                                                                                                                                                                                                                                                                                            Select a priority for the alert that you are creating. The supported priorities are High, Medium, Low, and Info. You can also view and sort events in the dashboard and explore UI, as well as sort them by severity.

                                                                                                                                                                                                                                                                                                                                                            Define a PromQL Alert

                                                                                                                                                                                                                                                                                                                                                            PromQL: Enter a valid PromQL expression. The query will be executed every minute. However, the alert will be triggered only if the query returns data for the specified duration.

                                                                                                                                                                                                                                                                                                                                                            In this example, you will be alerted when the rate of HTTP requests has doubled over the last 5 minutes.

                                                                                                                                                                                                                                                                                                                                                            Duration: Specify the time window for evaluating the alert condition in minutes, hour, or day. The alert will be triggered if the query returns data for the specified duration.

                                                                                                                                                                                                                                                                                                                                                            Define Notification

                                                                                                                                                                                                                                                                                                                                                            Notification Channels: Select from the configured notification channels in the list.

                                                                                                                                                                                                                                                                                                                                                            Re-notification Options: Set the time interval at which multiple alerts should be sent if the problem remains unresolved.

                                                                                                                                                                                                                                                                                                                                                            Notification Message & Events: Enter a subject and body. Optionally, you can choose an existing template for the body. Modify the subject, body, or both for the alert notification with a hyperlink, plain text, or dynamic variables.

                                                                                                                                                                                                                                                                                                                                                            Import Prometheus Alert Rules

                                                                                                                                                                                                                                                                                                                                                            Sysdig Alert allows you to import Prometheus rules or create new rules on the fly and add them to the existing list of alerts. Click the Upload Prometheus Rules option and enter the rules as YAML in the Upload Prometheus Rules YAML editor. Importing your Prometheus alert rules will convert them to PromQL-based Sysdig alerts. Ensure that the alert rules are valid YAML.

                                                                                                                                                                                                                                                                                                                                                            You can upload one or more alert rules in a single YAML and create multiple alerts simultaneously.

                                                                                                                                                                                                                                                                                                                                                            Once the rules are imported to Sysdig Monitor, the alert list will be automatically sorted by last modified date.

                                                                                                                                                                                                                                                                                                                                                            Besides the pre-populated template, each rule specified in the Upload Prometheus Rules YAML editor requires the following fields:

                                                                                                                                                                                                                                                                                                                                                            • alert

                                                                                                                                                                                                                                                                                                                                                            • expr 

                                                                                                                                                                                                                                                                                                                                                            •  for

                                                                                                                                                                                                                                                                                                                                                            See the following examples to understand the format of Prometheus Rules YAML. Ensure that the alert rules are valid YAML to pass validation.

                                                                                                                                                                                                                                                                                                                                                            Example: Alert Prometheus Crash Looping

                                                                                                                                                                                                                                                                                                                                                            To alert potential Prometheus crash looping. Create a rule to alert when Prometheus restart more than twice in the last 10 minutes.

                                                                                                                                                                                                                                                                                                                                                            groups:
                                                                                                                                                                                                                                                                                                                                                            - name: crashlooping
                                                                                                                                                                                                                                                                                                                                                              rules:
                                                                                                                                                                                                                                                                                                                                                              - alert: PrometheusTooManyRestarts
                                                                                                                                                                                                                                                                                                                                                                expr: changes(process_start_time_seconds{job=~"prometheus|pushgateway|alertmanager"}[10m]) > 2
                                                                                                                                                                                                                                                                                                                                                                for: 0m
                                                                                                                                                                                                                                                                                                                                                                labels:
                                                                                                                                                                                                                                                                                                                                                                  severity: warning
                                                                                                                                                                                                                                                                                                                                                                annotations:
                                                                                                                                                                                                                                                                                                                                                                  summary: Prometheus too many restarts (instance {{ $labels.instance }})
                                                                                                                                                                                                                                                                                                                                                                  description: Prometheus has restarted more than twice in the last 15 minutes. It might be crashlooping.\n  VALUE = {{ $value }}\n
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Example: Alert HTTP Error Rate

                                                                                                                                                                                                                                                                                                                                                            To alert HTTP requests with status 5xx (> 5%) or high latency:

                                                                                                                                                                                                                                                                                                                                                            groups:
                                                                                                                                                                                                                                                                                                                                                            - name: default
                                                                                                                                                                                                                                                                                                                                                              rules:
                                                                                                                                                                                                                                                                                                                                                              # Paste your rules here
                                                                                                                                                                                                                                                                                                                                                              - alert: NginxHighHttp5xxErrorRate
                                                                                                                                                                                                                                                                                                                                                                expr: sum(rate(nginx_http_requests_total{status=~"^5.."}[1m])) / sum(rate(nginx_http_requests_total[1m])) * 100 > 5
                                                                                                                                                                                                                                                                                                                                                                for: 1m
                                                                                                                                                                                                                                                                                                                                                                labels:
                                                                                                                                                                                                                                                                                                                                                                  severity: critical
                                                                                                                                                                                                                                                                                                                                                                annotations:
                                                                                                                                                                                                                                                                                                                                                                  summary: Nginx high HTTP 5xx error rate (instance {{ $labels.instance }})
                                                                                                                                                                                                                                                                                                                                                                  description: Too many HTTP requests with status 5xx
                                                                                                                                                                                                                                                                                                                                                              - alert: NginxLatencyHigh
                                                                                                                                                                                                                                                                                                                                                                expr: histogram_quantile(0.99, sum(rate(nginx_http_request_duration_seconds_bucket[2m])) by (host, node)) > 3
                                                                                                                                                                                                                                                                                                                                                                for: 2m
                                                                                                                                                                                                                                                                                                                                                                labels:
                                                                                                                                                                                                                                                                                                                                                                  severity: warning
                                                                                                                                                                                                                                                                                                                                                                annotations:
                                                                                                                                                                                                                                                                                                                                                                  summary: Nginx latency high (instance {{ $labels.instance }})
                                                                                                                                                                                                                                                                                                                                                                  description: Nginx p99 latency is higher than 3 seconds
                                                                                                                                                                                                                                                                                                                                                            

                                                                                                                                                                                                                                                                                                                                                            Learn More

                                                                                                                                                                                                                                                                                                                                                            8.6 -

                                                                                                                                                                                                                                                                                                                                                            Metric Alerts

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor keeps a watch on time-series metrics, and alert if they violate user-defined thresholds.

                                                                                                                                                                                                                                                                                                                                                            The lines shown in the preview chart represent the values for the segments selected to monitor. The popup is a color-coded legend to show which segment (or combination of segments if there is more than one) the lines represent. You can also deselect some segment lines to prevent them from showing in the chart. Note that there is a limit of 10 lines that Sysdig Monitor ever shows in the preview chart.

                                                                                                                                                                                                                                                                                                                                                            Defining a Metric Alert

                                                                                                                                                                                                                                                                                                                                                            Guidelines

                                                                                                                                                                                                                                                                                                                                                            • Set a unique name and description: Set a meaningful name and description that help recipients easily identify the alert

                                                                                                                                                                                                                                                                                                                                                            • Specify multiple segments: Selecting a single segment might not always supply enough information to troubleshoot. Enrich the selected entity with related information by adding additional related segments. Enter hierarchical entities so you have the bottom-down picture of what went wrong and where. For example, specifying a Kubernetes Cluster alone does not provide the context necessary to troubleshoot. In order to narrow down the issue, add further contextual information, such as Kubernetes Namespace, Kubernetes Deployment, and so on.

                                                                                                                                                                                                                                                                                                                                                            Specify Metrics

                                                                                                                                                                                                                                                                                                                                                            Select a metric that this alert will monitor. You can also define how data is aggregated, such as avg, max, min or sum. To alert on multiple metrics using boolean logic, switch to multi-condition alert.

                                                                                                                                                                                                                                                                                                                                                            Configure Scope

                                                                                                                                                                                                                                                                                                                                                            Filter the environment on which this alert will apply.

                                                                                                                                                                                                                                                                                                                                                            Filter the environment on which this alert will apply. An alert will fire when a host goes down in the availability zone, us-east-1b.

                                                                                                                                                                                                                                                                                                                                                            Use advanced operators to include, exclude, or pattern-match groups, tags, and entities. See Multi-Condition Alerts.

                                                                                                                                                                                                                                                                                                                                                            You can also create alerts directly from Explore and Dashboards for automatically populating this scope.

                                                                                                                                                                                                                                                                                                                                                            Configure Trigger

                                                                                                                                                                                                                                                                                                                                                            Define the threshold and time window for assessing the alert condition. Single Alert fires an alert for your entire scope, while Multiple Alert fires if any or every segment breach the threshold at once.

                                                                                                                                                                                                                                                                                                                                                            Metric alerts can be triggered to notify you of different aggregations:

                                                                                                                                                                                                                                                                                                                                                            Aggregation

                                                                                                                                                                                                                                                                                                                                                            Description

                                                                                                                                                                                                                                                                                                                                                            on average

                                                                                                                                                                                                                                                                                                                                                            The average of the retrieved metric values across the time period. Actual number of samples retrieved is used to calculate the value.

                                                                                                                                                                                                                                                                                                                                                            For example, if new data is retrieved in the 7th minute of a 10-minutes sample and the alert is defined as on average, the alert will be calculated by summing the 3 recorded values and dividing by 3.

                                                                                                                                                                                                                                                                                                                                                            as a rate

                                                                                                                                                                                                                                                                                                                                                            The average value of the metric across the time period evaluated. The expected number of values is used to calculate the rate to trigger the alert.

                                                                                                                                                                                                                                                                                                                                                            For example, if new data is retrieved in the 7th minute of a 10-minutes sample and the alert is defined as as a rate, the alert will be calculated by summing the 3 recorded values and dividing by 10 ( 10 x 1 minute samples).

                                                                                                                                                                                                                                                                                                                                                            in sum

                                                                                                                                                                                                                                                                                                                                                            The combined sum of the metric across the time period evaluated.

                                                                                                                                                                                                                                                                                                                                                            at least once

                                                                                                                                                                                                                                                                                                                                                            The trigger value is met for at least one sample in the evaluated period.

                                                                                                                                                                                                                                                                                                                                                            for the entire time

                                                                                                                                                                                                                                                                                                                                                            The trigger value is met for a every sample in the evaluated period.

                                                                                                                                                                                                                                                                                                                                                            as a rate of change

                                                                                                                                                                                                                                                                                                                                                            The trigger value is met the change in value over the evaluated period.

                                                                                                                                                                                                                                                                                                                                                            For example, if the file system used percentage goes above 75 for the last 5 minutes on an average, multiple alerts will be triggered. The mac address of the host and mount directory of the file system will be represented in the alert notification.

                                                                                                                                                                                                                                                                                                                                                            Usecases

                                                                                                                                                                                                                                                                                                                                                            • Number of processes running on a host is not normal

                                                                                                                                                                                                                                                                                                                                                            • Root volume disk usage in a container is high

                                                                                                                                                                                                                                                                                                                                                            8.7 -

                                                                                                                                                                                                                                                                                                                                                            Event Alerts

                                                                                                                                                                                                                                                                                                                                                            Monitor occurrences of specific events, and alert if the total number of occurrences violates a threshold. Useful for alerting on container, orchestration, and service events like restarts and deployments.

                                                                                                                                                                                                                                                                                                                                                            Alerts on events support only one segmentation label. An alert is generated for each segment.

                                                                                                                                                                                                                                                                                                                                                            Defining a Metric Alert

                                                                                                                                                                                                                                                                                                                                                            Guidelines

                                                                                                                                                                                                                                                                                                                                                            • Set a unique name and description: Set a meaningful name and description that help recipients easily identify the alert.

                                                                                                                                                                                                                                                                                                                                                            • Severity: Set a severity level for your alert. The Priority: High, Medium, Low,andInfo are reflected in the Alert list, where you can sort by the severity by using the top navigation pane. You can use severity as a criterion when creating events and alerts, for example: if there are more than 10 high severity events, notify.

                                                                                                                                                                                                                                                                                                                                                            • Source Tag: Supported source tags are Kubernetes, Docker, and Containerd.

                                                                                                                                                                                                                                                                                                                                                            • Trigger: Specify the trigger condition in terms of the number of events for a given duration.

                                                                                                                                                                                                                                                                                                                                                              Event alert support only one segmentation label. If you choose Multiple Alerts, Sysdig generates only one alert for a selected segment.

                                                                                                                                                                                                                                                                                                                                                            Specify Event

                                                                                                                                                                                                                                                                                                                                                            1. Specify the name, tag, or description of an event.

                                                                                                                                                                                                                                                                                                                                                            2. Specify a Source Tag.

                                                                                                                                                                                                                                                                                                                                                            Configure Scope

                                                                                                                                                                                                                                                                                                                                                            Filter the environment on which this alert will apply. Use advanced operators to include, exclude, or pattern-match groups, tags, and entities. You can also create alerts directly from Explore and Dashboards for automatically populating this scope.

                                                                                                                                                                                                                                                                                                                                                            In this example, failing a liveness probe in the agent-process-whitelist-cluster cluster triggers an alert.

                                                                                                                                                                                                                                                                                                                                                            Configure Trigger

                                                                                                                                                                                                                                                                                                                                                            Define the threshold and time window for assessing the alert condition. Single Alert fires an alert for your entire scope, while Multiple Alert fires if any or every segment breach the threshold at once.

                                                                                                                                                                                                                                                                                                                                                            If the number of events triggered in the monitored entity is greater than 5 for the last 10 minutes, recipients will be notified through the selected channel.

                                                                                                                                                                                                                                                                                                                                                            8.8 -

                                                                                                                                                                                                                                                                                                                                                            Anomaly Detection Alerts

                                                                                                                                                                                                                                                                                                                                                            Anomaly refers to an outlier in a given data set polled from an environment. It is a deviation from a conformed pattern. Anomaly detection is about identifying these anomalous observations. A set of data points collectively, a single instance of data or context-specific abnormalities help detect anomalies. For example, unauthorized copying of a directory from a container, high CPU or memory consumption, and so on.

                                                                                                                                                                                                                                                                                                                                                            Define a Group Outlier Alert

                                                                                                                                                                                                                                                                                                                                                            Guidelines

                                                                                                                                                                                                                                                                                                                                                            • Set a unique name and description: Set a meaningful name and description that help recipients easily identify the alert

                                                                                                                                                                                                                                                                                                                                                            • Severity: Set a severity level for your alert. The Priority: High, Medium, Low,andInfo are reflected in the Alert list, where you can sort by the severity by using the top navigation pane. You can use severity as a criterion when creating events and alerts, for example: if there are more than 10 high severity events, notify.

                                                                                                                                                                                                                                                                                                                                                            • Specify multiple segments: Selecting a single segment might not always supply enough information to troubleshoot. Enrich the selected entity with related information by adding additional related segments. Enter hierarchical entities so you have the bottom-down picture of what went wrong and where. For example, specifying a Kubernetes Cluster alone does not provide the context necessary to troubleshoot. In order to narrow down the issue, add further contextual information, such as Kubernetes Namespace, Kubernetes Deployment, and so on.

                                                                                                                                                                                                                                                                                                                                                            Specify Entity

                                                                                                                                                                                                                                                                                                                                                            Select one or more metrics whose behavior you want to monitor.

                                                                                                                                                                                                                                                                                                                                                            Configure Scope

                                                                                                                                                                                                                                                                                                                                                            Filter the environment on which this alert will apply. An alert will fire when the value returned by one of the selected metrics does not follow the pattern in the availability zone, us-east-1b.

                                                                                                                                                                                                                                                                                                                                                            You can also create alerts directly from Explore and Dashboards for automatically populating this scope.

                                                                                                                                                                                                                                                                                                                                                            Configure Trigger

                                                                                                                                                                                                                                                                                                                                                            Trigger gives you control over how notifications are created and help prevent flooding your notification channel with notifications. For example, you may want to receive a notification for every violation, or only want a single notification for a series of consecutive violations.

                                                                                                                                                                                                                                                                                                                                                            Define the threshold and time window for assessing the alert condition. Supported time scales are minute, hour, or day.

                                                                                                                                                                                                                                                                                                                                                            If the monitored host or Kubernetes cluster is not available or not responding for the last 5 minutes, recipients will be notified.

                                                                                                                                                                                                                                                                                                                                                            You can set any value for % and a value greater than 1 for the time window. For example, If you choose 50% instead of 100%, a notification will be triggered when the entity is down for 2.5 minutes in the selected time window of 5 minutes.

                                                                                                                                                                                                                                                                                                                                                            8.9 -

                                                                                                                                                                                                                                                                                                                                                            Group Outlier Alerts

                                                                                                                                                                                                                                                                                                                                                            Sysdig Monitor observes a group of hosts and notifies you when one acts differently from the rest.

                                                                                                                                                                                                                                                                                                                                                            Define a Group Outlier Alert

                                                                                                                                                                                                                                                                                                                                                            Guidelines

                                                                                                                                                                                                                                                                                                                                                            • Set a unique name and description: Set a meaningful name and description that help recipients easily identify the alert

                                                                                                                                                                                                                                                                                                                                                            • Severity: Set a severity level for your alert. The Priority: High, Medium, Low,andInfo are reflected in the Alert list, where you can sort by the severity by using the top navigation pane. You can use severity as a criterion when creating events and alerts, for example: if there are more than 10 high severity events, notify.

                                                                                                                                                                                                                                                                                                                                                            Specify Entity

                                                                                                                                                                                                                                                                                                                                                            Select one or more metrics whose behavior you want to monitor.

                                                                                                                                                                                                                                                                                                                                                            Configure Scope

                                                                                                                                                                                                                                                                                                                                                            Filter the environment on which this alert will apply. An alert will fire when the value returned by one of the selected metrics does not follow the pattern in the availability zone, us-east-1b.

                                                                                                                                                                                                                                                                                                                                                            You can also create alerts directly from Explore and Dashboards for automatically populating this scope.

                                                                                                                                                                                                                                                                                                                                                            Configure Trigger

                                                                                                                                                                                                                                                                                                                                                            Trigger gives you control over how notifications are created and help prevent flooding your notification channel with notifications. For example, you may want to receive a notification for every violation, or only want a single notification for a series of consecutive violations.

                                                                                                                                                                                                                                                                                                                                                            Define the threshold and time window for assessing the alert condition. Supported time scales are minute, hour, or day.

                                                                                                                                                                                                                                                                                                                                                            If the monitored host or Kubernetes cluster is not available or not responding for the last 5 minutes, recipients will be notified.

                                                                                                                                                                                                                                                                                                                                                            You can set any value for % and a value greater than 1 for the time window. For example, If you choose 50% instead of 100%, a notification will be triggered when the entity is down for 2.5 minutes in the selected time window of 5 minutes.

                                                                                                                                                                                                                                                                                                                                                            Usecases

                                                                                                                                                                                                                                                                                                                                                            • Load balancer servers have uneven workloads

                                                                                                                                                                                                                                                                                                                                                            • Changes in applications or instances deployed in different availability zones.

                                                                                                                                                                                                                                                                                                                                                            • Network hogging hosts in a cluster

                                                                                                                                                                                                                                                                                                                                                            9 -

                                                                                                                                                                                                                                                                                                                                                            Captures

                                                                                                                                                                                                                                                                                                                                                            Sysdig capture files contain system calls and other OS events that can be analyzed with either the open-source sysdig or csysdig (curses-based) utilities, and are displayed in the Captures module.

                                                                                                                                                                                                                                                                                                                                                            The Captures module contains a table listing the capture file name, the host it was retrieved from, the time frame, and the size of the capture. When the capture file status is uploaded, the file has been successfully transmitted from the Sysdig agent to the storage bucket, and is available for download and analysis.

                                                                                                                                                                                                                                                                                                                                                            Store Capture Files

                                                                                                                                                                                                                                                                                                                                                            Sysdig capture files are stored in Sysdig’s AWS S3 storage (for SaaS environments), or in the Cassandra DB (for on-premises environments) by default.

                                                                                                                                                                                                                                                                                                                                                            Learn more about creating, configuring, and analyzing capture files:

                                                                                                                                                                                                                                                                                                                                                            This feature is available in the Enterprise tier of the Sysdig product. See https://sysdig.com/pricing for details, or contact sales@sysdig.com.

                                                                                                                                                                                                                                                                                                                                                            9.1 -

                                                                                                                                                                                                                                                                                                                                                            Configure Sysdig Captures

                                                                                                                                                                                                                                                                                                                                                            Create a Capture File

                                                                                                                                                                                                                                                                                                                                                            To create a capture file:

                                                                                                                                                                                                                                                                                                                                                            1. From the Explore module, select a host or container.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Settings (ellipsis) drop-down menu, and select Sysdig Capture. The Sysdig Capture pop-up window will open.

                                                                                                                                                                                                                                                                                                                                                            3. Define the following parameters, and click the Start Capture button:

                                                                                                                                                                                                                                                                                                                                                              Parameter

                                                                                                                                                                                                                                                                                                                                                              Description

                                                                                                                                                                                                                                                                                                                                                              Capture path and name

                                                                                                                                                                                                                                                                                                                                                              The name of the capture file. The default name includes the date and time stamp the capture was created.

                                                                                                                                                                                                                                                                                                                                                              Time frame

                                                                                                                                                                                                                                                                                                                                                              The period of time captured. The default time is 15 seconds; the maximum capture time available is 24 hours. The capture file size limit is 100MB.

                                                                                                                                                                                                                                                                                                                                                              Sysdig recommends using the default time to ensure captures are small and manageable.

                                                                                                                                                                                                                                                                                                                                                              Filter

                                                                                                                                                                                                                                                                                                                                                              Restricts the amount of trace information collected. For more information, including examples of available filters, refer to the Sysdig Github page.

                                                                                                                                                                                                                                                                                                                                                              Storage

                                                                                                                                                                                                                                                                                                                                                              The storage location for the capture files. The default storage location is the Sysdig Cloud Amazon S3 bucket. To configure a custom S3 storage bucket, refer to Configure AWS Capture File Storage.

                                                                                                                                                                                                                                                                                                                                                            The Sysdig agent will be signaled to start a capture, and send back the resulting trace file. The file will then be displayed in the Captures module.

                                                                                                                                                                                                                                                                                                                                                            Download a Capture File

                                                                                                                                                                                                                                                                                                                                                            To download a capture file:

                                                                                                                                                                                                                                                                                                                                                            1. From the Captures module, navigate to the target capture file.

                                                                                                                                                                                                                                                                                                                                                            2. Select the target capture file.

                                                                                                                                                                                                                                                                                                                                                            3. Click theDownloadbutton. A capture file will be automatically downloaded to your local machine.

                                                                                                                                                                                                                                                                                                                                                            Delete Capture Files

                                                                                                                                                                                                                                                                                                                                                            To delete a single capture file:

                                                                                                                                                                                                                                                                                                                                                            1. From the Captures module, select the capture file to be deleted.

                                                                                                                                                                                                                                                                                                                                                            2. Click the Delete button at the bottom of the Captures module:

                                                                                                                                                                                                                                                                                                                                                            3. On the Keep File prompt, click the Delete button to confirm, or the Keep File button to cancel.

                                                                                                                                                                                                                                                                                                                                                            To delete all capture files:

                                                                                                                                                                                                                                                                                                                                                            1. From the Captures module, click the Delete All button:

                                                                                                                                                                                                                                                                                                                                                            2. Click the Yes, Delete Captures button to confirm, or the Cancel button.

                                                                                                                                                                                                                                                                                                                                                            9.2 -

                                                                                                                                                                                                                                                                                                                                                            Review a Capture File

                                                                                                                                                                                                                                                                                                                                                            Explore a Capture File

                                                                                                                                                                                                                                                                                                                                                            1. From the Captures module, navigate to the target capture file.

                                                                                                                                                                                                                                                                                                                                                            2. Select the target capture file. You will see some action buttons at the bottom of the interface.

                                                                                                                                                                                                                                                                                                                                                            3. Click theExplorebutton. You will be directed to the Explore tab view of the capture.

                                                                                                                                                                                                                                                                                                                                                            Inspect a Capture File

                                                                                                                                                                                                                                                                                                                                                            1. From the Captures module, navigate to the target capture file.

                                                                                                                                                                                                                                                                                                                                                            2. Select the target capture file. You will see some action buttons at the bottom of the interface.

                                                                                                                                                                                                                                                                                                                                                            3. Click theInspectbutton. You will be directed to the Sysdig Inspect page of the capture.

                                                                                                                                                                                                                                                                                                                                                            10 -

                                                                                                                                                                                                                                                                                                                                                            Integrations for Sysdig Monitor

                                                                                                                                                                                                                                                                                                                                                            Integrate metrics with Sysdig Monitor from a number of platforms, orchestrators, and a wide range of applications. Sysdig collects metrics from Prometheus, JMX, StatsD, Kubernetes, and many application stacks to provide a 360-degree view of your infrastructure. Many metrics are collected by default out of the box; you can also extend the integration or create custom metrics.

                                                                                                                                                                                                                                                                                                                                                            Key Benefits

                                                                                                                                                                                                                                                                                                                                                            • Collects the richest data set for cloud-native visibility and security

                                                                                                                                                                                                                                                                                                                                                            • Polls data, auto-discover context in order to provide operational and security insights

                                                                                                                                                                                                                                                                                                                                                            • Extends the power of Prometheus metrics with additional insights from other metrics types and infrastructure stack

                                                                                                                                                                                                                                                                                                                                                            • Integrate Prometheus alert and events for Kubernetes monitoring needs

                                                                                                                                                                                                                                                                                                                                                            • Expose application metrics using Java JMX and MBeans monitoring

                                                                                                                                                                                                                                                                                                                                                            Key Integrations

                                                                                                                                                                                                                                                                                                                                                            • Prometheus Metrics

                                                                                                                                                                                                                                                                                                                                                              Describes how Sysdig Agent enables automatically collecting metrics from Prometheus exporters, how to set up your environment, and scrape Prometheus metrics from local as well as remote hosts.

                                                                                                                                                                                                                                                                                                                                                            • Java Management Extention (JMX) Metrics

                                                                                                                                                                                                                                                                                                                                                              Describes how to configure your Java virtual machines so Sysdig Agent can collect JMX metrics using the JMX protocol.

                                                                                                                                                                                                                                                                                                                                                            • StatsD Metrics

                                                                                                                                                                                                                                                                                                                                                              Describes how the Sysdig agent collects custom StatsD metrics with an embedded StatsD server.

                                                                                                                                                                                                                                                                                                                                                            • Node.JS Metrics

                                                                                                                                                                                                                                                                                                                                                              Illustrates how Sysdig is able to monitor node.js applications by linking a library to the node.js codebase.

                                                                                                                                                                                                                                                                                                                                                            • Integrate Applications

                                                                                                                                                                                                                                                                                                                                                              Describes the monitoring capabilities of Sysdig agent with application check scripts or ‘app checks’.

                                                                                                                                                                                                                                                                                                                                                            • Monitor Log Files

                                                                                                                                                                                                                                                                                                                                                              Learn how to search a string by using the chisel script called logwatcher.

                                                                                                                                                                                                                                                                                                                                                            Platform Metrics (IBM)

                                                                                                                                                                                                                                                                                                                                                            For Sysdig instances deployed on IBM Cloud Monitoring with Sysdig, an additional form of metrics collection is offered: Platform metrics. Rather than being collected by the Sysdig agent, when enabled, Platform metrics are reported to Sysdig directly by the IBM Cloud infrastructure.

                                                                                                                                                                                                                                                                                                                                                            Enable this feature by logging into the IBM Cloud console and selecting “Enable” for IBM Platform metrics under the Configure your resource section when creating a new IBM Cloud Monitoring with a Sysdig instance, as described here.

                                                                                                                                                                                                                                                                                                                                                            10.1 -

                                                                                                                                                                                                                                                                                                                                                            (Limited Availability) Configure Monitoring Integrations

                                                                                                                                                                                                                                                                                                                                                            Monitoring Integration is in a limited availability status and is not available in all the regions, yet. The feature is being rolled out to users.

                                                                                                                                                                                                                                                                                                                                                            Monitoring Integration provides an at-a-glance summary of workloads running in your infrastructure and a deeper insight into the health and performance of your services across platforms and the cloud. You can easily identify the workloads in your team scope, the service discovered (such as etcd) within each workload, and configure the Prometheus exporter integration to collect an