Monitor

Dashboards and alert rules for the INFRA module

Dashboard

Pigsty provides the following dashboards for the INFRA module:


Pigsty Home

Pigsty Monitoring System Homepage

Pigsty Home Dashboard

pigsty.jpg


INFRA Overview

INFRA Overview Dashboard

INFRA Overview Dashboard

infra-overview.jpg


Nginx Overview

Nginx Monitoring Metrics and Logs

Nginx Overview Dashboard

nginx-overview.jpg


Grafana Overview

Grafana Monitoring Metrics and Logs

Grafana Overview Dashboard

grafana-overview.jpg


Prometheus Overview

Prometheus Monitoring Metrics and Logs

Prometheus Overview Dashboard

prometheus-overview.jpg


Loki Overview

Loki Monitoring Metrics and Logs

Loki Overview Dashboard

loki-overview.jpg


Logs Instance

View logs on a single node

Logs Instance Dashboard

logs-instance.jpg


Logs Overview

View global logs

Logs Overview Dashboard

logs-overview.jpg


CMDB Overview

CMDB Visualization

CMDB Overview Dashboard

cmdb-overview.jpg


Alert Rules

Pigsty provides the following two alert rules for the INFRA module:

  • InfraDown: an infrastructure component is down
  • AgentDown: a monitoring agent is down

You can modify or add new infrastructure alert rules in files/prometheus/rules/infra.yml.

################################################################
#                Infrastructure Alert Rules                    #
################################################################
- name: infra-alert
  rules:

    #==============================================================#
    #                       Infra Aliveness                        #
    #==============================================================#
    # infra components (prometheus, grafana) down for 1m triggers a CRIT (level 0) alert
    - alert: InfraDown
      expr: infra_up < 1
      for: 1m
      labels: { level: 0, severity: CRIT, category: infra }
      annotations:
        summary: "CRIT InfraDown {{ $labels.type }}@{{ $labels.instance }}"
        description: |
          infra_up[type={{ $labels.type }}, instance={{ $labels.instance }}] = {{ $value | printf "%.2f" }} < 1

    #==============================================================#
    #                       Agent Aliveness                        #
    #==============================================================#

    # agent aliveness is determined directly by exporter aliveness,
    # including: node_exporter, pg_exporter, pgbouncer_exporter, haproxy_exporter
    - alert: AgentDown
      expr: agent_up < 1
      for: 1m
      labels: { level: 0, severity: CRIT, category: infra }
      annotations:
        summary: 'CRIT AgentDown {{ $labels.ins }}@{{ $labels.instance }}'
        description: |
          agent_up[ins={{ $labels.ins }}, instance={{ $labels.instance }}] = {{ $value | printf "%.2f" }} < 1
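
As an illustration of adding a rule of your own, the sketch below appends one more entry to the `rules:` list of the `infra-alert` group shown above. It is a hypothetical example, not a rule shipped with Pigsty: the alert name `GrafanaDown`, the 5m window, and the assumption that `infra_up` carries a `type="grafana"` label value are all illustrative, so check the actual label values in your Prometheus before relying on it.

```yaml
    #==============================================================#
    #          Custom rule (HYPOTHETICAL, not part of Pigsty)      #
    #==============================================================#
    # Fires only for Grafana, and only after it has been down for 5m.
    # Assumption: infra_up exposes the component name as type="grafana".
    - alert: GrafanaDown
      expr: infra_up{type="grafana"} < 1
      for: 5m
      labels: { level: 0, severity: CRIT, category: infra }
      annotations:
        summary: "CRIT GrafanaDown {{ $labels.instance }}"
        description: |
          infra_up[type=grafana, instance={{ $labels.instance }}] = {{ $value | printf "%.2f" }} < 1
```

The entry reuses the label and annotation layout of the two built-in rules so that it groups with them under `category: infra`. Prometheus has to reload its rule files before a new or changed rule takes effect.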




Last modified 2025-04-09: update infra doc (5591e0a)