Monitor
dashboards and alert rules for the Infra module
Module:
Categories:
Dashboard
Pigsty has the following dashboards for the Infra module:
Pigsty Home
Pigsty Monitoring System Homepage
INFRA Overview
INFRA Overview Dashboard
Nginx Overview
Nginx Monitoring Metrics and Logs
Grafana Overview
Grafana Monitoring Metrics and Logs
Prometheus Overview
Prometheus Monitoring Metrics and Logs
Loki Overview
Loki Monitoring Metrics and Logs
Logs Instance
View logs on a single node
Logs Overview
View global logs
CMDB Overview
CMDB Visualization
Alert Rules
Pigsty provides the following two alert rules for the INFRA module:
InfraDown
: Infrastructure components are downAgentDown
: Monitoring agent is down
You can modify or add new infrastructure alert rules in files/prometheus/rules/infra.yml
.
################################################################
# Infrastructure Alert Rules #
################################################################
- name: infra-alert
rules:
#==============================================================#
# Infra Aliveness #
#==============================================================#
# infra components (prometheus,grafana) down for 1m triggers a P1 alert
- alert: InfraDown
expr: infra_up < 1
for: 1m
labels: { level: 0, severity: CRIT, category: infra }
annotations:
summary: "CRIT InfraDown {{ $labels.type }}@{{ $labels.instance }}"
description: |
infra_up[type={{ $labels.type }}, instance={{ $labels.instance }}] = {{ $value | printf "%.2f" }} < 1
#==============================================================#
# Agent Aliveness #
#==============================================================#
# agent aliveness are determined directly by exporter aliveness
# including: node_exporter, pg_exporter, pgbouncer_exporter, haproxy_exporter
- alert: AgentDown
expr: agent_up < 1
for: 1m
labels: { level: 0, severity: CRIT, category: infra }
annotations:
summary: 'CRIT AgentDown {{ $labels.ins }}@{{ $labels.instance }}'
description: |
agent_up[ins={{ $labels.ins }}, instance={{ $labels.instance }}] = {{ $value | printf "%.2f" }} < 1
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.