Skip to content

Operations

MazeVault Operations, Monitoring, and Maintenance

This section covers day-to-day operations, monitoring, backup procedures, and maintenance activities.

In This Section

Document Description
Health Checks Health endpoints and Kubernetes probes
Monitoring Prometheus metrics, alerting, and dashboards
Notifications & Alerting Notification integrations (JIRA, Teams, Slack, Email, Webhook), alert rules, incident response
Backup & Restore Backup strategy, RTO/RPO, recovery procedures
Maintenance Upgrade procedures, key rotation, scheduled maintenance
Gateway DR Failover Manual failover and failback procedure for gateway disaster recovery
Scaling Horizontal and vertical scaling recommendations