Domain Masterclass

Monitoring & Logging

You cannot fix what you cannot see. Build complete observability stacks with Prometheus, Grafana, and the ELK stack.

Start Learning
160+
Articles
9
Sub-topics
62%
Avg Completion
What You'll Learn
  • Prometheus 18 guides
  • Grafana 15 guides
  • Datadog 12 guides
  • New Relic 8 guides
  • ELK Stack 18 guides
  • Fluentd / Fluent Bit 10 guides
  • Loki & Promtail 8 guides
  • Alerting & On-Call 10 guides
  • + 1 more topics below

Prometheus

Metrics collection, PromQL, exporters and Alertmanager

18 guides

Grafana

Dashboards, datasources, alerts and Grafana Cloud

15 guides

Datadog

APM, infrastructure monitoring, logs and synthetics

12 guides

New Relic

Full-stack observability, NRQL and distributed tracing

8 guides

ELK Stack

Elasticsearch, Logstash, Kibana — centralized logging

18 guides

Fluentd / Fluent Bit

Log shipping, parsing, routing and Kubernetes logging

10 guides

Loki & Promtail

Grafana Loki log aggregation and LogQL queries

8 guides

Alerting & On-Call

PagerDuty, OpsGenie, alert routing and on-call runbooks

10 guides

Distributed Tracing

Jaeger, Zipkin, OpenTelemetry and trace analysis

10 guides

Core Concepts

The 3 Pillars

Metrics (Prometheus), Logs (ELK/Loki), and Traces (Jaeger/Zipkin).

PromQL & Dashboards

Writing PromQL queries and building Grafana dashboards.

Log Aggregation

Shipping, parsing, and searching logs at scale with the ELK stack.

Alerting Strategies

Alert thresholds, silencing, escalation policies, and SLO-based alerts.

Learning Roadmap

1
Phase 1: Beginner

Set up Prometheus with node_exporter, connect to Grafana, and create your first dashboard.

Prometheus
Grafana
Datadog
New Relic
2
Phase 2: Intermediate

ELK stack for centralized logging, custom dashboards, Alertmanager configuration.

ELK Stack
Fluentd / Fluent Bit
Loki & Promtail
Alerting & On-Call
3
Phase 3: Advanced

OpenTelemetry distributed tracing, SLO/SLA alerting, and observability-as-code with Grafonnet.

Distributed Tracing

Related Articles & Guides

Career Path

Become a Monitoring & Logging Expert

This domain is a core requirement for senior engineering roles.

View Full Path

Topics in This Domain

Prometheus Grafana ELK Loki Datadog APM Prometheus Grafana Datadog New Relic ELK Stack Fluentd / Fluent Bit Loki & Promtail Alerting & On-Call
Sandbox On-Demand

Practice Labs

Jump into interactive sandboxes and solve real-world Monitoring & Logging challenges.

devknow@host:~$ sandbox load monitoring
[LOAD] Calibrating live environment...
Ready (http://localhost:3000)
Go to Practice Labs