Section: IT & Technology · Cloud ComputingDifficulty: Easy

Infrastructure Monitoring

USUK

Tracking the health and performance of servers, networks, and cloud resources.

Definition

Infrastructure monitoring is the continuous observation of compute, networking, and storage resources to ensure availability, performance, and capacity. It collects metrics such as CPU utilization, memory usage, disk I/O, network throughput, and error rates from physical servers, virtual machines, containers, and cloud services. Platforms like Prometheus, Grafana, Datadog, and AWS CloudWatch aggregate and visualize these metrics, enabling proactive alerting and capacity planning before resource exhaustion causes outages.

Example

The operations team configured CloudWatch alarms to page on-call engineers when any EC2 instance's CPU utilization exceeds 90% for 5 consecutive minutes, catching scaling issues before they impact users.

Synonyms

  • server monitoring
  • cloud monitoring
  • resource monitoring
  • ops monitoring

Antonyms / Opposites

  • unmonitored infrastructure
  • blind infrastructure

Images

CC-licensed · free to use
More on Wikimedia
Loading images…

Video

  • monitoring
  • observability
  • sre
  • auto-scaling

Dictionary Entry

Back to IT & Technology