Monitored server infrastructure and instance clusters to ensure high availability and effective performance. Used a combination of tools, including Prometheus, MySQL, Slack, Grafana, Datadog, and PagerDuty, to capture metrics such as CPU utilization, memory usage, disk space, and I/O operations.