site reliability engineering (SRE)

See the following -

On the Critical Role of Site Reliability Engineering

Understanding the basics and best practices for establishing and maintaining a Site Reliability Engineering (SRE) program in an organization... SREs are responsible for the reliability, performance, availability, latency, efficiency, monitoring, emergency response, change management, release planning, and capacity planning of both infrastructure and software. As applications and infrastructure grow more complex, SRE teams help ensure that these systems can evolve.
