Job Description
As the Site Reliability Engineer / Lead SRE you will be responsible for noticing and effectively responding to service failures, ensuring service availability within agreed SLAs. You'll measure and monitor availability, latency and overall system health; review panned systems changes and manage the deployment and removal of computing resources in a predictable manner via automation. You will take ownership of current services which are a mix of EC2 and Kubernetes environment and become the SME for services which you monitor, driving improvements.
As the Site Reliability Engineer / Lead SRE you will build a team around you and have line management responsibilities whilst remaining hands-on.
Driven by technology the company can offer a fully remote interview and onboarding process; you'll work with latest MacBook Pro and collaborate remotely with colleagues, joining the team in London for 2 days a week when able to do so.
Requirements:
*Experience of developing infrastructure architectures
*Strong AWS experience
*Experience of supporting live systems as part of a wider SRE / DevOps methodology
*Good knowledge of Containerisation technologies including Docker and Kubernetes (or Terraform)
*Scripting ability (Python, bash)
*Collaborative with good communication skills, confident dealing with senior stakeholders
As the Site Reliability Engineer / Lead SRE you will earn a competitive salary (to £90k) plus benefits.
Apply now or call to find out more about this Site Reliability Engineer / Lead SRE opportunity.
