Site Reliability Engineer
Anson McCade - Gloucester, ENG
Apply NowJob Description
Job Description Site Reliability Engineer (SRE)Location: Gloucester (Hybrid, 3 days onsite)Salary: Up to 65,000 + 7,000 bonusSecurity Clearance: Must be eligible for UK Developed Vetting (DV)Were hiring a Site Reliability Engineer to join a high-performing engineering environment delivering critical, complex systems. This role sits at the intersection of software engineering and operations, with a strong focus on automation, scalability, and system resilience.This is an excellent opportunity for someone with a software engineering background who is looking to move into a more systems-focused, reliability-driven career path without losing their hands-on technical edge.As an SRE, youll be responsible for ensuring the reliability, availability, and performance of mission-critical systems. Youll apply software engineering principles to infrastructure and operations challenges, reducing manual effort through automation and improving system design.Key Responsibilities Include:Supporting and maintaining live services, ensuring high availability and performanceAutomating operational processes to reduce manual interventionMonitoring, alerting, and observability improvements across systemsDiagnosing and resolving incidents across the full technology stackWorking closely with engineering teams to influence system design and reliabilityParticipating in an on-call rota (project-dependent)Contributing to continuous improvement of DevOps and SRE practicesWhat Were Looking ForWere interested in candidates who bring a strong engineering mindset and enjoy solving complex systems problems.Core Experience:2+ years commercial experience in this areaBackground in software engineering (e.g. Java, JavaScript, or similar)Experience working with cloud platforms (AWS, Azure, or similar)Strong Linux/Windows command line skills (Bash, PowerShell)Understanding of distributed systems, scalability, and resilienceExperience with monitoring/observability tools (e.g. ELK stack or similar)Familiarity with containers and microservices (e.g. Docker)Experience troubleshooting across infrastructure and application layersDesirable:Exposure to 2nd or 3rd line support environmentsKnowledge of CI/CD and deployment toolingExperience with infrastructure as code or configuration management toolsUnderstanding of ITIL or service management practicesAdditional RequirementsWillingness to participate in on-call support (depending on project)If youre a software engineer looking to broaden your impact into reliability, systems, and large-scale infrastructure, this role offers a strong platform to do exactly that.
Created: 2026-04-04