Site Reliability Engineer
BJSS Limited, Casnewydd
Site Reliability Engineer
Salary not available. View on company website.
BJSS Limited, Casnewydd
- Full time
- Permanent
- Remote working
Posted 3 days ago, 23 Jun | Get your application in today.
Closing date: Closing date not specified
Job ref: 958b4d2611e94c3583d9ad87a6567318
Location ref: Casnewydd
Full Job Description
Type: ContractWFH: Fully RemoteRate: £300-400 per day (Inside)Location: RemoteExperience level: Around 3-5 years, though candidates with slightly more experience may be relevant if rate-aligned
A market-leading consultancy is looking for a Site Reliability Engineer to support an AWS-hosted data platform, working across reliability, observability, automation and operational excellence.This role would suit an SRE, DevOps Engineer or Platform Engineer with strong AWS experience, hands-on Kubernetes/EKS exposure, and a good understanding of observability, monitoring and incident management. The successful contractor will help define and operationalise SLIs, SLOs and error budgets across critical data services, ensuring the platform is reliable, scalable and well-monitored. Public sector experience is highly desirable, and SC clearance is preferred but not required.
Key ResponsibilitiesYou will define and operationalise SLIs, SLOs and error budgets for critical data services and platform components.You will build and maintain observability dashboards and monitoring frameworks using tools such as Dynatrace, Prometheus and associated monitoring/logging/tracing platforms.You will implement end-to-end monitoring across metrics, logs and traces, helping the team detect issues proactively before they impact users.You will work across the AWS ecosystem, supporting workloads running on EKS / Kubernetes.You will collaborate closely with developers, architects and platform teams to improve reliability, scalability, performance and operational resilience.You will support incident response, root cause analysis and blameless post-mortems, helping drive long-term improvements rather than short-term fixes.You will automate repetitive operational tasks to reduce toil and improve platform efficiency.You will help establish and track the key golden signals: latency,
traffic, errors and saturation.You will contribute to reliability and resilience backlogs, helping identify improvements across monitoring, alerting, automation and platform stability.
Essential SkillsStrong commercial experience as an SRE, DevOps Engineer, Platform Engineer or Cloud Engineer.Strong AWS experience, ideally within production-scale environments.Hands-on experience with Kubernetes, ideally Amazon EKS.Experience with observability and monitoring tools such as Dynatrace, Prometheus, Grafana, CloudWatch, OpenTelemetry, ELK or similar.Understanding of SLIs, SLOs, error budgets and golden signals.Experience supporting incident management, root cause analysis and post-incident improvement work.Automation experience using scripting or IaC tooling such as Terraform, Python, Bash, Ansible or similar.Good understanding of platform reliability, scalability, resilience and performance.Desirable SkillsExperience working with data platforms, data pipelines or data-heavy environments.Exposure to batch or streaming data workloads, such as Kafka, Spark, Airflow, Glue, EMR, Databricks or similar.Data observability experience.Previous public sector experience across multiple
engagements.Active or lapsed SC clearance.Consultancy experience would be beneficial, especially within environments such as Capgemini, Accenture, BJSS, Kainos, Sopra Steria, CGI, PA Consulting, Leidos, BAE Digital Intelligence, Deloitte, Cognizant or similar.
Direct job link
Relevant jobs
- Tech / Digital / IT Jobs in Aberdare, Rhondda Cynon Taf - Rhondda Cynon Taf
- Tech / Digital / IT Jobs in Aberdaugleddau, Sir Benfro - Pembrokeshire
- Tech / Digital / IT Jobs in Y Fenni, Sir Fynwy - Monmouthshire
- Tech / Digital / IT Jobs in Aberteifi, Sir Ceredigion - Ceredigion
- Tech / Digital / IT Jobs in Bangor
- Tech / Digital / IT Jobs in Barri, Bro Morgannwg - the Vale of Glamorgan
- Tech / Digital / IT Jobs in Bridgend, Cumbria
- Tech / Digital / IT Jobs in Caerffili, Caerffili - Caerphilly
- Tech / Digital / IT Jobs in Cardiff
- Tech / Digital / IT Jobs in Caerfyrddin, Sir Gaerfyrddin - Carmarthenshire
- Tech / Digital / IT Jobs in Casnewydd
- Tech / Digital / IT Jobs in Bae Colwyn, Conwy - Conwy
- Tech / Digital / IT Jobs in Cwmbran, Tor-faen - Torfaen
- Tech / Digital / IT Jobs in Doc Penfro, Sir Benfro - Pembrokeshire
- Tech / Digital / IT Jobs in Glyn Ebwy, Blaenau Gwent - Blaenau Gwent
- Tech / Digital / IT Jobs in Hwlffordd, Sir Benfro - Pembrokeshire
- Tech / Digital / IT Jobs in Llanelli, Sir Gaerfyrddin - Carmarthenshire
- Tech / Digital / IT Jobs in Maesteg, Pen-y-bont ar Ogwr - Bridgend
- Tech / Digital / IT Jobs in Merthyr Tudful, Merthyr Tudful - Merthyr Tydfil
- Tech / Digital / IT Jobs in Mold, Sir y Fflint - Flintshire
- Tech / Digital / IT Jobs in Castell-nedd, Castell-nedd Port Talbot - Neath Port Talbot
- Tech / Digital / IT Jobs in Newport, Middlesbrough
- Tech / Digital / IT Jobs in Penarth, Bro Morgannwg - the Vale of Glamorgan
- Tech / Digital / IT Jobs in Pontypool, Tor-faen - Torfaen
- Tech / Digital / IT Jobs in Pontypridd, Rhondda Cynon Taf - Rhondda Cynon Taf
- Tech / Digital / IT Jobs in Port Talbot, Castell-nedd Port Talbot - Neath Port Talbot
- Tech / Digital / IT Jobs in Prestatyn, Sir Ddinbych - Denbighshire
- Tech / Digital / IT Jobs in Rhondda, Rhondda Cynon Taf - Rhondda Cynon Taf
- Tech / Digital / IT Jobs in Rhuthun, Sir Ddinbych - Denbighshire
- Tech / Digital / IT Jobs in Abertawe
- Tech / Digital / IT Jobs in Wrecsam, Wrecsam - Wrexham