Lead Cloud Site Reliability Engineer

Lloyds Banking Group, Leeds

Salary not available. View on company website.

  • Full time
  • Permanent
  • Onsite working

Posted 6 days ago, 18 Apr

Closing date: not specified

Job ref: 317e5eeaf1f74a0a852709e0100a5279

Location ref: Leeds

Full Job Description

The world is changing, fast. And we're changing too. It's never been a more exciting time to join us as we transform our business to shape finance as a force for good. We're modernising with cloud, a platform that is quick, secure and resilient for customers and easy, modern and green for developers.

We're looking for a Lead Site Reliability Engineer (SRE) to help us strengthen reliability, observability and operational excellence across our Azure and GCP platforms. This is an exciting opportunity to lead a team of highly skilled SREs, influence engineering standards across the Group, and drive improvements that make a tangible difference to both customers and engineers.

What you'll do

As a Lead SRE, you will:

  • Lead a team of SREs (up to ~15) and create a culture of continuous improvement, learning, and engineering excellence.
  • Work closely with application teams during application migrations to the Cloud.
  • Work closely with Product Owners and Engineering Leads to balance new feature delivery with reliability, performance and system health.
  • Use data, observability tooling and SRE principles to detect issues early, improve system performance, and reduce operational toil.
  • Lead and mature incident and problem management practices, ensuring strong root-cause analysis, learning, and improvement of MTTF/MTTR.
  • Champion error budgets, SLOs, and reliability-first thinking across your aligned Cloud Labs.
  • Influence platform direction and engineering standards, helping shape how we build resilient cloud services at scale.

  • Strong cloud engineering background - ideally across GCP and Azure - with experience designing or operating large-scale, resilient cloud platforms.
  • Deep understanding of observability tooling (metrics, logs, traces) and how to drive reliability improvements using data.
  • Hands-on experience of modern SRE practices: SLOs / SLIs, error budgets, reducing toil through automation, and production readiness and post-mortem best practice.

Leadership & Collaboration

  • Experience leading engineering teams and fostering an inclusive, high-performing culture.
  • Ability to navigate complex stakeholder groups and communicate technical topics in a clear, accessible way.

Mindset and Behaviours

  • Technology-agnostic, adaptable thinker who selects the best tool or approach for the job.
  • Curiosity and a commitment to continuous learning and improvement - both for yourself and your team.
  • Passion for engineering excellence, platform health, and proactive reliability.

About You

You're someone who:

  • Is passionate about building resilient, observable, customer-focused platforms.
  • Has a strong understanding of GitHub pipelines and Terraform modules.
  • Enjoys coaching others, sharing knowledge and shaping engineering culture.
  • Looks for opportunities to remove toil and introduce automation.
  • Thrives in collaborative, multi-functional environments.
  • Adopts new tools, technologies and modern engineering approaches.
  • Values diverse perspectives, psychological safety and inclusive ways of working.

At Lloyds Banking Group, we're all driven by our purpose, to Help Britain Prosper. It's why we exist - it's our reason to get out of bed in the morning. The choices we make, our success and our future really matter.

At Lloyds Banking Group, we're driven by a clear purpose: to help Britain prosper. Across the Group, our colleagues are focused on making a difference to customers, businesses and communities. With us you'll have a key role to play in shaping the financial services of the future, whilst the scale and reach of our Group means you'll have many opportunities to learn, grow and develop.

We keep your data safe. So, we'll only ever ask you to provide confidential or sensitive information once you have formally been invited along to an interview or accepted a verbal offer to join us, which is when we run our background checks. We'll always explain what we need and why, with any request coming from a trusted Lloyds Banking Group person.

We're focused on creating a values-led culture and are committed to building a workforce which reflects the diversity of the customers and communities we serve. Together we're building a truly inclusive workplace where all of our colleagues have the opportunity to make a real difference.

What You'll Get in Return

You'll join a forward-thinking platform organisation that:

  • Is modernising at scale with Cloud, AI-enabled operations and real-time observability
  • Encourages innovation, autonomy and engineering craft
  • Invests in colleague development, learning pathways and progression
  • Champions diversity, equity and inclusion across everything we do

You'll help shape the future of cloud operations in one of the UK's largest financial institutions - and your work will have real impact on millions of customers.

We also offer a wide-ranging benefits package, which includes:

  • A generous pension contribution of up to 15%
  • An annual performance-related bonus
  • Share schemes including free shares
  • Benefits you can adapt to your lifestyle, such as discounted shopping
  • 30 days' holiday, with bank holidays on top
  • A range of wellbeing initiatives and generous parental leave policies

Direct job link

https://www.jobs24.co.uk/job/lead-cloud-site-reliability-engineer-126712326
