Recently added jobs Remote working jobs Apprenticeships & Graduates Education & Public Sector jobs Paid by the hour / Shift work

640 Tech / Digital / IT jobs in Bexleyheath, Bexley

Senior IT Engineer (Contr...

Salary not available. View on company website.

Huawei, City of Westminster

  • Full time

Apply on company site

Posted 1 day ago, 22 Jul

Principal Systems Enginee...

Salary not available. View on company website.

The Bbc, Manor Park, Newham

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Senior Business Analyst -...

Salary not available. View on company website.

NTT DATA, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Successful jobseekers create high quality email alerts

A great alert means less time searching & more time applying.

Associate Infrastructure ...

Salary not available. View on company website.

Visa, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Full Stack Software Engin...

£81999-£91110

Lloyds Banking Group, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Infrastructure and Connec...

£75000

Whitbread Plc, Holborn, Camden

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Senior Forward Deployed E...

Salary not available. View on company website.

Intercom, Inc., City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Technical Support Special...

Salary not available. View on company website.

Samsara Networks Inc., City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Technology Services Assis...

Salary not available. View on company website.

Lewis Silkin LLP, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Technical Support Special...

Salary not available. View on company website.

Samsara Networks Inc., City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Technical Support Special...

Salary not available. View on company website.

Samsara Networks Inc., City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

Lead Software Engineer

£90000-£100000

ITV Consumer Limited 2024, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 1 day ago, 22 Jul

CRM - Solutions Architect

Salary not available. View on company website.

The Football Association, Wembley, Greater London

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

Sr Chip Design Engineer

Salary not available. View on company website.

Qualitest, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

Broadcast Communications ...

Salary not available. View on company website.

Amazon.com, Inc, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

Engineering Manager - ML ...

Salary not available. View on company website.

Sainsbury’s Group, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

Field Engineer - Tech

Salary not available. View on company website.

Babble Cloud, Fryerns, Basildon

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

DevOps Engineer

Salary not available. View on company website.

Telefónica Tech, City of Westminster

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

Senior Data Engineer

Salary not available. View on company website.

OSB Group, Chatham, Medway

  • Full time
  • Permanent

Apply on company site

Posted 2 days ago, 21 Jul

Solutions Architect (1yr ...

Salary not available. View on company website.

The Football Association, Wembley, Greater London

  • Full time
  • Temporary

Apply on company site

Posted 2 days ago, 21 Jul
Close

Senior IT Engineer (Contractor) - AI Infrastructure Management

Salary not available. View on company website.

Huawei, City of Westminster

  • Onsite working
  • Full time

Posted 1 day ago, 22 Jul

Job ref: f9a0a9692609484781f84c41b8da1300

Full Job Description

We are looking for a highly skilled Senior IT Engineer to manage a large-scale AI development and training infrastructure. The role involves overseeing GPU servers, Kubernetes clusters (Rancher), and storage systems to ensure seamless operations and optimized performance. You will collaborate with development teams, ensuring they have the resources and support needed to run their projects efficiently. This is a critical technical position requiring expertise in Kubernetes, hardware management, automation,

  • Kubernetes and Rancher Management: Configure, scale, and maintain Kubernetes clusters and Rancher for multi-cluster management, ensuring optimal performance and resource allocation.
  • GPU Resource Management: Manage GPU resources and servers, ensuring efficient resource scheduling, load balancing, and performance optimization for AI workloads.
  • Storage Management: Maintain and optimize large storage systems, ensuring high availability, performance, and data persistence.
  • DevOps and Automation: Implement CI/CD pipelines and automate infrastructure management using tools such as Terraform, Ansible, Jenkins, and GitLab CI.
  • Monitoring and Troubleshooting: Set up and manage monitoring and logging systems (e.g., Prometheus, Grafana, ELK) to ensure high availability and rapid issue resolution.
  • AI Framework Optimization: Collaborate with data scientists and AI developers to optimize AI frameworks (e.g., TensorFlow, PyTorch) for GPU and cluster environments.
  • Security and Access Management: Implement and manage role-based access control (RBAC) and ensure data security, encryption, and backup procedures are in place.
  • Team Support and Collaboration: Provide technical support and training to AI teams, ensuring smooth operations and effective use of infrastructure.
  • This job description is only an outline of the tasks, responsibilities and outcomes required of the role. The jobholder will carry out any other duties as may be reasonably required by his/her line manager. The job description and personal specification may be reviewed on an ongoing basis in accordance with the changing needs of Huawei Research and Development UK Limited.

    + Proven experience in managing large-scale Kubernetes clusters and containerisation technologies (e.g., Docker). + Strong understanding of GPU resource management and optimization for AI workloads. + Expertise in managing large storage systems and implementing data persistence strategies. + Proficiency in scripting and automation (Python, Bash, Go), with experience in infrastructure as code (IaC) using Terraform, Ansible, or similar tools. + Familiarity with deep learning frameworks (e.g., TensorFlow, PyTorch) and experience optimizing them for large-scale environments. + Experience with monitoring and logging tools such as Prometheus, Grafana, and ELK. + Excellent communication and collaboration skills, with a proactive approach to problem-solving and supporting technical teams.
  • Desired:
  • + Experience with Rancher or other Kubernetes management platform + Experience in managing hybrid cloud environments + Preferred Red Hat Certified System Administrator (RHCSA) + Preferred Certified Kubernetes Administrator (CKA) + Preferred Mandarin Speaker.

    Founded in 1987, Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. We have 207,000 employees and operate in over 170 countries and regions, serving more than three billion people around the world. Our vision and mission is to bring digital to every person, home and organization for a fully connected, intelligent world. To this end, we will drive ubiquitous connectivity and promote equal access to networks; bring cloud and artificial intelligence to all four corners of the earth to provide superior computing power where you need it, when you need it; build digital platforms to help all industries and organizations become more agile, efficient, and dynamic; redefine user experience with AI, making it more personalized for people in all aspects of their life, whether they're at home, in the office, or on the go. This spirit of innovation has led Huawei to work in close partnership with leading academic institutions in the UK to develop and refine the latest technologies. With a shared commitment to innovation and progress, both parties have worked together to achieve common goals and establish a strong partnership. The partnership between UK and Huawei help to develop the technologies of the future that will transform the way we all communicate, work and live. For the past 30 years we have maintained an unwavering focus, rejecting shortcuts and easy opportunities that don't align with our core business. With a practical approach to everything we do, we concentrate our efforts and invest patiently to drive technological breakthroughs., Huawei's vision is a fully connected, intelligent world. To achieve this, we work to inspire passion for basic research around the world. Our combined passion drives development across the global innovation value chain. Huawei has the largest Research and Development organization in the world with 96,000+ employees in research centers around the globe. In the UK, we already have design centers in Cambridge, London, Edinburgh and Ipswich. We continue to explore and define new research directions and new services. We have expanded our collaborations with academic researchers; researched new network architectures, integration of communications and key enabling technologies; and developed the fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward.

Do you like this job?

We can email jobs like this to your inbox

  • Facebook

Copy the direct link to this job

www.jobs24.co.uk/job/senior-it-engineer-contractor-ai-infrastructure-management-125410963
Displaying results 21 to 40 of 640 found
Create a high quality job alert