Talent.com
Site Reliability Engineer (Advanced)
Site Reliability Engineer (Advanced)Sabenza IT & Recruitment • Johannesburg, Gauteng, South Africa
Site Reliability Engineer (Advanced)

Site Reliability Engineer (Advanced)

Sabenza IT & Recruitment • Johannesburg, Gauteng, South Africa
30+ days ago
Job description

Exciting Opportunity : Site Reliability Engineer (Advanced) Industrial IoT & Edge Ecosystem

Are you a skilled Site Reliability Engineer passionate about cloud infrastructure containerization and automation Our client in the Motor Industry is looking for an advanced SRE to join their international DevOps teams working on cutting-edge Industrial IoT and Edge solutions for global production systems.

About the Role :

You will be part of an interdisciplinary team delivering platform solutions for smart factory wearables and production-critical cloud connections. Your work will directly impact industrial IoT self-service enabling innovative reliable and scalable systems across the globe.

Requirements

What You ll Do :

Design implement and maintain scalable cloud infrastructure (Azure preferred).

Manage and optimize Kubernetes clusters and containerized environments.

Set up monitoring alerts and troubleshoot system performance issues .

Participate in incident response and contribute to root cause analysis.

Collaborate with development teams to improve application reliability and performance .

Develop and maintain automation scripts and IaC practices .

Support security compliance and documentation initiatives.

Provide on-call support for the edge platform in a DevOps environment.

Essential Skills :

Docker & Container Orchestration (Docker Compose)

Python

Linux (Ubuntu preferred)

Advantageous Skills :

Bash Go C# Jinja2 PyTest

Networking GitHub Workflows Azure Cloud & VMs

Kubernetes AKS HELM Kustomize

Kusto Query Language (KQL)

Experience & Qualifications :

3 years hands-on experience with Docker Python Linux

Proven experience in container orchestration platforms

Strong problem-solving and collaboration skills in Agile environments

Docker, Python, Linux, Container Orchestration, AWM, Azure

Key Skills

Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting

Employment Type : Full Time

Experience : years

Vacancy : 1

Create a job alert for this search

Reliability Engineer • Johannesburg, Gauteng, South Africa