Join Our Team as a Senior Site Reliability Engineer!
Are you passionate about building and maintaining resilient systems?
If you're ready to dive deep into technical challenges and drive mission-critical projects, we want you on our team!
We're hiring a Senior Site Reliability Engineer for a 3-year renewable contract in a hybrid role based in Menlyn / Midrand.
What You'll Bring to the Table

Essential Skills
Container Expertise: Skilled in Kubernetes or similar container orchestration platforms.
Unix/Linux Knowledge: Strong understanding of Unix/Linux internals, administration, and networking stack.
Networking Mastery: Proficient with subnetting, routing, firewalls, DNS, (reverse) proxies, and OSI layers.
TCP Stack Debugging: In-depth understanding of the TCP stack and ability to debug at the socket/driver level.
Programming Skills: Proficiency in one or more languages, such as C# or JavaScript/TypeScript.
Agile Responsibilities: Open to additional duties as outlined in the Agile Working Model (AWM) Charter.

Advantageous Skills
Building highly automated CI/CD pipelines with custom scripts using Bash or Python.
IaaS Knowledge: Skilled in deploying and maintaining Linux VMs using Ansible.
Cloud Technologies Experience: Exposure to Confluent Kafka, managed Kubernetes engines (e.g., AKS), proxies like Azure Application Gateway, managed databases such as PostgreSQL, CI/CD pipeline systems from GitLab or GitHub Actions, and Git and repository platforms like GitLab, Nexus, and GitHub (self-hosted).

Soft Skills
Ability to stay structured and confident, even in high-stress incidents.
Strong communication skills with a collaborative spirit.
Adaptability to handle various project components and tasks.
Willingness to coach and train team members through deep-dive workshops and pair-programming.
Available for occasional international travel (twice a year).
A strong commitment to work ethics.

What You'll Be Doing
Support & Development: Work within a large product team to support and develop mission-critical components, improving system resiliency at every step.
Collaboration: Partner with development teams and product owners to plan and coordinate 'Design for Run' activities.
Lifecycle Improvement: Enhance service lifecycle processes, from design to deployment, for these essential systems.
Global Impact: Participate in a 24/7 on-call rotation, ensuring fast restoration and reliability for systems worldwide.

What We Offer
Cutting-Edge Technology: Work with advanced global IT systems (Cloud and Edge technologies on Microsoft Azure).
Flexible Working Hours: hours annually, with a schedule that supports work-life balance.
Hybrid Flexibility: Work both remotely and on-site as needed.
Exclusive Benefits: Affordable vehicle promotions (buying or leasing options available).
Vibrant Work Environment: Join a fast-paced, global team in modern, state-of-the-art offices.
Agile Methodology: Operate within an Agile Working Model (Scrum) to keep projects dynamic and efficient.

Ready to make an impact in a challenging, rewarding environment?
Apply today and become part of a team that values innovation and growth!
Send CVs to