Site Reliability Engineer | Platform Engineering – CDP

InfotreeWorkFromHome, Wes-Kaap, South Africa

20 hours ago

Job description

Site Reliability Engineer (SRE III)

Salary budget : Approximate Rands 40,000-50,000 / month

Working location : Cape Town, South Africa

Working mode : Hybrid

Company background : A leading technology-driven organization specializing in scalable cloud and platform engineering solutions, committed to innovation, automation, and high system reliability.

Employment type : 5 years contract renewable

Role Summary

Are you passionate about building resilient, automated, and scalable systems in the cloud?

Do you thrive in fast-paced environments where reliability and performance are key to success?

We’re looking for a Site Reliability Engineer (SRE III) to join our Platform Engineering team supporting the CDP environment . In this role, you will design, maintain, and improve critical infrastructure that powers our applications. You will work closely with developers and operations teams to build systems that are automated, observable, and secure.

You’ll have the opportunity to shape our CI / CD pipelines, Infrastructure-as-Code (IaC) practices, and monitoring frameworks—ensuring that our systems are performant, compliant, and aligned with best DevOps standards.

Key Responsibilities

Maintain and improve system reliability, uptime, and performance across production and non-production environments.
Design, implement, and optimize CI / CD pipelines using GitHub and related automation tools.
Implement and manage AWS-based infrastructure using Infrastructure as Code (IaC) practices.
Develop scalable Kubernetes clusters and ensure containerized workloads meet performance and security standards.
Proactively monitor and respond to incidents using tools such as DataDog, driving root cause analysis and long-term stability improvements.
Enhance automation and observability to reduce manual intervention and mean time to recovery (MTTR).
Collaborate cross-functionally with engineering, product, and operations teams to ensure seamless deployment and reliability.
Ensure compliance with security and operational standards across environments.

Requirements

Experience : Minimum 4+ years in Site Reliability Engineering, DevOps, or related roles.

Primary Skills

GitHub and CI / CD pipeline design and maintenance

Automation and scripting for infrastructure reliability

Secondary Skills

Kubernetes and container orchestration

Monitoring and alerting tools (DataDog preferred)

Security compliance and environment hardening

Nice-to-have

Experience with cost optimization and performance tuning on AWS

Hands‑on experience with microservices and distributed systems

Exposure to DevSecOps or modern SRE frameworks

What We Offer

A full‑time permanent position in a technology‑driven environment.

Opportunities to lead reliability initiatives and influence infrastructure strategy.

Exposure to cutting‑edge cloud technologies and automation frameworks.

A collaborative, multicultural engineering culture focused on growth and innovation.

#J-18808-Ljbffr

Create a job alert for this search

Reliability Engineer • WorkFromHome, Wes-Kaap, South Africa

Related jobs

Promoted

Reliability & Qualification Engineer

Recruitpro SolutionsCape Town, South Africa

We are seeking a Reliability & Qualification Engineer to strengthen our client’s Hardware Engineering team.This role is ideal for an engineer who is passionate about testing the limits of technolog...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

Robin App ASWorkFromHome, Western Cape, South Africa

We are a pioneer in Legal AI, built on proprietary models, licensed data, anddeeppartnerships with Anthropic and AWS.Since 2019, we’ve expanded our footprint to 4 continents and have been supportin...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

CanonicalWorkFromHome, Western Cape, South Africa

Site Reliability Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the enterprise and technology markets, known for Ubuntu and open source ...Show moreLast updated: 30+ days ago

Promoted

Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy

GitLabWorkFromHome, Western Cape, South Africa

Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy.Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy. GitLab is an open-core software company tha...Show moreLast updated: 4 days ago

Promoted

Senior Site Reliability Engineer

MoonPayWorkFromHome, Wes-Kaap, South Africa

MoonPay is hiring a Senior Site Reliability Engineer in City of Cape Town, Western Cape, South Africa.We’re here to onboard the world to the decentralized economy. Because crypto and blockchain aren...Show moreLast updated: 21 days ago

Promoted

Senior Site Reliability Engineer

Communicate RecruitmentCape Town, South Africa

This isnt just engineering, its a.As the Senior Site Reliability Engineer, youll act as the.CI / CD pipelines like secure supply lines, reinforcing fault tolerance as strongholds, and closing securit...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

LexisNexisWorkFromHome, Western Cape, South Africa

LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of. Our company has been a long-time leader in deploying AI and advanced t...Show moreLast updated: 30+ days ago

Promoted

Intermediate Site Reliability Engineer, Database Operations

GitLabWorkFromHome, Western Cape, South Africa

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute t...Show moreLast updated: 2 days ago

Promoted

Site Reliability Engineer

Electrum PaymentsCape Town, Western Cape, South Africa

Electrum is the next-generation payments technology company that provides cloud-native software to optimise the processing of financial transactions. Since 2012, we have established ourselves as a r...Show moreLast updated: 24 days ago

Promoted

SRE (Site Reliability Engineer)

TravelstartCape Town, Western Cape, South Africa

SRE (Site Reliability Engineer).Continue with Google Continue with Google.Be among the first 25 applicants.SRE (Site Reliability Engineer). Get AI-powered advice on this job and more exclusive featu...Show moreLast updated: 30+ days ago

Promoted

SRE (Site Reliability Engineer)

TravelLab Global ABCape Town, Western Cape, South Africa

Our Travelstart team is seeking an.SRE (Site Reliability Engineer).This role ensures the reliability, performance, and scalability of the Travelstart systems. This role bridges the gap between softw...Show moreLast updated: 30+ days ago

Promoted
New!

Site Reliability Engineer

Mind DetectWorkFromHome, Wes-Kaap, South Africa

Mind Detect City of Cape Town, Western Cape, South Africa.Site Reliability Engineer (SRE) to join their world-class Engineering team, located in Cape Town (hybrid). As SRE, you will be responsible f...Show moreLast updated: 20 hours ago

Promoted
New!

Site Reliability Engineer

Infotree Global SolutionsWorkFromHome, Wes-Kaap, South Africa

Direct message the job poster from Infotree Global Solutions.SRE III – Platform Engineering (Customer Data Platform).Cape Town / Hybrid (2 days in office). Maintain and improve system reliability an...Show moreLast updated: 20 hours ago

Promoted

Senior Site Reliability Engineer

DuckDuckGoWorkFromHome, Western Cape, South Africa

Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Show moreLast updated: 30+ days ago

Senior Site Reliability Engineer

CyberSentriqCape Town, Western Cape, ZA

Quick Apply

Job Title : Senior Site Reliability Engineer (SRE) .Closing date : November 10, 2025.Location : Somerset-West, South Africa. Are you a skilled Senior SRE passionate about building resilient, ...Show moreLast updated: 30+ days ago

Promoted
New!

Site Reliability Engineer

Robin AICape Town, Western Cape, South Africa

Robin is on a mission to rebuild the legal industry — starting with making contracts simple for everyone.We are a pioneer in Legal AI, built on proprietary models, licensed data, and deep partnersh...Show moreLast updated: 20 hours ago

Promoted

Site Reliability Engineer Team Lead

Robin AICape Town, Western Cape, South Africa

Robin is on a mission to rebuild the legal industry starting withmaking contracts simple for everyone.We are a pioneer in Legal AI built on proprietary models licensed data anddeeppartnerships with...Show moreLast updated: 2 days ago

Promoted

Site Reliability Engineering Manager

CanonicalWorkFromHome, Western Cape, South Africa

Site Reliability Engineering Manager role at Canonical.Location : Remote in APAC region.Lead your team in daily agile devops practices. Represent the IS team to stakeholders, customers, and internal ...Show moreLast updated: 30+ days ago