Talent.com
Site Reliability Engineer | Platform Engineering – CDP

Site Reliability Engineer | Platform Engineering – CDP

InfotreeWorkFromHome, Wes-Kaap, South Africa
20 hours ago
Job description

Site Reliability Engineer (SRE III)

Salary budget : Approximate Rands 40,000-50,000 / month

Working location : Cape Town, South Africa

Working mode : Hybrid

Company background : A leading technology-driven organization specializing in scalable cloud and platform engineering solutions, committed to innovation, automation, and high system reliability.

Employment type : 5 years contract renewable

Role Summary

Are you passionate about building resilient, automated, and scalable systems in the cloud?

Do you thrive in fast-paced environments where reliability and performance are key to success?

We’re looking for a Site Reliability Engineer (SRE III) to join our Platform Engineering team supporting the CDP environment . In this role, you will design, maintain, and improve critical infrastructure that powers our applications. You will work closely with developers and operations teams to build systems that are automated, observable, and secure.

You’ll have the opportunity to shape our CI / CD pipelines, Infrastructure-as-Code (IaC) practices, and monitoring frameworks—ensuring that our systems are performant, compliant, and aligned with best DevOps standards.

Key Responsibilities

  • Maintain and improve system reliability, uptime, and performance across production and non-production environments.
  • Design, implement, and optimize CI / CD pipelines using GitHub and related automation tools.
  • Implement and manage AWS-based infrastructure using Infrastructure as Code (IaC) practices.
  • Develop scalable Kubernetes clusters and ensure containerized workloads meet performance and security standards.
  • Proactively monitor and respond to incidents using tools such as DataDog, driving root cause analysis and long-term stability improvements.
  • Enhance automation and observability to reduce manual intervention and mean time to recovery (MTTR).
  • Collaborate cross-functionally with engineering, product, and operations teams to ensure seamless deployment and reliability.
  • Ensure compliance with security and operational standards across environments.

Requirements

Experience : Minimum 4+ years in Site Reliability Engineering, DevOps, or related roles.

Primary Skills

  • GitHub and CI / CD pipeline design and maintenance
  • Automation and scripting for infrastructure reliability
  • Secondary Skills

  • Kubernetes and container orchestration
  • Monitoring and alerting tools (DataDog preferred)
  • Security compliance and environment hardening
  • Nice-to-have

  • Experience with cost optimization and performance tuning on AWS
  • Hands‑on experience with microservices and distributed systems
  • Exposure to DevSecOps or modern SRE frameworks
  • What We Offer

  • A full‑time permanent position in a technology‑driven environment.
  • Opportunities to lead reliability initiatives and influence infrastructure strategy.
  • Exposure to cutting‑edge cloud technologies and automation frameworks.
  • A collaborative, multicultural engineering culture focused on growth and innovation.
  • #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • WorkFromHome, Wes-Kaap, South Africa

    Related jobs
    • Promoted
    Reliability & Qualification Engineer

    Reliability & Qualification Engineer

    Recruitpro SolutionsCape Town, South Africa
    We are seeking a Reliability & Qualification Engineer to strengthen our client’s Hardware Engineering team.This role is ideal for an engineer who is passionate about testing the limits of technolog...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Robin App ASWorkFromHome, Western Cape, South Africa
    We are a pioneer in Legal AI, built on proprietary models, licensed data, anddeeppartnerships with Anthropic and AWS.Since 2019, we’ve expanded our footprint to 4 continents and have been supportin...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalWorkFromHome, Western Cape, South Africa
    Site Reliability Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the enterprise and technology markets, known for Ubuntu and open source ...Show moreLast updated: 30+ days ago
    • Promoted
    Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy

    Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy

    GitLabWorkFromHome, Western Cape, South Africa
    Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy.Intermediate Site Reliability Engineer, GitLab Delivery - Release and Deploy. GitLab is an open-core software company tha...Show moreLast updated: 4 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    MoonPayWorkFromHome, Wes-Kaap, South Africa
    MoonPay is hiring a Senior Site Reliability Engineer in City of Cape Town, Western Cape, South Africa.We’re here to onboard the world to the decentralized economy. Because crypto and blockchain aren...Show moreLast updated: 21 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Communicate RecruitmentCape Town, South Africa
    This isnt just engineering, its a.As the Senior Site Reliability Engineer, youll act as the.CI / CD pipelines like secure supply lines, reinforcing fault tolerance as strongholds, and closing securit...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    LexisNexisWorkFromHome, Western Cape, South Africa
    LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of. Our company has been a long-time leader in deploying AI and advanced t...Show moreLast updated: 30+ days ago
    • Promoted
    Intermediate Site Reliability Engineer, Database Operations

    Intermediate Site Reliability Engineer, Database Operations

    GitLabWorkFromHome, Western Cape, South Africa
    GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute t...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Electrum PaymentsCape Town, Western Cape, South Africa
    Electrum is the next-generation payments technology company that provides cloud-native software to optimise the processing of financial transactions. Since 2012, we have established ourselves as a r...Show moreLast updated: 24 days ago
    • Promoted
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    TravelstartCape Town, Western Cape, South Africa
    SRE (Site Reliability Engineer).Continue with Google Continue with Google.Be among the first 25 applicants.SRE (Site Reliability Engineer). Get AI-powered advice on this job and more exclusive featu...Show moreLast updated: 30+ days ago
    • Promoted
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    TravelLab Global ABCape Town, Western Cape, South Africa
    Our Travelstart team is seeking an.SRE (Site Reliability Engineer).This role ensures the reliability, performance, and scalability of the Travelstart systems. This role bridges the gap between softw...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Mind DetectWorkFromHome, Wes-Kaap, South Africa
    Mind Detect City of Cape Town, Western Cape, South Africa.Site Reliability Engineer (SRE) to join their world-class Engineering team, located in Cape Town (hybrid). As SRE, you will be responsible f...Show moreLast updated: 20 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Infotree Global SolutionsWorkFromHome, Wes-Kaap, South Africa
    Direct message the job poster from Infotree Global Solutions.SRE III – Platform Engineering (Customer Data Platform).Cape Town / Hybrid (2 days in office). Maintain and improve system reliability an...Show moreLast updated: 20 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DuckDuckGoWorkFromHome, Western Cape, South Africa
    Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Show moreLast updated: 30+ days ago
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CyberSentriqCape Town, Western Cape, ZA
    Quick Apply
    Job Title : Senior Site Reliability Engineer (SRE) .Closing date : November 10, 2025.Location : Somerset-West, South Africa. Are you a skilled Senior SRE passionate about building resilient, ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Robin AICape Town, Western Cape, South Africa
    Robin is on a mission to rebuild the legal industry — starting with making contracts simple for everyone.We are a pioneer in Legal AI, built on proprietary models, licensed data, and deep partnersh...Show moreLast updated: 20 hours ago
    • Promoted
    Site Reliability Engineer Team Lead

    Site Reliability Engineer Team Lead

    Robin AICape Town, Western Cape, South Africa
    Robin is on a mission to rebuild the legal industry starting withmaking contracts simple for everyone.We are a pioneer in Legal AI built on proprietary models licensed data anddeeppartnerships with...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineering Manager

    Site Reliability Engineering Manager

    CanonicalWorkFromHome, Western Cape, South Africa
    Site Reliability Engineering Manager role at Canonical.Location : Remote in APAC region.Lead your team in daily agile devops practices. Represent the IS team to stakeholders, customers, and internal ...Show moreLast updated: 30+ days ago