OM Bank - Site Reliability Engineer

Old Mutual - Cape Town, Western Cape, South Africa
2 days ago
Job description

Let’s Write Africa’s Story Together!

Old Mutual is a firm believer in the African opportunity and our diverse talent reflects this.

OM Bank is looking for a site reliability engineer to join the OM Bank platform team. The candidate will be responsible for maintaining the OM Bank platform, including first line support for the platform’s technical services and managing service outages through the incident management process.

Key Result Areas

  • First line support for all services that comprise the platform
  • Managing the incident management process for production incidents, including detection, triage, resolution and driving continuous improvement
  • Maintaining the production readiness scorecard defined in Terraform to ensure checks are working as expected, and adding new checks to the scorecard workflow
  • Creating and maintaining monitors in Datadog that improve observability across the platform
  • Engaging with the wider OM Bank product and build teams to ensure alignment with the observability standards defined by the platform team
  • Designing and implementing enhancements to the platform that contribute towards reducing MTTR (mean time to recovery)
  • Designing and implementing automation initiatives including self-service capabilities
  • Implementing Service Level Indicators & Objectives for the platform
  • Implementing and maintaining Datadog dashboards for the platform
  • Defining and maintaining baseline monitors to be used by product teams
  • Maintaining the observability repository that contains all service definitions and observability related configurations
  • Maintaining the feature flagging repository containing all feature flagging definitions for product teams
  • Maintaining PagerDuty definitions and overall administration
  • Fine-tuning monitors to ensure alerts are triggered appropriately
  • Leading an action center during a production incident, fostering collaboration across the bank to resolve the outage
  • Advising product and platform on engineering best practices to ensure services are built with observability and scalability from the start
  • Maintaining overall platform health by monitoring key metrics
  • Maintaining and extending the SRE API, written in Python and deployed to Kubernetes

Role Requirements

  • Bachelor’s degree in Computer Science, Electrical or Electronic Engineering, Information Technology, or a relevant field
  • 7+ years of software and platform engineering experience building and supporting scalable services
  • 3-5 years' experience writing infrastructure as code (Terraform, AWS CDK, CloudFormation)
  • Solid experience using observability platforms like Datadog
  • Experience with microservices architecture and RESTful APIs
  • Solid Kubernetes expertise, including end-to-end deployment and maintenance of clusters, and designing and building the infrastructure as code required to deploy a cluster and the cloud resources that support it
  • Experience with Kubernetes custom resource management and deployment
  • Solid experience deploying Kubernetes resources using Helm Charts
  • Experience in fine-tuning Kubernetes HPA configs
  • Moderate experience programming in Go or Python
  • Solid experience using GitOps and general Git-based operations
  • Solid infrastructure as code background, with experience designing, implementing and maintaining IaC design patterns that manage large-scale cloud environments
  • Solid AWS experience, displaying advanced understanding of cloud architecture and maintaining distributed systems
  • Experience maintaining queuing systems like AWS SQS and event streaming platforms like Kafka
  • Experience supporting mobile applications

Closing Date

01 November 2025, 23:59

The appointment will be made from the designated group in line with the Employment Equity Plan of Old Mutual South Africa and the specific business unit in question.
