Talent.com
Site Reliability Engineer (SRE II) (Kubernetes / Python)

Site Reliability Engineer (SRE II) (Kubernetes / Python)

k0deHutJohannesburg, Gauteng, South Africa
3 days ago
Job description

Site Reliability Engineer (SRE II) (Kubernetes / Python)

Job Openings Site Reliability Engineer (SRE II) (Kubernetes / Python)

About the job Site Reliability Engineer (SRE II) (Kubernetes / Python)

Intermediate Site Reliability Engineer (SRE II)

Our Client is offering the right candidate a great opportunity to join a fast growing South African fintech that enables seamless and innovative end-to-end customer onboarding services that drive conversion rates, prevent fraud, reduce risk and costs. They provide automated and easy to implement solutions that fully onboard a new customer in under two minutes.

You'll work in a small, senior team that operates on trust and high collaboration. The team works remotely most of the time and occasionally comes into the office when more direct collaboration is required. You should be motivated to achieve operational excellence using automation tooling (e.g. Terraform) and enjoy keeping your technical skills current to allow you to contribute to architectural discussions. Naturally, you'll be exposed to many aspects of our business from day one. They will ensure that you have the tools and support to do great work, but you'll also have the freedom to try new things and learn.

Infrastructure & Software Stack

  • CI / CD with Jenkins
  • Kong API Gateway
  • LogDNA
  • Falco
  • MongoDB Atlas
  • Microservice Architecture with Event Sourcing and CQRS

Your responsibilities will include :

  • Improving and maintaining our infrastructure using Terraform, which includes making effective use of public clouds (primarily Google Cloud and AWS) while considering :
  • Security
  • Maintainability
  • Scalability
  • Ensuring our infrastructure is automated and reproducible across environments
  • Leveraging Kubernetes in an effective manner to host our applications
  • Owning infrastructure projects from start to finish and driving them to completion within agreed timeframes
  • Documenting infrastructure design and how tooling should be used
  • Regularly considering the long-term vision for our infrastructure and our alignment to it
  • Making well-considered tradeoffs between short-term infrastructure requirements
  • and long-term objectives
  • Identifying potential improvements that could enable us to deliver faster without compromising operational objectives
  • Managing our identity platform and enabling enterprise user and system authentication and authorization using OAuth2
  • Writing, testing and executing change control plans for production changes with an eye for detail to spot potential issues
  • Having a good working understanding of how our systems operate and be able to debug production issues
  • Being part of our on-call rotation. When on-call, you will work on repaying technical debt and deal with operational incidents as and when they occur. This will require you to have or acquire a good general knowledge of production operations for technical support.
  • Being part of our security incident response team
  • Writing operational tooling to automate otherwise manual processes (e.g. Golang, Bash)
  • Performing high quality, ego-free code reviews for your colleagues as well as submitting your code for review by others and accepting their feedback generously
  • Taking ownership of our operational metrics and drive visibility, testing and improvement initiatives
  • Working effectively with the development team to plan and deploy required infrastructure changes or new capabilities ahead of time and unblocking the development team when unforeseen infrastructure blockers arise
  • Accepting feedback willingly and sharing your knowledge freely
  • Flexible working hours and leave (no clock watching)
  • Strong values that are practised
  • Remote work for most days of the week
  • Opportunity to learn and grow being surrounded by a strong technical team
  • #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • Johannesburg, Gauteng, South Africa

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    LunoWorkFromHome, Gauteng, South Africa
    Luno is the crypto investment app you can rely on, enabling you to buy, store and explore crypto securely.We're committed to putting the power of cryptocurrency in everyone's hands sensibly and res...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    LexisNexisJohannesburg, Gauteng, South Africa
    LexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of. Our company has been a long-time leader in deploying AI and advanced t...Show moreLast updated: 30+ days ago
    • Promoted
    Intermediate Site Reliability Engineer, Database Operations

    Intermediate Site Reliability Engineer, Database Operations

    GitLabWorkFromHome, South Africa
    GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute t...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Impronics TechnologiesJohannesburg, Gauteng, South Africa
    Site Reliability Engineer (SRE).Be among the first 25 applicants.Site Reliability Engineer (SRE).Get AI-powered advice on this job and more exclusive features. Site Reliability Engineer (SRE).The id...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalWorkFromHome, Gauteng, South Africa
    Site Reliability Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the enterprise and technology markets, known for Ubuntu and open source ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CanonicalWorkFromHome, Gauteng, South Africa
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in enterprise initiatives such as ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineering Manager

    Site Reliability Engineering Manager

    CanonicalWorkFromHome, Gauteng, South Africa
    Site Reliability Engineering Manager role at Canonical.Location : Remote in APAC region.Lead your team in daily agile devops practices. Represent the IS team to stakeholders, customers, and internal...Show moreLast updated: 30+ days ago
    • Promoted
    Engineer, Reliability

    Engineer, Reliability

    Standard Bank of South Africa LimitedJohannesburg, Gauteng, South Africa
    Business Segment : Personal & Private Banking.Location : ZA, GP, Johannesburg, Simmonds Street.We are seeking a detail-oriented and analytical Reliability Engineer to join our team in Johannesburg, S...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    DatacentrixJohannesburg, Gauteng, South Africa
    Gauteng, JHB - Eastern Suburbs (Market related) Are you a Site Reliability Engineer with solid Datadog experience? Our client in the Warehousing and Logistics sector is looking to employ an Enginee...Show moreLast updated: 20 days ago
    • Promoted
    Engineer, Site Reliability

    Engineer, Site Reliability

    Standard Bank of South Africa LimitedJohannesburg, Gauteng, South Africa
    Business Segment : Business & Commercial Banking.Location : ZA, GP, Johannesburg, 3 Simmonds Street.Responsible for the resilience of Group Information Technology across the entire eco system of the ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Engineer

    Senior Engineer

    Boardroom AppointmentsJohannesburg, Gauteng, South Africa
    BSc in Computer Science / Information Technology.SQL Certification (advantageous).Project Management Certification (recommended). Experience administering MS Windows Server environments.Experience w...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Level-UpJohannesburg, South Africa
    We are looking for a skilled Site Reliability Engineer (SRE) with expertise in Ansible and Linux to join our dynamic team. The successful candidate will play a critical role in maintaining the relia...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (SRE II) (Kubernetes / Python)

    Site Reliability Engineer (SRE II) (Kubernetes / Python)

    k0deHutWorkFromHome, Gauteng, South Africa
    Site Reliability Engineer (SRE II) (Kubernetes / Python).Job Openings Site Reliability Engineer (SRE II) (Kubernetes / Python). About the job Site Reliability Engineer (SRE II) (Kubernetes / Python).Inter...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DuckDuckGoWorkFromHome, Gauteng, South Africa
    Be among the first 25 applicants.Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (Datadog)

    Site Reliability Engineer (Datadog)

    Data CentrixJohannesburg, Gauteng, South Africa
    Datadog Certified Fundamentals Must have.Degree in Information Technology or Computer Science.Management of operations on virtualized and distributed infrastructures. Management of operations on env...Show moreLast updated: 21 days ago
    • Promoted
    Senior Systems Engineer

    Senior Systems Engineer

    NetsuritJohannesburg, South Africa
    Netsurit's mission is to "Support the dreams of the doers.For Netsurit, this means helping employees achieve their personal dreams and ambitions while they free up our customers to meet their broad...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    RELXJohannesburg, Gauteng, South Africa
    Our CEMEA Cloud / SRE team is looking for an experienced DevOps Engineer to help build scalable, secure, and reliable systems. Our team specializes in cloud and DevOps technologies, with members pos...Show moreLast updated: 30+ days ago
    • Promoted
    Platform / DevOps / Site Reliability Engineer

    Platform / DevOps / Site Reliability Engineer

    Elite Search & SelectionJohannesburg, Gauteng, South Africa
    Platform / DevOps / Site Reliability Engineer.Remote but ideally based in Johannesburg, Cape Town, Durban.Part of a large ICT group, this company offers globally available cloud services, solutions...Show moreLast updated: 30+ days ago