Design, deploy, and maintain cloud infrastructure using platforms like Microsoft Azure, AWS, or Google Cloud.
Ensure high availability, scalability, and fault tolerance of applications and services, including managing containerized environments.
Utilize Rancher for managing Kubernetes clusters, ensuring efficient deployment and orchestration of containerized applications across various environments.
Automation and CI / CD Pipeline Development
Build and maintain continuous integration / continuous deployment (CI / CD) pipelines for automated testing, building, and deployment of software.
Automate infrastructure provisioning using tools like Terraform, Ansible, or ARM templates.
Leverage Ranchers CI / CD capabilities and integrations to streamline the deployment process for containerized applications in Kubernetes environments.
Monitoring and Performance Optimization
Implement and maintain monitoring, logging, and alerting solutions to track application and infrastructure performance, using tools such as Azure Monitor, Prometheus, or CloudWatch.
Use Ranchers built-in monitoring tools to observe Kubernetes clusters and containers, ensuring that applications are performing optimally.
Optimize infrastructure for cost-efficiency and performance, configuring autoscaling and resource management within Rancher-managed Kubernetes clusters.
Security and Compliance
Ensure the security of cloud infrastructure by configuring firewalls, access controls, and encryption for sensitive data.
Implement security best practices to maintain compliance with industry regulations and standards, including role-based access control (RBAC) within Rancher for managing Kubernetes security.
Monitor for security vulnerabilities, manage container security, and perform regular security audits of Kubernetes clusters and cloud resources.
Collaboration with Development and Operations Teams
Work closely with software development teams to understand application requirements and provide the necessary infrastructure support, particularly for containerized workloads.
Collaborate with operations teams to ensure the smooth operation of deployed services, particularly within containerized environments managed through Rancher.
Incident Management and Troubleshooting
Investigate and resolve platform-related issues, including application outages, network failures, and security incidents.
Utilize Ranchers centralized logging and monitoring to quickly identify and troubleshoot issues within Kubernetes clusters.
Provide on-call support and contribute to incident response strategies, ensuring minimal downtime and fast recovery of services.
System Upgrades and Patching
Manage platform updates, patches, and upgrades to ensure systems remain secure and up-to-date.
Plan and execute Kubernetes cluster upgrades and Rancher version updates to stay current with new features and security patches.
Ensure that containerized applications remain compatible and functional after updates.
Documentation and Knowledge Sharing
Maintain clear, comprehensive documentation of infrastructure configurations, deployment processes, and troubleshooting procedures.
Share knowledge of Rancher, Kubernetes, and cloud infrastructure best practices with team members to improve platform operations and efficiency.
Capacity Planning and Scaling
Monitor resource usage and plan for capacity scaling to meet changing business and application demands.
Implement scaling strategies for Kubernetes clusters in Rancher, including auto-scaling of pods, nodes, and applications to accommodate varying workloads.
Cost Management and Optimization
Track and analyze cloud resource usage and costs to ensure efficient resource allocation.
Optimize cloud spending by implementing best practices like reserved instances, spot instances, and resource rightsizing.
Use Rancher to monitor the resource consumption of containerized applications and optimize the deployment of Kubernetes clusters to reduce infrastructure costs.
Disaster Recovery and Backup Planning
Implement disaster recovery strategies and data backup solutions to minimize downtime and data loss.
Regularly test backup systems and recovery procedures to ensure reliability in case of failure, including implementing backup solutions for Kubernetes environments managed through Rancher.
Essentia l skills
Several years (typically 3-5 years) of experience in a related field (e.g., systems engineering, DevOps, infrastructure engineering).
Bachelor's degree in Computer Science or related field, and or certifications such as Microsoft Certified : Azure Solutions Architect Expert, AWS Certified Solutions Architect, Certified Kubernetes Administrator (CKA), or Red Hat Certified Engineer (RHCE).
Strong verbal and written communication skills, with the ability to convey complex ideas clearly and effectively
Experience working collaboratively in cross-functional teams, with a focus on achieving shared goals
Expertise in managing multiple projects simultaneously, with a track record of delivering on time and within scope
Exceptional attention to detail, ensuring high standards of quality in all outputs
Ability to adapt quickly to changing environments and priorities, maintaining effectiveness in dynamic situations
Skills in designing highly available and fault-tolerant systems, ensuring platforms are resilient under various conditions.
Proven working experience with tools like Prometheus, Grafana, Datadog, New Relic, or ELK stack to monitor the health of infrastructure, applications, and services.
Excellent skills in identifying, diagnosing, and resolving infrastructure issues quickly, especially when systems fail or behave unexpectedly.
Knowledge of securing infrastructure and applications, including role-based access control (RBAC), encryption, and network security.
A solid understanding of Git for source code management, collaboration, and version control, is essential
A strong understanding of container orchestration with Kubernetes, particularly in deploying, managing, and scaling containerized applications across multi-cluster environments.
Proficiency in configuring and maintaining Rancher for cluster management, along with expertise in implementing security policies, monitoring, and logging within Kubernetes clusters, is essential for optimizing containerized workloads and ensuring high availability, security, and performance.
A deep understanding of cloud infrastructure management, such as provisioning and configuring virtual machines, networking, storage solutions, and implementing security best practices like Azure Active Directory and network security groups.
Additionally, proficiency in automation and CI / CD pipelines using Azure DevOps, along with expertise in Azure monitoring tools like Azure Monitor and Application Insights, is crucial for ensuring high availability, security, cost optimization, and efficient deployment of applications in the cloud.
Strong focus on automating manual processes and optimizing workflows for more efficient system management
Experience with designing distributed systems, microservices, and understanding the trade-offs between performance, consistency, and scalability.
Please call us on
Create a job alert for this search
Platform Engineer • Cape Town, South Africa
Related jobs
Senior Platform Engineer
DigiOutsource • Cape Town, Western Cape, South Africa
Kick-start your career in the online gaming world and experience the very latest in technology and innovation.We’re part of Super Group, the NYSE-listed digital gaming company behind some of the wo...Show more
Last updated: 30+ days ago • Promoted
Senior Platform Engineer
Kifiya Financial Technology • Cape Town, ZA
Job title : Senior Platform Engineer.Job Location : Western Cape, Cape Town.The Senior Platform Engineer is responsible for building and maintaining the foundational data and AI infrastructure that...Show more
Last updated: 12 days ago • Promoted
Senior Data Platform Engineer
MoonPay • WorkFromHome, Western Cape, South Africa
We’re here to onboard the world to the decentralized economy.MoonPay is building the infrastructure that powers a new financial system.
We make it easy for anyone, anywhere, to buy, sell, and trade ...Show more
Last updated: 22 days ago • Promoted
Senior Backend Engineer I
M-KOPA • WorkFromHome, Western Cape, South Africa
Join our expanding engineering team.We're continuing to hire multiple Senior Backend Engineers (I, II & III) as we scale our FinTech team at M-KOPA.
Build sophisticated event-driven microservices th...Show more
An innovative Independent Asset Management Firm in Cape Town is seeking an Azure Cloud Platform Engineer to design, build, and evolve its Internal Developer Platform.
The ideal candidate will have a...Show more
Last updated: 8 days ago • Promoted
Senior Platform Engineer
Potentiam Ltd • WorkFromHome, Wes-Kaap, South Africa
Our client is looking for a Senior Platform Engineer to help design, secure, and scale their infrastructure across multiple cloud providers (AWS, GCP, and Azure).
While Azure experience is preferred...Show more
Last updated: 30+ days ago • Promoted
Senior Platform Engineer
Ringier South Africa • WorkFromHome, Western Cape, South Africa
Ringier South Africa is looking to hire a.We are a technology company supporting international marketplaces based in Cape Town, South Africa and Mauritius.
Not based in South Africa or Mauritius? We...Show more
Last updated: 30+ days ago • Promoted
Senior Platform Engineer
Tillo • WorkFromHome, Western Cape, South Africa
A Senior Platform Engineer with AWS & Kubernetes experience, and a strong curiosity to learn new tools.You will make significant contributions to the Tillo platform - designing and building infrast...Show more
Last updated: 30+ days ago • Promoted
Platform Engineer
Electrum • Cape Town, Western Cape, South Africa
Electrum is the next-generation payments technology company that provides cloud-native software to optimize the processing of financial transactions.
Since 2012, we have established ourselves as a r...Show more
Last updated: 30+ days ago • Promoted
Senior Cloud Engineer
Betsoftware • Cape Town, Western Cape, South Africa
Work tightly with our customers, to understand, capture, and deliver against their requirements.Design and build distributed systems.
Ability to work with the Operator Framework.Having an interest i...Show more
A dynamic tech company is seeking a Senior Platform Engineer to design scalable solutions using AWS and Kubernetes.This role offers flexibility in a hybrid work setup, contributing to improving dev...Show more
Last updated: 3 days ago • Promoted
Senior Cloud Engineer
Solugrowth • Cape Town, Western Cape, South Africa
The Senior Cloud Engineer (Azure) is responsible for designing, implementing, and managing secure, scalable, and reliable cloud infrastructure solutions on Microsoft Azure.This role ensures optimal...Show more
Last updated: 20 days ago • Promoted
Senior Backend Engineer I
M-KOPA Kenya Limited • WorkFromHome, Western Cape, South Africa
Join our expanding engineering team.We're continuing to hire multiple Senior Backend Engineers (I, II & III) as we scale our FinTech team at M-KOPA.
Build sophisticated event-driven microservices th...Show more
Last updated: 30+ days ago • Promoted
Senior DevOps Engineer - CI / CD & AWS Platform Lead
RELX • WorkFromHome, Western Cape, South Africa
A global information provider is seeking a Senior DevOps Engineer in Cape Town to develop and evolve their global CI / CD platform.
The role involves designing reusable tools, managing Jenkins librari...Show more
Last updated: 8 days ago • Promoted
SKA Mid - Platform Engineer
The Hiring House • Cape Town, South Africa
Contribute to the development and improvement of platform services supporting engineering and operational teams.Support integration of platform services with application and infrastructure systems....Show more
Last updated: 30+ days ago • Promoted
Engineer (Platform)
Woolworths • Cape Town, Western Cape, South Africa
FMCG, Retail, Wholesale and Supply Chain.An Engineer responsible for designing and implementing scalable and robust processes to support various engineering capabilities.
This role includes extracti...Show more
Last updated: 30+ days ago • Promoted
Senior Cloud Engineer
DevOps projects • WorkFromHome, Western Cape, South Africa
Get weekly curated DevOps opportunities, salary insights, and career tips — no spam, only relevant roles that match your stack and experience level.
We’re building a team of highly-skilled engineers...Show more
Last updated: 12 days ago • Promoted
Senior Backend Engineer (AWS)
Lumenalta • WorkFromHome, Western Cape, South Africa
As a Senior Backend Engineer at Lumenalta, you will design and implement robust backend systems, ensuring our applications are secure, scalable, and efficient.
You will be responsible for working cl...Show more