Design, deploy, and maintain cloud infrastructure using platforms like Microsoft Azure, AWS, or Google Cloud.
Ensure high availability, scalability, and fault tolerance of applications and services, including managing containerized environments.
Utilize Rancher for managing Kubernetes clusters, ensuring efficient deployment and orchestration of containerized applications across various environments.
Automation and CI / CD Pipeline Development
Build and maintain continuous integration / continuous deployment (CI / CD) pipelines for automated testing, building, and deployment of software.
Automate infrastructure provisioning using tools like Terraform, Ansible, or ARM templates.
Leverage Ranchers CI / CD capabilities and integrations to streamline the deployment process for containerized applications in Kubernetes environments.
Monitoring and Performance Optimization
Implement and maintain monitoring, logging, and alerting solutions to track application and infrastructure performance, using tools such as Azure Monitor, Prometheus, or CloudWatch.
Use Ranchers built-in monitoring tools to observe Kubernetes clusters and containers, ensuring that applications are performing optimally.
Optimize infrastructure for cost-efficiency and performance, configuring autoscaling and resource management within Rancher-managed Kubernetes clusters.
Security and Compliance
Ensure the security of cloud infrastructure by configuring firewalls, access controls, and encryption for sensitive data.
Implement security best practices to maintain compliance with industry regulations and standards, including role-based access control (RBAC) within Rancher for managing Kubernetes security.
Monitor for security vulnerabilities, manage container security, and perform regular security audits of Kubernetes clusters and cloud resources.
Collaboration with Development and Operations Teams
Work closely with software development teams to understand application requirements and provide the necessary infrastructure support, particularly for containerized workloads.
Collaborate with operations teams to ensure the smooth operation of deployed services, particularly within containerized environments managed through Rancher.
Incident Management and Troubleshooting
Investigate and resolve platform-related issues, including application outages, network failures, and security incidents.
Utilize Ranchers centralized logging and monitoring to quickly identify and troubleshoot issues within Kubernetes clusters.
Provide on-call support and contribute to incident response strategies, ensuring minimal downtime and fast recovery of services.
System Upgrades and Patching
Manage platform updates, patches, and upgrades to ensure systems remain secure and up-to-date.
Plan and execute Kubernetes cluster upgrades and Rancher version updates to stay current with new features and security patches.
Ensure that containerized applications remain compatible and functional after updates.
Documentation and Knowledge Sharing
Maintain clear, comprehensive documentation of infrastructure configurations, deployment processes, and troubleshooting procedures.
Share knowledge of Rancher, Kubernetes, and cloud infrastructure best practices with team members to improve platform operations and efficiency.
Capacity Planning and Scaling
Monitor resource usage and plan for capacity scaling to meet changing business and application demands.
Implement scaling strategies for Kubernetes clusters in Rancher, including auto-scaling of pods, nodes, and applications to accommodate varying workloads.
Cost Management and Optimization
Track and analyze cloud resource usage and costs to ensure efficient resource allocation.
Optimize cloud spending by implementing best practices like reserved instances, spot instances, and resource rightsizing.
Use Rancher to monitor the resource consumption of containerized applications and optimize the deployment of Kubernetes clusters to reduce infrastructure costs.
Disaster Recovery and Backup Planning
Implement disaster recovery strategies and data backup solutions to minimize downtime and data loss.
Regularly test backup systems and recovery procedures to ensure reliability in case of failure, including implementing backup solutions for Kubernetes environments managed through Rancher.
Essentia l skills
Several years (typically 3-5 years) of experience in a related field (e.g., systems engineering, DevOps, infrastructure engineering).
Bachelor's degree in Computer Science or related field, and or certifications such as Microsoft Certified : Azure Solutions Architect Expert, AWS Certified Solutions Architect, Certified Kubernetes Administrator (CKA), or Red Hat Certified Engineer (RHCE).
Strong verbal and written communication skills, with the ability to convey complex ideas clearly and effectively
Experience working collaboratively in cross-functional teams, with a focus on achieving shared goals
Expertise in managing multiple projects simultaneously, with a track record of delivering on time and within scope
Exceptional attention to detail, ensuring high standards of quality in all outputs
Ability to adapt quickly to changing environments and priorities, maintaining effectiveness in dynamic situations
Skills in designing highly available and fault-tolerant systems, ensuring platforms are resilient under various conditions.
Proven working experience with tools like Prometheus, Grafana, Datadog, New Relic, or ELK stack to monitor the health of infrastructure, applications, and services.
Excellent skills in identifying, diagnosing, and resolving infrastructure issues quickly, especially when systems fail or behave unexpectedly.
Knowledge of securing infrastructure and applications, including role-based access control (RBAC), encryption, and network security.
A solid understanding of Git for source code management, collaboration, and version control, is essential
A strong understanding of container orchestration with Kubernetes, particularly in deploying, managing, and scaling containerized applications across multi-cluster environments.
Proficiency in configuring and maintaining Rancher for cluster management, along with expertise in implementing security policies, monitoring, and logging within Kubernetes clusters, is essential for optimizing containerized workloads and ensuring high availability, security, and performance.
A deep understanding of cloud infrastructure management, such as provisioning and configuring virtual machines, networking, storage solutions, and implementing security best practices like Azure Active Directory and network security groups.
Additionally, proficiency in automation and CI / CD pipelines using Azure DevOps, along with expertise in Azure monitoring tools like Azure Monitor and Application Insights, is crucial for ensuring high availability, security, cost optimization, and efficient deployment of applications in the cloud.
Strong focus on automating manual processes and optimizing workflows for more efficient system management
Experience with designing distributed systems, microservices, and understanding the trade-offs between performance, consistency, and scalability.
Please call us on
Create a job alert for this search
Platform Engineer • Cape Town, South Africa
Related jobs
Promoted
Senior Platform Engineer
DigiOutsourceCape Town, Western Cape, South Africa
Kick-start your career in the online gaming world and experience the very latest in technology and innovation.We’re part of Super Group, the NYSE-listed digital gaming company behind some of the wo...Show moreLast updated: 30+ days ago
Promoted
Platform Engineer
Electrum PaymentsCape Town, Western Cape, South Africa
Are you passionate about building the foundational infrastructure that powers modern software development? Electrum is seeking a.
In this role, you\'ll enable engineering teams by automating infrast...Show moreLast updated: 30+ days ago
Promoted
Senior Data Platform Engineer
MoonPayWorkFromHome, Western Cape, South Africa
We’re here to onboard the world to the decentralized economy.MoonPay is building the infrastructure that powers a new financial system.
We make it easy for anyone, anywhere, to buy, sell, and trade ...Show moreLast updated: 1 day ago
Promoted
Senior Cloud Engineer
BETSoftwareCape Town, Western Cape, South Africa
Get AI-powered advice on this job and more exclusive features.Work closely with our customers, to understand, capture, and deliver against their requirements.
Design and build distributed systems.Ab...Show moreLast updated: 29 days ago
Promoted
Senior Back-End Engineer
Weaver FintechWorkFromHome, Wes-Kaap, South Africa
Get AI-powered advice on this job and more exclusive features.Direct message the job poster from Weaver Fintech.The absolute legend of an Engineer we're seeking will be responsible for working with...Show moreLast updated: 1 day ago
Promoted
Cloud Engineer – Become Senior Cloud Engineer while leading the implementation of enterprise so[...]
Acuity ConsultantsCape Town, Western Cape, South Africa
This is an excellent opportunity to become a Senior Cloud Engineer while leading the implementation of enterprise solutions for the leading organizations throughout South Africa.Based in CAPE TOWN,...Show moreLast updated: 11 days ago
Promoted
Senior Platform Engineer
The Hiring HouseCape Town, South Africa
Infrastructure Design and Management.Design, deploy, and maintain cloud infrastructure using platforms like Microsoft Azure, AWS, or Google Cloud.
Ensure high availability, scalability, and fault to...Show moreLast updated: 1 day ago
Platform Engineer
Electrum SoftwareCape Town, Western Cape, ZA
Quick Apply
Electrum is a next-generation payment software technology company.Since 2012, we've delivered trusted, enterprise-grade, cloud-native software to optimise financial transaction processing.Our deep ...Show moreLast updated: 30+ days ago
Promoted
Senior Platform Engineer
TilloCape Town, Western Cape, South Africa
A Senior Platform Engineer with AWS & Kubernetes experience, and a strong curiosity to learn new tools.You will make significant contributions to the Tillo platform - designing and building infrast...Show moreLast updated: 3 days ago
Senior Infrastructure Engineer
Prisma Data, Inc.Cape Town, Other, South Africa, 7100
At Prisma, we're redefining how developers work with databases.If you're fascinated by the cutting-edge data infrastructure powering companies like Twitter, Airbnb, and Facebook, but want the agili...Show moreLast updated: 30+ days ago
Promoted
New!
Lead AI Platform Engineer
Sabio GroupWorkFromHome, Western Cape, South Africa
At Sabio Group, we're dedicated to fostering an environment where employees thrive.Since 1998, we've built a dynamic culture that is both challenging and fun, driven by a team of ambitious, knowled...Show moreLast updated: 13 hours ago
Promoted
Platform Engineer
ElectrumCape Town, Western Cape, South Africa
Electrum is the next-generation payments technology company that provides cloud-native software to optimize the processing of financial transactions.
Since 2012, we have established ourselves as a r...Show moreLast updated: 30+ days ago
Promoted
Platform Engineer
Eqplus Pty LtdCape Town, Western Cape, South Africa
An opportunity exists for a Platform Engineer to contribute to the development, integration, and operation of shared platform services supporting large-scale scientific computing and complex softwa...Show moreLast updated: 1 day ago
Promoted
New!
Senior Cloud Engineer / Architect
George Consulting LtdWorkFromHome, Western Cape, South Africa
George Consulting iscommitted to creating a diverse environment and is an equalopportunity employer.George Consulting is seeking an experienced Senior Cloud Engineer / Architect to lead the design, d...Show moreLast updated: 13 hours ago
Promoted
Platform Engineer / Devops - CPT
DataFinCape Town, Western Cape, South Africa
Join a team of scientists, engineers, and computer scientists working on the world’s largest and most advanced radio telescope project, as they seek a Platform Engineer to contribute to the develop...Show moreLast updated: 30+ days ago
Promoted
Lead Platform Engineer
Absa GroupWorkFromHome, Western Cape, South Africa
Empowering Africa’s tomorrow, together…one story at a time.With over 100 years of rich history and strongly positioned as a local bank with regional and international expertise, a career with our f...Show moreLast updated: 1 day ago
Promoted
Senior DevOps Engineer
Plus1X Solutions (Pty) LtdCape Town, Western Cape, South Africa
We are seeking an experienced Senior DevOps Engineer to drive customer growth, enhance engagement, and improve our website experience.
You will play a pivotal role in ensuring the resilience, securi...Show moreLast updated: 30+ days ago
Promoted
SKA Mid - Platform Engineer
The Hiring HouseCape Town, South Africa
Contribute to the development and improvement of platform services supporting engineering and operational teams.Support integration of platform services with application and infrastructure systems....Show moreLast updated: 30+ days ago