Talent.com
Senior Platform Engineer

Senior Platform Engineer

The Hiring HouseCape Town, South Africa
19 hours ago
Job description

Responsibilities

  • Infrastructure Design and Management
  • Design, deploy, and maintain cloud infrastructure using platforms like Microsoft Azure, AWS, or Google Cloud.
  • Ensure high availability, scalability, and fault tolerance of applications and services, including managing containerized environments.
  • Utilize Rancher for managing Kubernetes clusters, ensuring efficient deployment and orchestration of containerized applications across various environments.
  • Automation and CI / CD Pipeline Development
  • Build and maintain continuous integration / continuous deployment (CI / CD) pipelines for automated testing, building, and deployment of software.
  • Automate infrastructure provisioning using tools like Terraform, Ansible, or ARM templates.
  • Leverage Ranchers CI / CD capabilities and integrations to streamline the deployment process for containerized applications in Kubernetes environments.
  • Monitoring and Performance Optimization
  • Implement and maintain monitoring, logging, and alerting solutions to track application and infrastructure performance, using tools such as Azure Monitor, Prometheus, or CloudWatch.
  • Use Ranchers built-in monitoring tools to observe Kubernetes clusters and containers, ensuring that applications are performing optimally.
  • Optimize infrastructure for cost-efficiency and performance, configuring autoscaling and resource management within Rancher-managed Kubernetes clusters.
  • Security and Compliance
  • Ensure the security of cloud infrastructure by configuring firewalls, access controls, and encryption for sensitive data.
  • Implement security best practices to maintain compliance with industry regulations and standards, including role-based access control (RBAC) within Rancher for managing Kubernetes security.
  • Monitor for security vulnerabilities, manage container security, and perform regular security audits of Kubernetes clusters and cloud resources.
  • Collaboration with Development and Operations Teams
  • Work closely with software development teams to understand application requirements and provide the necessary infrastructure support, particularly for containerized workloads.
  • Collaborate with operations teams to ensure the smooth operation of deployed services, particularly within containerized environments managed through Rancher.
  • Incident Management and Troubleshooting
  • Investigate and resolve platform-related issues, including application outages, network failures, and security incidents.
  • Utilize Ranchers centralized logging and monitoring to quickly identify and troubleshoot issues within Kubernetes clusters.
  • Provide on-call support and contribute to incident response strategies, ensuring minimal downtime and fast recovery of services.
  • System Upgrades and Patching
  • Manage platform updates, patches, and upgrades to ensure systems remain secure and up-to-date.
  • Plan and execute Kubernetes cluster upgrades and Rancher version updates to stay current with new features and security patches.
  • Ensure that containerized applications remain compatible and functional after updates.
  • Documentation and Knowledge Sharing
  • Maintain clear, comprehensive documentation of infrastructure configurations, deployment processes, and troubleshooting procedures.
  • Share knowledge of Rancher, Kubernetes, and cloud infrastructure best practices with team members to improve platform operations and efficiency.
  • Capacity Planning and Scaling
  • Monitor resource usage and plan for capacity scaling to meet changing business and application demands.
  • Implement scaling strategies for Kubernetes clusters in Rancher, including auto-scaling of pods, nodes, and applications to accommodate varying workloads.
  • Cost Management and Optimization
  • Track and analyze cloud resource usage and costs to ensure efficient resource allocation.
  • Optimize cloud spending by implementing best practices like reserved instances, spot instances, and resource rightsizing.
  • Use Rancher to monitor the resource consumption of containerized applications and optimize the deployment of Kubernetes clusters to reduce infrastructure costs.
  • Disaster Recovery and Backup Planning
  • Implement disaster recovery strategies and data backup solutions to minimize downtime and data loss.
  • Regularly test backup systems and recovery procedures to ensure reliability in case of failure, including implementing backup solutions for Kubernetes environments managed through Rancher.

Essentia l skills

  • Several years (typically 3-5 years) of experience in a related field (e.g., systems engineering, DevOps, infrastructure engineering).
  • Bachelor's degree in Computer Science or related field, and or certifications such as Microsoft Certified : Azure Solutions Architect Expert, AWS Certified Solutions Architect, Certified Kubernetes Administrator (CKA), or Red Hat Certified Engineer (RHCE).
  • Strong verbal and written communication skills, with the ability to convey complex ideas clearly and effectively
  • Experience working collaboratively in cross-functional teams, with a focus on achieving shared goals
  • Expertise in managing multiple projects simultaneously, with a track record of delivering on time and within scope
  • Exceptional attention to detail, ensuring high standards of quality in all outputs
  • Ability to adapt quickly to changing environments and priorities, maintaining effectiveness in dynamic situations
  • Skills in designing highly available and fault-tolerant systems, ensuring platforms are resilient under various conditions.
  • Proven working experience with tools like Prometheus, Grafana, Datadog, New Relic, or ELK stack to monitor the health of infrastructure, applications, and services.
  • Excellent skills in identifying, diagnosing, and resolving infrastructure issues quickly, especially when systems fail or behave unexpectedly.
  • Knowledge of securing infrastructure and applications, including role-based access control (RBAC), encryption, and network security.
  • A solid understanding of Git for source code management, collaboration, and version control, is essential
  • A strong understanding of container orchestration with Kubernetes, particularly in deploying, managing, and scaling containerized applications across multi-cluster environments.
  • Proficiency in configuring and maintaining Rancher for cluster management, along with expertise in implementing security policies, monitoring, and logging within Kubernetes clusters, is essential for optimizing containerized workloads and ensuring high availability, security, and performance.
  • A deep understanding of cloud infrastructure management, such as provisioning and configuring virtual machines, networking, storage solutions, and implementing security best practices like Azure Active Directory and network security groups.
  • Additionally, proficiency in automation and CI / CD pipelines using Azure DevOps, along with expertise in Azure monitoring tools like Azure Monitor and Application Insights, is crucial for ensuring high availability, security, cost optimization, and efficient deployment of applications in the cloud.
  • Strong focus on automating manual processes and optimizing workflows for more efficient system management
  • Experience with designing distributed systems, microservices, and understanding the trade-offs between performance, consistency, and scalability.
  • Please call us on

    Create a job alert for this search

    Platform Engineer • Cape Town, South Africa

    Related jobs
    • Promoted
    Senior Platform Engineer

    Senior Platform Engineer

    DigiOutsourceCape Town, Western Cape, South Africa
    Kick-start your career in the online gaming world and experience the very latest in technology and innovation.We’re part of Super Group, the NYSE-listed digital gaming company behind some of the wo...Show moreLast updated: 30+ days ago
    • Promoted
    Platform Engineer

    Platform Engineer

    Electrum PaymentsCape Town, Western Cape, South Africa
    Are you passionate about building the foundational infrastructure that powers modern software development? Electrum is seeking a. In this role, you\'ll enable engineering teams by automating infrast...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Platform Engineer

    Senior Data Platform Engineer

    MoonPayWorkFromHome, Western Cape, South Africa
    We’re here to onboard the world to the decentralized economy.MoonPay is building the infrastructure that powers a new financial system. We make it easy for anyone, anywhere, to buy, sell, and trade ...Show moreLast updated: 1 day ago
    • Promoted
    Senior Cloud Engineer

    Senior Cloud Engineer

    BETSoftwareCape Town, Western Cape, South Africa
    Get AI-powered advice on this job and more exclusive features.Work closely with our customers, to understand, capture, and deliver against their requirements. Design and build distributed systems.Ab...Show moreLast updated: 29 days ago
    • Promoted
    Senior Back-End Engineer

    Senior Back-End Engineer

    Weaver FintechWorkFromHome, Wes-Kaap, South Africa
    Get AI-powered advice on this job and more exclusive features.Direct message the job poster from Weaver Fintech.The absolute legend of an Engineer we're seeking will be responsible for working with...Show moreLast updated: 1 day ago
    • Promoted
    Cloud Engineer – Become Senior Cloud Engineer while leading the implementation of enterprise so[...]

    Cloud Engineer – Become Senior Cloud Engineer while leading the implementation of enterprise so[...]

    Acuity ConsultantsCape Town, Western Cape, South Africa
    This is an excellent opportunity to become a Senior Cloud Engineer while leading the implementation of enterprise solutions for the leading organizations throughout South Africa.Based in CAPE TOWN,...Show moreLast updated: 11 days ago
    • Promoted
    Senior Platform Engineer

    Senior Platform Engineer

    The Hiring HouseCape Town, South Africa
    Infrastructure Design and Management.Design, deploy, and maintain cloud infrastructure using platforms like Microsoft Azure, AWS, or Google Cloud. Ensure high availability, scalability, and fault to...Show moreLast updated: 1 day ago
    Platform Engineer

    Platform Engineer

    Electrum SoftwareCape Town, Western Cape, ZA
    Quick Apply
    Electrum is a next-generation payment software technology company.Since 2012, we've delivered trusted, enterprise-grade, cloud-native software to optimise financial transaction processing.Our deep ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Platform Engineer

    Senior Platform Engineer

    TilloCape Town, Western Cape, South Africa
    A Senior Platform Engineer with AWS & Kubernetes experience, and a strong curiosity to learn new tools.You will make significant contributions to the Tillo platform - designing and building infrast...Show moreLast updated: 3 days ago
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    Prisma Data, Inc.Cape Town, Other, South Africa, 7100
    At Prisma, we're redefining how developers work with databases.If you're fascinated by the cutting-edge data infrastructure powering companies like Twitter, Airbnb, and Facebook, but want the agili...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Lead AI Platform Engineer

    Lead AI Platform Engineer

    Sabio GroupWorkFromHome, Western Cape, South Africa
    At Sabio Group, we're dedicated to fostering an environment where employees thrive.Since 1998, we've built a dynamic culture that is both challenging and fun, driven by a team of ambitious, knowled...Show moreLast updated: 13 hours ago
    • Promoted
    Platform Engineer

    Platform Engineer

    ElectrumCape Town, Western Cape, South Africa
    Electrum is the next-generation payments technology company that provides cloud-native software to optimize the processing of financial transactions. Since 2012, we have established ourselves as a r...Show moreLast updated: 30+ days ago
    • Promoted
    Platform Engineer

    Platform Engineer

    Eqplus Pty LtdCape Town, Western Cape, South Africa
    An opportunity exists for a Platform Engineer to contribute to the development, integration, and operation of shared platform services supporting large-scale scientific computing and complex softwa...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Senior Cloud Engineer / Architect

    Senior Cloud Engineer / Architect

    George Consulting LtdWorkFromHome, Western Cape, South Africa
    George Consulting iscommitted to creating a diverse environment and is an equalopportunity employer.George Consulting is seeking an experienced Senior Cloud Engineer / Architect to lead the design, d...Show moreLast updated: 13 hours ago
    • Promoted
    Platform Engineer / Devops - CPT

    Platform Engineer / Devops - CPT

    DataFinCape Town, Western Cape, South Africa
    Join a team of scientists, engineers, and computer scientists working on the world’s largest and most advanced radio telescope project, as they seek a Platform Engineer to contribute to the develop...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Platform Engineer

    Lead Platform Engineer

    Absa GroupWorkFromHome, Western Cape, South Africa
    Empowering Africa’s tomorrow, together…one story at a time.With over 100 years of rich history and strongly positioned as a local bank with regional and international expertise, a career with our f...Show moreLast updated: 1 day ago
    • Promoted
    Senior DevOps Engineer

    Senior DevOps Engineer

    Plus1X Solutions (Pty) LtdCape Town, Western Cape, South Africa
    We are seeking an experienced Senior DevOps Engineer to drive customer growth, enhance engagement, and improve our website experience. You will play a pivotal role in ensuring the resilience, securi...Show moreLast updated: 30+ days ago
    • Promoted
    SKA Mid - Platform Engineer

    SKA Mid - Platform Engineer

    The Hiring HouseCape Town, South Africa
    Contribute to the development and improvement of platform services supporting engineering and operational teams.Support integration of platform services with application and infrastructure systems....Show moreLast updated: 30+ days ago