Reliability Engineer – Pragma
Job Description
At Pragma, we provide the opportunity for individuals to enjoy their working lives as much as their home lives. We foster a team environment in which each individual is recognised, valued and developed to support our company strategy. We encourage people with disabilities and from diverse backgrounds to apply.
You will work within a wide range of industries in the private and/or public sectors, covering various aspects of asset management. You will either be dedicated to a single client and based at the client's site, or work from the Pragma head office, servicing multiple clients. You will be exposed to industry best practices for asset management and reliability engineering, and have access to extensive training material to help you be the best you can be as a specialist. If you seek a job where you can use your logical and analytical engineering mind to implement creative solutions that optimise your client's asset management system, this role is meant for you.
Minimum Requirements
- A tertiary qualification in a relevant field (Industrial / Mechanical / Chemical / Electrical).
- A minimum of 3 years of practical experience as a reliability engineer.
- A minimum of 3 years of relevant practical experience in the Gas & Oil industry.
- Advanced knowledge of asset management and reliability engineering.
- Advanced MS Excel knowledge and application.
- Exposure to and understanding of project management methodology (whether applied in practice or completed in a formal course).
- Advantageous: CMRP, SCPP.
Duties & Responsibilities
- Identify opportunities for improving your client's asset management system and utilise a structured problem-solving methodology to implement solutions.
- Present asset management training and develop training material and course content.
- Identify required reports to measure and improve business processes.
- Deliver ad-hoc client support, training and projects.
Site Reliability Engineer – Cartrack
Rosebank, Gauteng
Posted today
Job Description
We're a world-leading smart mobility SaaS tech company with over 2,000,000 active users. Our teams are collaborative, vibrant and fast-growing, and all team members are empowered with the freedom to influence our products and technology.
Are you curious, innovative and passionate? Do you take ownership, embrace challenges, and love problem-solving?
We're looking for a Site Reliability Engineer (SRE) who will enable us to build industry-disrupting tech products and revolutionize the way our customers use technology.
The Site Reliability Engineer (SRE) will be responsible for ensuring the reliability, performance, and scalability of Cartrack's Linux-based systems and services. This role combines software engineering with operations, focusing on automation, monitoring, and incident response. The position requires working in shifts and rotations to support 24/7 operations.
You want to
- Maintain and improve the reliability, scalability, and performance of Cartrack's infrastructure and applications.
- Implement automation for deployments, monitoring, and system management.
- Troubleshoot production issues, perform root cause analysis, and implement permanent fixes.
- Develop and manage monitoring, alerting, and incident response processes.
- Work with development teams to design resilient and scalable systems.
- Participate in on-call shifts and rotation schedules to manage incidents and ensure uptime.
- Optimize system efficiency and cost-effectiveness in an open-source environment.
You have
- Strong background in Linux / Unix system administration (open-source stack).
- Familiarity with monitoring and logging tools (Prometheus, Grafana, etc.).
- Knowledge of networking, load balancing, and system security best practices.
- Strong problem-solving and debugging skills in a production environment.
- Proven experience in automation and scripting (Python, Bash, Go, or similar).
- Ability to design and maintain automation frameworks for deployments, monitoring, and system recovery.
- Hands-on experience with CI/CD pipelines and configuration management tools (e.g., GitLab CI, Ansible, Puppet, Terraform).
- Experience building self-healing and auto-remediation solutions for production environments.
Nice to Have
- Experience with containerization and orchestration (Docker, Kubernetes).
- Exposure to microservices and service mesh environments.
- Knowledge of database reliability and performance tuning (PostgreSQL).
Qualifications
- Bachelor's degree in Computer Science, Information Systems, or equivalent practical experience.
- 3+ years of experience in SRE, DevOps, or related infrastructure / operations roles.
- Ability to work flexible hours, including shift rotations and on-call duties.
Job Type: Full-time
Experience / Location:
Rosebank, Gauteng: Reliably commute or plan to relocate before starting work (preferred).
Manager - Customer Reliability Engineer. Software Solutions – MTN
Roodepoort, Gauteng
Posted today
Job Description
- Achieve measurable improvements in system uptime and performance by implementing robust reliability engineering practices and leading incident prevention initiatives.
- Reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) through streamlined incident response protocols and team readiness, ensuring minimal disruption to customers.
- Build, lead, and develop a skilled team of Customer Reliability Engineers with a strong focus on ownership, collaboration, and continuous learning.
- Ensure that reliability is embedded into service design, development, deployment, and operations by partnering with engineering, product, and operations teams.
- Deliver clear and actionable reporting on reliability metrics to support leadership decision-making and continuous improvement.
- Align reliability goals with customer expectations by addressing root causes of service degradation and championing seamless user experiences.