Overview
Our client, a global BPO business, is looking for Site Reliability Engineers to support a global payment technology company whose platforms enable consumers, businesses and organizations to make electronic payments. The successful candidate will be responsible for ensuring site reliability and performance, monitoring and alerting, and supporting emergency response situations. The ideal candidate bridges development and operations by applying a software engineering mindset to service management.
The Role
- Build and maintain pipelines in accordance with the client’s tooling and conventions.
- Participate in the software development lifecycle, working closely with the development team to ensure that designed solutions meet non‑functional requirements such as availability, performance, security and maintainability standards.
- Maintain services through monitoring of metrics, system health, and analysis of reports.
- Provide support for production and in‑house systems. Participate in the on‑call production support rota.
- Handle incident management, on‑call support and root‑cause analysis, including conducting post‑incident reviews and 5‑Whys exercises.
- Remediate system vulnerabilities and strengthen security and resiliency measures.
- Improve processes and systems within the program.
- Lead incident management efforts by proactively monitoring and analyzing ISO 8583 financial transaction messages across the four‑party payment model (Cardholder, Merchant, Acquirer, Issuer).
Skills & Requirements
- Card payment domain knowledge (mandatory).
- Experience with CI/CD and build pipelines using Jenkins.
- Experience in public and private cloud offerings (PCF, Azure, AWS, etc.).
- Knowledge of NoSQL and SQL databases such as MongoDB and Oracle.
- Experience managing distributed systems and working with microservices.
- Familiarity with Unix tooling and strong scripting skills.
- Exposure to monitoring and alerting tools such as Splunk and Dynatrace.
- Proficiency in one of the following: Python, Java, Go or equivalent.
- Familiarity with defining SLOs and SLAs.
- Prior experience working in an SRE/DevOps team and an excellent understanding of SRE/DevOps principles.
- High degree of initiative and self‑motivation, with a willingness to take on challenging opportunities.
- Excellent communication, relationship‑building and collaboration skills.