Key Responsibilities:
Qualifications:
Contribute to the delivery of reusable automation solutions and reliability frameworks to support LoB teams and reduce operational burden.
Support implementation of failover workflows, swing tests, and recovery automation for business-critical applications.
Work with SREs across Services to identify automation opportunities, share reusable tooling, and improve adoption of production engineering standards.
Maintain and contribute to the central production engineering backlog using Jira, helping prioritize and organize workstreams aligned to Services Technology needs.
Support the development of utilities, scripts, and templates to improve production support effectiveness and reduce manual intervention.
Engage in the Services Production Engineering Forum, collaborating with other engineers to share knowledge and resolve common challenges.
5-8 years of experience in SRE or Engineering roles with a strong focus on automation, resiliency, and tooling.
Proficiency in one or more scripting or automation languages (e.g., Python, Bash, Terraform, Ansible, YAML).
Strong understanding of CI/CD workflows, Agile delivery, and reliability integration into development pipelines.
Familiarity with capacity planning, telemetry, and using production data to improve service operations.
Experience supporting or developing tools across hybrid platforms (on-prem, cloud, and containers like ECS/Kubernetes).
Good communication & interpersonal skills
Any question or remark? just write us a message
If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.