What you’ll do
As this is a new team working on a growing topic, we expect you to be open-minded, curious, and willing to push boundaries to learn. The team will be responsible for testing and simulating disaster recovery scenarios.
- Work on proofs of concept
- Follow design and implementation guidelines for high availability and disaster recovery (HA&DR)
- Implement and enhance tools and reusable services for asynchronous data replication between regions
Key Responsibilities:
- Collaborate with cross-functional teams to understand requirements and deliver scalable solutions.
- Test/simulate disaster recovery scenarios
- Create comprehensive guidance and best practices for high availability and disaster recovery strategies.
- Enable and support service teams in implementing high availability and disaster recovery
- Build test frameworks and simulate failovers to validate HA&DR
EDUCATION AND QUALIFICATIONS / SKILLS AND COMPETENCIES
- Knowledge of the DevOps toolset and related technologies
- Sound knowledge of and expertise in Linux shell scripting and containerization technologies (Docker, Kubernetes)
- Experience with a cloud infrastructure provider (AWS, Azure, GCP) is an added advantage
- Experience writing Terraform scripts or using another Infrastructure as Code solution
- Experience with CI/CD systems and automation
- Experience with secure development practices (at the implementation and infrastructure levels)
- Experience with chaos engineering frameworks and tools for simulating failover scenarios is preferred
General
- BE / B.Tech in Computer Science, Engineering, or a related field.
- Creative thinking, with a willingness and ability to quickly learn new concepts and technologies.
- Results-oriented and self-organized work style.
- Team player with strong collaboration skills.
WORK EXPERIENCE
7 to 13 years of experience in the above-mentioned competencies.