Design and implement automated solutions for deployment, monitoring, and incident response, using Infrastructure as Code principles and modern DevOps practices.
Analyze system performance, identify bottlenecks, and implement improvements to enhance reliability and reduce operational overhead.
Develop and maintain service level objectives and reliability metrics while ensuring system availability and performance meet business requirements.
Lead incident response efforts, conduct post-mortem analyses, and implement preventive measures to avoid future incidents.
Collaborate with development teams to build observability into applications and services, including logging, metrics, and distributed tracing.
Primary Tech Stack:
Expertise in Site Upkeep and monitoring must have skill (It includes writing and monitoring SFCC Jobs and SFCC configurations )
Expertise in CI/CD pipeline design, implementation, and optimization (Jenkins, GitLab CI, CircleCI, etc.).
Solid scripting and automation skills (Python, Bash, Groovy).
Strong experience with configuration management tools (Ansible, Puppet, Chef).*
Experience with containerization (Docker)
Experience with orchestration (Kubernetes).*
Hands-on coding experience with GIT/Bitbucket commands.
Infrastructure as Code knowledge (Terraform, CloudFormation).*
Monitoring and alerting tool expertise (Prometheus, Grafana, Datadog).
Security integration in DevOps processes (DevSecOps).
Familiarity with AWS -(WAF , CloudFront, s3, EC2, lambda, VPC).
Must have done some automation (preferably Load/performance testing and Slackbot automation).
(* Good to have)
Key Qualifications:
Bachelor’s degree in Engineering (Computer Science or Information Science preferred).
Strong academic record, excellent written and verbal communication skills.
Strong Debugging and troubleshooting skills.
Must be open to writing code and exploring new tools and technologies(Java/Python/JS etc).
5+ years automating infrastructure and deployments.
Building scalable, repeatable, and maintainable pipelines.
Collaboration with development, QA, and operations to streamline releases.
Passion for automation and continuous improvement.
Adaptability to evolving tech and business needs.
Ownership mindset and proactive problem solving.
Experience with Agile and DevOps culture.
Good to have retail/e-commerce domain knowledge.
Strong communication skills to coordinate across teams.
Ability to advocate and implement best practices in CI/CD and automation.
Leadership or mentoring experience within DevOps teams.