As a Senior Site Reliability Engineer, you'll be responsible for building and supporting Cloud infrastructure automation solutions that support OFSE Digital Cloud strategy. You will also be developing improving, deploying, and support Cloud services.
As a Senior Site Reliability Engineer, you will be responsible for:
- Demonstrating best practices pertaining to Cloud DevOps development along with a willingness to continually learn Cloud native technologies.
- Following security guidelines to develop secure and compliant Cloud services by working with Risk and Security teams.
- Monitoring configuration management, platform layout, and hosting infrastructure.
- Automating deployment of applications and infrastructure
- Be able to work independently and in a team environment managing a range of customers and technical situations.
- Providing technical application support for enterprise-level systems
- Running our infrastructure with Chef, Ansible, Terraform, Github CI/CD, and Kubernetes
- Participating in Capacity planning, system performance monitoring, resource utilization trending and incident and change management.
- Co-ordinating with Cloud infrastructure partners for Server, Network, Database, service-related incidents, and projects
- Deploying application upgrades/patches in production and test environments
- Troubleshooting application alerts, Azure and AWS Policy from monitoring tools and code inspection and performing RCAs
- Writing tutorials, how-to videos, and other technical articles for the customer community and knowledgebase articles and keep them up to date
- Working on critical, complex customer problems that may span multiple services
- Participating in 24x7 on-call rotation and working with global teams
- Collaborating with cross functional stakeholders
- Providing mentorship and guidance to team members
- Ensuring security best practices are integrated into the development lifecycle, including compliance with data protection regulations.
- Collaborating with stakeholders to understand requirements, set priorities, and communicate progress and challenges.
Official notification