Responsibilities
As a Site Reliability Engineer, you will:
● Manage and run backend systems like Kubernetes, MySQL and everything in between
● Drive reliability, availability and efficiency improvements to Rubrik's Polaris Cloud Platform
● Apply a good mix of software and system engineering skills
● Participate in on-call rotations across continents, using a follow-the-sun model
● Write and review code, plan and execute upgrades, develop documentation and capacity plans, and debug production issues
● Work cross-functionally with various engineering teams
● Build monitoring tools and automation to increase the efficiency of all teams
● Drive blameless postmortems and operations reviews for core systems and services
● Good written and verbal communication skills
Experience You'll Need:
● BS/MS in Computer Science or equivalent
● Experience in one or more of the following: Golang, Python, Java, Scala, C++
● Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
● Expertise in designing, analyzing and troubleshooting large-scale distributed systems
● Ability to debug and optimize code and automate routine tasks
● Strong operational experience with Unix/Linux operating systems and networking
● Experience with Google Cloud Platform or other public cloud technologies
● At least 3-5 years of experience as a Development, DevOps or Site Reliability Engineer
● Willingness to provide 24/7 coverage
● Strong documentation skills
● Experience working with multiple departments and divisions within an organization
● Strong understanding of databases is a definite plus
● Experience leading support personnel