System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
Collaboration: Work closely with development and operations teams to improve system reliability and performance.
Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.
Qualifications:
Education: Bachelor’s degree in Computer Science, Engineering, or a related field.
Experience: 4+ years of experience in site reliability engineering or a similar role.