A bachelor’s degree in a Computer Science discipline and at least one AWS certification are required (AWS SysOps Administrator preferred).
2+ years of experience as an SRE or DevOps engineer on the AWS Cloud platform.
Ability to work with minimal guidance or supervision from Senior Site Reliability Engineers.
Demonstrate high proficiency in automation, system monitoring, and cloud-native applications.
Ability to code in at least one programming language (Java, C#, Python, JavaScript, etc.).
Ensure the highest level of uptime, keep server patching and SSL certificates up to date, and implement system-wide corrections to prevent recurrence of issues.
Triage, troubleshoot, and resolve issues using the golden signals, and go beyond them (Chaos Engineering, GameDays, etc.).
Create and develop documentation on application services, infrastructure details, recovery procedures, and root cause analyses.
Proactively monitor the availability, latency, scalability, and efficiency of all services.
Perform periodic on-call duty as part of the SRE team on a rotational basis.
Experience with SQL, Windows Server, load balancers, Linux, Docker, Kubernetes, networking concepts, and AWS services such as CloudFormation, CloudWatch, CodeDeploy, DynamoDB, Lambda, SQS, Amazon FSx, and Elasticsearch is a must.
Working knowledge of infrastructure components (e.g., routers, load balancers, cloud products, container systems, compute, storage, and networks).
Experience with tools such as Jenkins, Ansible, GitHub, PagerDuty, ServiceNow, Datadog, and CloudWatch is required.