Provide L2/L3 support for production Kubernetes clusters and AWS infrastructure
* Troubleshoot complex issues across networking, compute, storage, IAM, and containerized workloads
* Lead incident response, perform root cause analysis (RCA), and document post-mortems
* Monitor system health, respond to alerts, and ensure SLA/SLO compliance
* Manage and maintain Infrastructure as Code (Terraform)
* Develop automation and operational scripts using Bash, Python, or Go
* Support CI/CD pipelines and deployment workflows
* Collaborate with engineering teams to resolve systemic issues and improve reliability
* Participate in on-call rotation as needed
Official notification
Any question or remark? just write us a message
If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.