DevOps Strategy & Implementation: Drive the adoption and maturity of DevOps practices across development and operations teams, fostering a culture of shared responsibility, automation, and continuous feedback.
Infrastructure Management & Automation: Design, implement, and maintain scalable, reliable, and secure infrastructure solutions. This includes hands-on work with container orchestration platforms such as OpenShift and/or AWS ECS.
CI/CD Pipeline Development & Optimization: Architect, build, and maintain robust Continuous Integration/Continuous Delivery (CI/CD) pipelines. Automate build, test, and deployment processes to accelerate software releases and improve efficiency.
Release Management & Orchestration: Implement and manage advanced release strategies, using tools such as Lightspeed and concepts such as "Release-on-Demand" (ROD) where appropriate, to ensure smooth, predictable, and frequent deployments.
Performance Monitoring & Optimization: Establish and maintain comprehensive monitoring solutions for infrastructure and applications. Proactively identify performance bottlenecks and implement solutions to improve system reliability and efficiency.
DORA Metrics & Reporting: Define, track, and report on key DORA (DevOps Research and Assessment) metrics (e.g., deployment frequency, lead time for changes, mean time to recovery, change failure rate) to measure and improve our software delivery performance.
Issue Resolution & Problem Management: Take ownership of critical infrastructure and deployment-related issues, including Common Vulnerability Management (CVM) findings, and lead investigation, root cause analysis, and remediation to prevent recurrence.
Security & Compliance: Implement and enforce security best practices within CI/CD pipelines and infrastructure, ensuring compliance with relevant standards and policies.
Collaboration & Mentorship: Work closely with development, QA, and operations teams to understand their needs, provide technical guidance, and champion best practices. Mentor junior engineers on DevOps principles and tools.
Tooling & Technology Evaluation: Research, evaluate, and recommend new tools and technologies to enhance our engineering excellence capabilities.
Required Skills & Experience:
At least 5 years of hands-on professional experience in DevOps, Site Reliability Engineering (SRE), or Infrastructure Engineering roles.
Cloud & Container Orchestration Expertise:
Strong practical experience with containerization technologies (Docker) and orchestration platforms like OpenShift (Red Hat OpenShift Container Platform) and/or AWS ECS (Elastic Container Service).
Familiarity with the major cloud platforms (AWS, Azure, GCP) and their relevant compute, networking, and storage services.