Senior Site Reliability Engineer I (NM+)
coe | 2 days ago | Bangalore,

  • Is responsible to build software applications by using relevant development languages and applying knowledge of systems, services and tools appropriate for the business area and guide more junior members of the team in this topic.

  • Is responsible to refactor and simplify code by introducing design patterns when necessary and guide more junior members of the team in this topic.

  • Is responsible to ensure the quality of the application by following standard testing techniques and methods that adhere to the test strategy

  • Is responsible to write readable and reusable code by applying standard patterns and using standard libraries

  • Is responsible to maintain data security, integrity and quality by effectively following company standards and best practices

Software Systems Design

  • Is responsible to evaluate possible architecture solutions by taking into account cost, business requirements, technology requirements and emerging technologies

  • Is responsible to describe the implications of changing an existing system or adding a new system to a specific area, by having a broad, high-level understanding of the infrastructure and architecture of our systems

  • Is responsible to help grow the business and/or accelerate software development by applying engineering techniques (e.g. prototyping, spiking and vendor evaluation) and standards

  • Is responsible to meet business needs by designing solutions that meet current requirements and are adaptable for future enhancements

End to End System Ownership

  • Is responsible to reduce business continuity risks and bus factor by applying state-of-the-art practices and tools, and writing the appropriate documentation such as runbooks and OpDocs

  • Is responsible to reduce risk and obtain customer feedback by using continuous delivery and experimentation frameworks

  • Is responsible to independently manage an application or service by working through deployment and operations in production and guide more junior members of the team in this topic.

  • Is responsible to maintain data security, integrity and quality by effectively following company standards and best practises

Technical Incident Management

  • Is responsible to address and resolve live production issues by mitigating the customer impact within SLA

  • Is responsible to improve the overall reliability of systems by producing long term solutions through root cause analysis

  • Is responsible to keep track of incidents by contributing to postmortem processes and logging live issues

Automation and toil reduction

  • Is responsible to ensure that infrastructure stays current by reducing technical debt, searching for bottlenecks and preparing for scaling

  • Is responsible to reduce cost of operations and maintenance by leveraging new technologies, automation, and partner with vendors to ensure we stay current

  • Is responsible to reduce human labour by writing small software features that address availability, scalability, latency and efficiency

Monitoring and Alerting improvements

  • Is responsible to review and verify performance of production systems and network infrastructure by continuously monitoring appropriate observability metrics, business KPIs and capacity planning

  • Is responsible to improve application reliability by partnering with development teams to advise on setting appropriate observability metrics

Critical Thinking

  • Is responsible to systematically identify patterns and underlying issues in complex situations, and to find solutions by applying logical and analytical thinking.

  • Is responsible to constructively evaluate and develop ideas, plans and solutions by reviewing them, objectively taking into account external knowledge, initiating 'SMART' improvements and articulating their rationale.

Continuous Quality and Process Improvement

  • Is responsible to identify opportunities for process, system and structural improvements (i.e performance gains) by examining and evaluating current process flows, methods and standards.

  • Is responsible to design and implement relevant improvements by defining adapted/new process flows, standards, and practices that enable business performance.

Effective Communication

  • Has sufficient knowledge to deliver clear, well-struct Official notification

⚡ Hot Jobs Trending Now

SRE
Sr. SRE Engineer
Stripe | Bangalore, India
DEV
Backend Developer
Coinbase | Remote, India
Infra
Cloud Infra Lead
Datadog | Pune, India
ML
MLOps Architect
Anthropic | Hyderabad
Data
Fivetran Data Eng.
Fivetran | Mumbai
SRE
Sr. SRE Engineer
Stripe | Bangalore, India
DEV
Backend Developer
Coinbase | Remote, India
Infra
Cloud Infra Lead
Datadog | Pune, India
ML
MLOps Architect
Anthropic | Hyderabad
Data
Fivetran Data Eng.
Fivetran | Mumbai
SDE
Staff Software Eng.
Airbnb | Gurgaon, India
Prod
Platform Engineer
Databricks | Bangalore
QA
Quality Assurance
GitLab | Remote
Security
Cloud Security
Zscaler | Mumbai
UX
Product Designer
Figma | Pune, India
SDE
Staff Software Eng.
Airbnb | Gurgaon, India
Prod
Platform Engineer
Databricks | Bangalore
QA
Quality Assurance
GitLab | Remote
Security
Cloud Security
Zscaler | Mumbai
UX
Product Designer
Figma | Pune, India
Contact US

Let's work laptop charging together

Any question or remark? just write us a message

Send a message

If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.