Staff Site Reliability Engineer

Company:

Protoporos Staffing Services Private Limited

Location:

Bengaluru, Karnataka, India

Posted:

May 15, 2024

Apply

Description:

Opportunity with a leading B2B SaaS product client specializing in cutting-edge data integration solutions

Position Overview: We are seeking a highly skilled and experienced Staff Site Reliability Engineer to join our team. As a Staff SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our data integration products. If you have a proven track record of building and maintaining highly available systems, along with a passion for creating a robust and efficient infrastructure, we want to hear from you

Roles & Responsibilities

Design,develop, implement, and maintain highly scalable and efficient components, systems, and reliable infrastructure

Lead efforts to improve the end-to-end availability, reliability, efficiency and performance of mission-critical services and systems.

Define and measure service level indicators (SLIs), service level agreements (SLAs), and service level objectives (SLOs).

Collaborate with cross-functional teams to define and implement SRE best practices.

Participate in incident response, troubleshooting, and resolution of production issues.

Mentor and guide other engineers in adopting SRE principles and best practices.

Contribute to the continuous improvement of our technology stack and deployment processes.

Requirements:

Bachelor's degree in Computer Science, Software Engineering, or a related field (or equivalent experience).

9+ years experience with at least 6 years in a SRE role building, designing and implementing scalable and reliable platform architectures

Strong programming skills in languages such as Java, Python, Ruby Go or similar

Good understanding of cloud platforms (e.g., AWS, Azure, GCP) and containerization & orchestration technologies.

Experience with infrastructure as code (Terraform / Ansible),automation tools & monitoring tools.

Thorough understanding of cloud service delivery (DevOps) infrastructure ecosystem, operational processes, and orchestration models.

Excellent skills in investigating and troubleshooting complicated systems/platforms, and identifying key points of failure.

Apply

Staff Site Reliability Engineer

Description:

Report this job