Site Reliability Engineering

Company:
Forhyre
Location:
Plano, TX
Posted:
May 06, 2024

Description:

Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape.

To be successful in this role

You'll have the opportunity to design and implement major infrastructure components, systems, and developer-friendly capabilities to improve the availability, scalability, latency, and efficiency of our services

You will provide technical leadership to cross-functional engineering, infrastructure, and product teams, and evangelize cloud best practices while building a culture of reliability and observability

Engage in and improve the end-to-end lifecycle of software development, from inception and design through deployment, operation, and refinement of a highly distributed system running in the public cloud

Serve as a subject matter expert on the SRE mindset, best practices, and cloud-native principles

Scale systems sustainably through automation to improve reliability and velocity

Assist with all aspects of operational security and compliance

Run software performance analysis and system tuning

Design and implement tools to collect data from various sources and provide actionable insights

Participate in critical incident management and timely post-mortems of production incidents to drive practices around blameless analysis, resolution, and continuous improvement work with cross-functional teams

Develop the rest of the team by conducting code reviews, providing mentorship, pairing, and training opportunities

Qualification & Skills

We are looking for a Principal SRE with proven experience running distributed systems at scale in production

You have 15+ years of relevant experience gained in the same or a similar role

Strong knowledge of container orchestration (preferably Kubernetes) and networking technology

Hands-on experience in one or more languages, such as Node.js, Python, Go, Perl, Ruby, and Bash

Experience with SOA, microservices architecture, API management, and enterprise system integrations

Strong production experience with cloud infrastructure (AWS, Azure, and Google Cloud)

Strong sense of ownership and an ability to drive tasks to completion

Experience developing and monitoring distributed systems

Experience working in an Agile environment, with great collaboration skills
