Post Job Free
Sign in

Senior Reliability Engineer

Company:
The Hartford
Location:
Clinton Township, OH, 43224
Posted:
February 17, 2026
Apply

Description:

Senior Reliability Engineer - IE07KE

Are you passionate about ensuring the reliability and performance of critical systems? Join our team and make a meaningful difference at an innovative insurance company that goes beyond just coverage and policies. We offer an environment where you can achieve your professional goals while helping others succeed.

Position Overview:

The Senior Reliability Engineer will play an essential role in enhancing the stability, performance, and scalability of our services. This key position is responsible for applying best practices in reliability engineering, fostering continuous improvement, and mentoring team members. We are looking for candidates with strong technical expertise, exceptional problem-solving skills, and a commitment to building resilient infrastructure.

Key Responsibilities

Lead the design, implementation, and optimization of robust systems and infrastructure.

Collaborate effectively with software engineering, operations, and product teams to meet uptime and availability goals.

Develop and maintain effective monitoring, alerting, and incident response strategies to swiftly identify and resolve issues.

Conduct thorough root cause analyses of system failures and implement corrective actions to prevent recurrence.

Promote reliability best practices and create a culture of proactive risk management throughout the organization.

Mentor and guide fellow reliability engineers and cross-functional team members.

Create automation tools to streamline deployment, monitoring, and recovery processes.

Engage in capacity planning, performance testing, and disaster recovery drills.

Stay updated on industry trends, emerging technologies, and reliability engineering best practices.

Qualifications

5+ years of experience in reliability engineering, site reliability engineering (SRE), or similar roles.

Expertise with cloud platforms (e.g., AWS, Azure, Google Cloud) and container orchestration (e.g., Kubernetes).

Strong programming skills in one or more languages such as Python or Java.

Demonstrated experience with logging and monitoring tools (e.g., Splunk, Dynatrace, Datadog) and incident management frameworks (e.g., ServiceNow).

Excellent analytical, troubleshooting, and communication abilities.

Proven capability to lead complex projects and effectively influence stakeholders at all levels.

Preferred Skills

Experience with infrastructure as code (e.g., Terraform, CloudFormation).

Understanding of security best practices and compliance requirements.

Familiarity with high-availability architectures and distributed systems.

Certifications in cloud or reliability engineering domains are a plus.

Work Environment

This position may involve participation in an on-call rotation and occasional after-hours support for critical incidents. We provide a dynamic, collaborative environment where innovation and reliability are highly valued.

This role will follow a hybrid work schedule, with an expectation to work in an office (Columbus, OH, Chicago, IL, Hartford, CT, or Charlotte, NC) at least 3 days a week (Tuesday through Thursday).

Applicants must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.

Compensation

The annualized base pay range for this role is $127,600 - $191,400, based on an analysis of comparable positions in the market. Actual base pay may vary based on performance, skills, and competencies. This range is just a part of The Hartford's total compensation package. Additional rewards may include bonuses, long-term incentives, and on-the-spot recognition.

We are an Equal Opportunity Employer and value diversity in our workforce. We encourage applicants of all backgrounds to apply.

Every day presents an opportunity to make a difference. Join us in shaping the future.

Apply