Job Description
About the Role
We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join our infrastructure team. As an SRE, you will be responsible for the scalability, reliability, and performance of our cloud-based services. You will work closely with the IT, Engineering, and Security teams to design and maintain systems that are secure, observable, and cost-efficient, with a strong emphasis on automation and continuous improvement.
Requirements
5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
Strong proficiency with AWS (IAM, EC2, ECS/Fargate, S3, RDS, CloudFormation or Terraform).
Experience with Infrastructure as Code (Terraform preferred).
Experience with GitHub (workflow automation, PR workflows, secrets management).
Hands-on experience with log aggregation and observability tools such as Sumo Logic or an equivalent (Datadog, ELK, etc.).
Familiarity with incident management practices and SRE principles (SLAs, SLOs, error budgets).
Experience with Kubernetes (EKS), Helm, and container orchestration.
Prior experience in fast-paced SaaS or startup environments.
Familiarity with compliance frameworks (SOC 2, HIPAA, etc.).
Benefits
Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401(k), IRA)
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation, Sick & Public Holidays)
Family Leave (Maternity, Paternity)
Short Term & Long Term Disability
Work From Home
Stock Option Plan
Full-time
Hybrid remote