Manage a small team of SREs supporting commercial SaaS platforms in the health and wellness space- 100% Remote
This Jobot Job is hosted by: Charles Simmons
Salary: $165,000 - $190,000 per year
A bit about us:
This mid sized SaaS organization powers health & wellness throughout the world. Every day their members focus their passion and expertise in helping health & wellness facilities operate efficiently and engage their members.
Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC and every type of organization in between we build solutions that make every aspect of running and being a member of a health and wellness organization easier and delightful.
Why join us?
We truly care for our team members, and this is reflected through our offices, benefits, and great perks.
Flexible paid time off
Affordable health, dental, and vision insurance options
Monthly fitness reimbursement
401(k) matching
New-Parent Paid Leave
1-month paid sabbatical every 5 years
up to 100% telecommute or hybrid work in one of the offices
Job Details
We are seeking a dynamic and experienced Site Reliability Engineering Manager to join our team in the Technology industry. As the SRE Manager, you will be responsible for ensuring the reliability, availability, and scalability of our systems and infrastructure. You will work closely with cross-functional teams to design, implement, and maintain our infrastructure and applications. The successful candidate will have a strong background working in environments build on technologies like Linux, VMware, AWS, Azure, Docker, Kubernetes, Redis, RabbitMQ, monitoring, GitLab CI, Jenkins, Terraform, ElasticSearch, Rancher, Python, Bash, and Lambdas.
Responsibilities:
Lead a team of SREs to ensure the reliability, availability, and scalability of our systems and infrastructure
Design, implement, and maintain our infrastructure and applications
Develop and implement monitoring and alerting systems to ensure the health of our systems and infrastructure
Collaborate with cross-functional teams to optimize our systems and infrastructure
Manage incident response and resolution processes
Develop and maintain disaster recovery plans
Ensure compliance with security and regulatory requirements
Continuously improve our processes and infrastructure to increase efficiency and reduce downtime
Qualifications:
Bachelor's degree in Computer Science, Engineering, or related field
3+ years of experience in Site Reliability Engineering or related field
Strong background in Linux, VMware, AWS, Azure, Docker, Kubernetes, Redis, RabbitMQ, monitoring, GitLab CI, Jenkins, Terraform, ElasticSearch, Rancher, Python, Bash, and Lambdas.
Experience leading a team of SREs
Strong problem-solving skills and ability to work in a fast-paced environment
Excellent communication and collaboration skills
Experience with agile methodologies and DevOps practices
Knowledge of security and regulatory requirements and best practices
Ability to manage incident response and resolution processes
Experience developing and maintaining disaster recovery plans
Strong commitment to continuous improvement and learning.
Jobot is an Equal Opportunity Employer. We provide an inclusive work environment that celebrates diversity and all qualified candidates receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
Sometimes Jobot is required to perform background checks with your authorization. Jobot will consider qualified candidates with criminal histories in a manner consistent with any applicable federal, state, or local law regarding criminal backgrounds, including but not limited to the Los Angeles Fair Chance Initiative for Hiring and the San Francisco Fair Chance Ordinance.
Permanent