Are you passionate about building automation that powers large-scale platforms? We're looking for a hands-on Senior Engineer who thrives at the intersection of backend development and DevOps. Join our team to design, build, and operate robust automation for a cutting-edge Kubernetes and HPC environment.
Req.#932419602
RESPONSIBILITIES
Design and develop automation services, agents, and controllers for platform operations (e.g., Kubernetes namespace lifecycle, RBAC provisioning, drift remediation)
Implement integrations with core platform components, such as job-submission workflows, compute pipelines, and CI/CD hooks
Automate provisioning and configuration of infrastructure at scale using Infrastructure-as-Code and configuration management tools
Enhance observability and operational insights by building metrics, logging, and audit pipelines for platform resources
Collaborate closely with platform, SRE, and infrastructure teams to deliver secure, reliable, and integrated solutions
Ensure software quality and maintainability through modular, testable, and resilient automation workflows
REQUIREMENTS
5+ years of backend or systems engineering experience, with strong proficiency in Go and/or Python for Linux environments
Deep knowledge of Kubernetes internals (CRDs, controllers/operators, RBAC, scheduling) and hands-on experience with production clusters
Experience with batch scheduling systems (preferably Volcano or similar HPC frameworks) and job lifecycle management
Proficiency with Infrastructure-as-Code tools (Terraform, Helm, Kustomize, Ansible) and integrating them into CI/CD pipelines
Familiarity with scale-out or high-performance storage systems (e.g., VAST), including automation for provisioning, quotas, and snapshots
Strong understanding of networking, security, secrets/certificate management, service mesh, and observability pipelines
WE OFFER
Medical, Dental and Vision Insurance (Subsidized)
Health Savings Account
Flexible Spending Accounts (Healthcare, Dependent Care, Commuter)
Short-Term and Long-Term Disability (Company Provided)
Life and AD&D Insurance (Company Provided)
Employee Assistance Program
Unlimited access to LinkedIn learning solutions
Matched 401(k) Retirement Savings Plan
Paid Time Off
Legal Plan and Identity Theft Protection
Accident Insurance
Employee Discounts
Pet Insurance
Employee Stock Purchase Program