Our client, one of the fastest-scaling venture-backed tech startups in the computer vision space, is seeking a highly motivated AI-native Infrastructure/DevOps Engineer to join their fully remote team. This individual will design, secure, and maintain the cloud infrastructure powering production SaaS and machine learning workloads across AWS and GCP. The ideal candidate possesses strong startup experience, thrives under high ownership, and actively uses AI to make their engineering workflows faster and more efficient.
Role
Build and operate scalable, containerized applications using Kubernetes, Helm, and Docker.
Develop and manage infrastructure-as-code (IaC) solutions using Terraform, Bash, and Python.
Manage the infrastructure required for machine learning at scale, including GPUs and support libraries like PyTorch or TensorFlow.
Function as a true developer for DevOps, contributing code (Python, Node.js) directly to product features and platform infrastructure.
Essential Skills
Deep, production-level expertise building and managing containerized applications at scale with Kubernetes.
High proficiency with Terraform, AWS/GCP, and automation scripting.
Solid programming skills in Node.js and Python, with comfort reading and contributing to application code.
Beneficial Skills
5+ years of hands-on experience in fast-paced startup environments, with a strong preference for former founders who have built DevOps or infrastructure tools.
An AI-native approach to development, leveraging LLMs to refactor code, identify security vulnerabilities, and optimize infrastructure.
Base can approach $200k + guaranteed bonus and equity.
If interested, reply and Goliath Partners will be in touch!