HPC Architect
San Francisco, California
About Camber
Camber is on a mission to democratize scientific computing from astrophysics to genomics. Currently, distributed scientific workloads are nearly 100% on-premise, making computation for researchers limited in scope, with long iteration cycles. Our fully managed, infrastructure-free platform allows researchers to deploy on-demand fully managed instances for distributed scientific simulations and data analysis. We’re based in San Francisco and backed by top venture capital firms Pear VC and Base 10 Partners.
About The Role
Camber is seeking a visionary, highly motivated science and research computing expert. The HPC Architect is responsible for designing, implementing, and optimizing high-performance computing environments to support complex computational workloads, in our cloud platform. This role combines deep technical expertise with strategic foresight to develop scalable, efficient computing solutions.
Responsibilities
Design and architect high-performance computing infrastructure, including hardware, software, and network components
Evaluate and select appropriate technologies to meet performance, scalability, and efficiency requirements
Develop comprehensive HPC solutions that align with business objectives and research needs
Collaborate with engineers and scientists to quickly iterate on the optimal solution
Optimize existing HPC environments for performance, energy efficiency, and cost-effectiveness
Create detailed documentation of system architecture, configurations, and best practices
Stay current with emerging HPC technologies, frameworks, and methodologies
Develop strategies for system monitoring, maintenance, and upgrading
Qualifications:
Bachelor's degree in Computer Science, Engineering, or related field; advanced degree preferred
5+ years of experience in HPC environments, with demonstrated expertise in architecture design
Strong knowledge of parallel computing, distributed systems, and computational optimization
Experience with job scheduling systems (e.g., Slurm, PBS, LSF)
Proficiency in HPC programming models (MPI, OpenMP, CUDA)
Familiarity with various storage solutions (parallel file systems, object storage)
Understanding of networking technologies relevant to HPC (InfiniBand, OmniPath, etc.)
Knowledge of containerization and virtualization in HPC contexts
Experience with system monitoring and performance analysis tools
Excellent problem-solving and analytical skills
Benefits
Competitive salary and meaningful equity
A flexible vacation policy
Comprehensive health, vision, dental, and life insurance as well as disability benefits
Being part of and building out a world-class team