We are looking for an experienced DevOps Engineers interested in building, maintaining, and scaling PlaidCloud on Kubernetes. This position requires a keen eye for detail to ensure consistency in provisioning and high availability of resources.
The position is also responsible for supporting many customer deployments through Kubernetes automation such as use of HPAs and VPAs to ensure good resource usage while maintaining a highly responsive system.
Key responsibilities include:
Staging servers and automated provisioning for Kubernetes cluster
Utilize ArgoCD ApplicationSets to manage many isolated deployments
Monitoring and managing databases and file backups using disk snapshots, Google Cloud Storage, and other approaches
Automate restoration of databases and backups
Monitoring and managing RabbitMQ in a high availability configuration
Managing and monitoring a Greenplum database clusters in a high availability shared nothing configuration
Managing and monitoring Redis in a high availability configuration
Monitoring server performance and optimizing server selection and cost
Deploy new machines as required to meet demand. Automate where possible.
Manage SSL certificates, firewalls, load balancers, and other infrastructure through our infrastructure provider
Monitor and test system intrusion processes
Manage and configure DoS prevention tools
Make recommendations and work with software development teams to improve infrastructure usage and simplify design where possible
Deploy and improve logging and monitoring metrics to support insight and prioritize engineering changes
Improve the use of tracking metrics to better understand usage patterns
Work with the software development teams to close testing gaps and support continuous integration processes
Ensure a Kubernetes first approach with highly automated processes, best practices, and good resource usage Ideal Qualifications
Strong familiarity with DevOps processes and approaches, especially with GitOps Kubernetes solutions
Experience with deployment tools and deployment automation
Strong understanding of Linux/Unix configuration processes
Strong understanding of security best practices
Strong understanding of Jenkins and Github Actions automation pipelines
Strong written communications skills Bonus Qualifications
Experience with ArgoCD
Experience with Google Kubernetes Engine (GKE)
Experience with Greenplum databases
Experience with Python
Experience with Redis
Experience with RabbitMQ
Experience with Apache Superset
Experience with network and web application security
Experience with Git/GitHub automation