Our client is looking for a Site Reliability Engineer to join their team! This position will assist in building the infrastructure and developing the pipelines that bring data assets into a new AWS technology stack using Immuta, Starburst, Collibra, Databricks, Alteryx, and Tableau.
Top skills you need to have:
Experience setting up an SRE practice
Must be hands-on. The work centers on automation and operational support rather than infrastructure setup.
Must know and apply SRE best practices
Experience with pipelines is required – both CI/CD and data ingestion pipelines
Only needs to understand the following well enough to support them: Immuta, Starburst, Collibra, Databricks, Alteryx, and Tableau
Experience mentoring others
Experience working in an Agile environment
Experienced as an SRE in the following:
Stability of the application – Tweaking observability and configuration settings to meet customer expectations.
Lead with data – Have a data-driven mindset.
Empowering our users and engineers – Ensuring that tools and pipelines are configured according to best practices, and educating support staff and engineers on those practices.
Automation – Where feasible, automate tasks and processes to reduce engineering toil and errors.
CI/CD tools (e.g., Jenkins, GitLab)
Familiarity with AWS. The SRE will not be focused on building the infrastructure but must understand it to support it.
Orchestration and environment management tools (Puppet, Kubernetes, Ansible, Terraform)
Monitoring tools (Splunk, Dynatrace, Datadog).
Thorough understanding of APIs, gateways, orchestrators, databases, networking, monitoring, configuration management and security best practices for a production environment.
Experience programming and scripting on UNIX/Linux (e.g., Python or Bash).
Experience implementing data and advanced analytics solutions, plus SaaS or related cloud experience.