Job Description
Provide technical functional expertise to the Engineering, DevOps, and QA teams to create and implement Continuous Integration and Continuous Delivery (CI/CD) pipeline.
Coach and mentor others on Platform Engineering and DevOps best practices
Provide operational support for applications and utilities.
Design and implement process improvements.
Leverage SRE best practices.
Assist in the deployment of new modules, upgrades, and fixes to the production environment.
Work with open-source technologies as needed.
Work with CI and CD tools, and source control such as GitLab.
Lead the team through continuous improvement of production operations.
Offer technical support where needed and developing automation software to speed incident resolution.
Stay current with industry trends and source new ways for our business to improve.
Implement, configure, and operate tools and products in the DevOps and DevSecOps Toolchain.
Building and maintaining tools, services, and automations associated with deployment and our operations platform, ensuring that all meet our customer service standards and reduce errors.
Continually evaluate our systems and tooling so that they can accommodate growth and continually changing requirements.
Actively troubleshoot any issues that arise in production.
Update our processes/documentation and design new tools and processes as needed.
Deploy product updates as required while implementing integrations when they arise.
Automate our operational processes as needed, with accuracy, and compliant with security standards.
Requirements
1. Must Haves:
1) Experience setting up a SRE practice
2) Must be hands on. It will be automation and operational support over infrastructure setup.
3) Must be knowledgeable and apply SRE best practices
4) Experience with pipelines is required – both CI/CD and data ingestion pipelines
5) Only need to understand these in order to support them: Immuta, Starburst, Collibra, Databricks, Alteryx, and Tableau
6) Experience mentoring others
7) Experience working in an Agile environment
8) Experienced as a SRE in the following:
a. Stability of the application – Tweaking observability and configuration settings to meet customer expectations.
b. Lead with data – Have a data driven mindset
c. Empowering our users and engineers – Ensuring that tools, pipelines are configured along best practices, educating support and engineers on best practices.
d. Automation – Where feasible, automate tasks and processes to reduce engineering toil and reduce errors.
9) CI/CD Tools: e.g., Jenkins, GitLab
10) Familiarity with AWS. The SRE will not be focused on building the infrastructure but must understand it to support it.
11) Orchestration and environment management tools (Puppet, Kubernetes, Ansible, Terraform)
12) Monitoring tools (Splunk, Dynatrace, Datadog).
13) Thorough understanding of APIs, gateways, orchestrators, databases, networking, monitoring, configuration management and security best practices for a production environment.
14) Experience programming and scripting on UNIX / Linux. (i.e., Python or Bash).
15) Experienced in implementing Data and Advanced Analytics solutions, and SaaS or related experience in the Cloud.
Pluses:
1) Experience working with FedRamp
Full-time