Job Description
Top Skills
SRE Observability lead with hands on experience in App Dynamics, App Insights, Uptime, Science Logic, Grafana, Prometheus.
**Reliability Team**
10+ years of hands on experience in Observability, Production Governance, Risk Management, and Delivery Excellence Tools: AWS CLoudWatch, New Relic, Splunk, DataDog
Communicating across teams and influencing people and decisions to be made
Technical hands on experience as well as leadership capabilities
Ability to persuade and talk to Executive leadership teams
At least 2 Cloud certifications needed.
This SRE team will oversee Dive Deep & Align Advisory sessions, Build SRE program leaders internally. Recruit, develop talent for client assignments. This Practice Architect will be tactical and ideate best SRE practices in observability, production governance, risk management, and delivery excellence to maintain a "Well Managed" team (Meetings SLAs and SLOs and being proactive > reactive).
Job Description
• 10+ years of experience as a technology leader with 3+ years of hands-on experience with AWS or Azure or GCP Cloud technologies
• Strong knowledge of Site Reliability Engineering (SRE) concepts and principles
• Strong background in software/system engineering and architecture
• Experience utilizing monitoring solutions, such as Datadog, Dynatrace, Splunk, Pager Duty, AWS CloudWatch and New Relic
• Experience with automating engineering and creating high standards around deployments, logging, monitoring, and alerting for an enterprise
• Strategize and execute a vision around the KPI’s & Metrics defined for the organization
Candidate Ideally have most of the following:
· AWS / AZURE / GCP
· Python
· Linux
· Puppet / Chef / Ansible
· Terraform
· Docker/ Kubernetes
· CI /CD (Automation, Metrics)
· Observability (Datadog / Dynatrace / Sysdig / Aqua)
· SEIMCompany Description
please visit our site nobletechies.com.
Full-time