
Senior DevOps / SRE Engineer Multi-Cloud Automation & Observability

Location:
Williston, ND
Posted:
December 20, 2025


Resume:

BISMARK HORFFMAN

Senior DevOps / Cloud & Site Reliability Engineer – Multi-Cloud Infrastructure Automation & Observability

Williston, ND (Remote / Open to Relocation) • +1-302-***-**** • *********@*****.*** • LinkedIn: www.linkedin.com/in/bismark-horffman-187773348

SUMMARY

Results-driven Senior DevOps / Cloud & Site Reliability Engineer with 9+ years of experience designing, automating, and maintaining scalable, reliable, and secure multi-cloud infrastructure across AWS (primary), Azure, and GCP. Expert in infrastructure as code (IaC), observability, and DevSecOps, with a proven track record of building self-healing, cost-optimized, and fault-tolerant platforms. Passionate about applying SRE principles (SLIs/SLOs, incident response, error budgets, and automation) to enhance system reliability and performance. Certified in AWS DevOps, Kubernetes (CKA), and Terraform.

CORE SKILLS

Cloud Platforms: AWS (Primary), Azure, GCP

IaC & Automation: Terraform, Ansible, CloudFormation, Pulumi, Packer

CI/CD & DevOps: GitHub Actions, GitLab CI/CD, Jenkins, ArgoCD, Spinnaker, Bamboo

SRE & Observability: OpenTelemetry, Datadog, Prometheus, Grafana, Jaeger, ELK, Splunk, SLIs/SLOs, Error Budgets

Containers & Orchestration: Kubernetes (EKS, AKS, GKE, OpenShift), Docker, Helm, Istio

Security & Compliance: IAM, Vault, Secrets Manager, Kyverno, Trivy, Checkov, Policy-as-Code

Scripting & Tools: Python, Bash, PowerShell, Go, YAML, Git, Jira, Confluence

Networking & OS: Nginx, API Gateway, ALB/NLB, Route53, Linux (RHEL/Ubuntu), Windows Server

PROFESSIONAL EXPERIENCE

DevOps & Site Reliability Engineer – Analog Devices (Mar 2025 – Sep 2025)

• Built multi-cloud GitOps pipelines (ArgoCD + Terraform) with embedded Kyverno & Trivy compliance scans.

• Designed SRE observability stack (OpenTelemetry, Prometheus, Grafana), reducing incident resolution time by 45%.

• Defined SLIs/SLOs and automated alerting with Datadog + PagerDuty to ensure 99.9% service uptime.

• Automated infrastructure provisioning using Terraform + Ansible, improving deployment speed by 60%.

• Introduced chaos testing and fault-injection simulations to validate resilience under failure conditions.

Cloud / DevOps Engineer – Dorkytek (Oct 2022 – Feb 2025)

• Led Azure → AWS migration using Terraform & CloudFormation for scalable, resilient workloads.

• Implemented distributed tracing and metrics pipelines (Datadog, OpenTelemetry, Jaeger), reducing MTTR by 35%.

• Automated patching and compliance enforcement with Python + Ansible and integrated DevSecOps scans (Checkov, Trivy).

• Deployed ArgoCD GitOps workflows, reducing provisioning time by 80%.

• Developed synthetic monitoring jobs to measure SLO compliance and latency performance.

Senior DevOps Engineer – Crestwood Midstream Partners (Feb 2020 – Jun 2022)

• Designed multi-region EKS clusters with unified observability (Prometheus, Fluent Bit, Jaeger).

• Built DR-ready Terraform modules, reducing RTO by 40% and improving fault tolerance.

• Automated GCP workload identity federation and Vault secret rotation.

• Implemented performance tuning, scaling policies, and chaos experiments, improving reliability by 30%.

Build & Release Engineer – Schlumberger (Apr 2017 – Sep 2019)

• Standardized deployment automation using Jenkins + Bamboo, improving throughput by 40%.

• Integrated ELK stack into CI/CD pipelines for performance monitoring and alerting.

• Automated AWS/Azure provisioning using Terraform & Ansible.

Cloud Engineer – Teradata (Feb 2015 – Jan 2017)

• Built Python microservices on GCP with Kubernetes & Istio service mesh.

• Secured workloads using Vault, network policies, and IAM roles.

• Developed Grafana dashboards with Prometheus metrics, cutting troubleshooting time by 40%.

PROJECT HIGHLIGHTS

• Unified Observability Platform: Integrated OpenTelemetry + Jaeger + Datadog, reducing MTTR by 60%.

• SRE Metrics & Automation: Established SLIs/SLOs, automated error budget tracking, and incident response workflows.

• Multi-Cloud GitOps Automation: Built Terraform-driven ArgoCD framework enforcing tagging and policy compliance.

• Resilience & DR Framework: Created multi-region failover strategy with RTO/RPO automation on AWS.

• DevSecOps Pipelines: Embedded Kyverno, Trivy, and Checkov into ArgoCD workflows for continuous compliance.

EDUCATION

B.Sc. Computer Science – Benedict College

CERTIFICATIONS

• AWS Certified DevOps Engineer – Professional

• Certified Kubernetes Administrator (CKA)

• HashiCorp Certified: Terraform Associate

• AWS Certified Security – Specialty


