Senior Infrastructure Engineer with AWS-first expertise

Location:

Seattle, WA

Posted:

April 30, 2026

Contact this candidate

Resume:

Scott Davis Senior Infrastructure Engineer

***********@*******.*** 1-360-***-**** Seattle, WA 98109 Linkedin.com/in/scottd8152 SKILLS

Cloud Platforms: Amazon Web

Services (AWS), Microsoft Azure,

Google Cloud Platform (GCP)

Infrastructure as Code: Terraform,

AWS CloudFormation, Ansible, Packer

Containers & Orchestration: Docker,

Kubernetes, Helm, Amazon EKS,

Azure AKS, Google GKE, ArgoCD,

FluxCD

CI/CD & DevOps Tools: Jenkins,

GitHub Actions, GitLab CI, CircleCI,

AWS CodePipeline, Azure DevOps,

Bitbucket Pipelines, Bamboo

Programming & Scripting: Python,

Go, Bash, YAML, Shell Scripting

Monitoring & Observability:

Prometheus, Grafana, Amazon

CloudWatch, Datadog, Splunk,

Fluentd, ELK Stack

Security & Governance: IAM, RBAC,

AWS Organizations, SCPs, HashiCorp

Vault, AWS Secrets Manager, OPA,

Kyverno, Falco, Zero Trust, Policy as

Code

Networking: VPC, Route 53, DNS,

Load Balancers (ALB/NLB), CDN, VPN,

TCP/IP, Firewalls

Databases & Storage: PostgreSQL,

DynamoDB, MongoDB, Cloud

Bigtable, Amazon S3

Serverless & Integration: AWS

Lambda, API Gateway, Event-Driven

Architecture

Reliability Engineering: High

Availability, Incident Response, Root

Cause Analysis, Disaster Recovery,

Capacity Planning, SLO/SLI,

Performance Tuning

Cost Optimization: AWS Cost

Explorer, Savings Plans, Reserved

Instances, Rightsizing, FinOps

Collaboration: Technical Leadership,

Cross-Functional Communication,

Mentoring, Documentation, Agile /

Scrum, Remote Team Operations

PROFESSIONAL EXPERIENCE

Senior Infrastructure Engineer, NETFLIX

12/2021 – Present

•Architected AWS-first production infrastructure for high-scale streaming and platform services by standardizing EKS, Terraform, Kubernetes, IAM, VPC networking, CI/CD controls, and reliability patterns, improving service availability to 99.97% across 140+ production workloads.

•Modernized container delivery platforms by enhancing Kubernetes scheduling, Helm releases, ArgoCD deployment flows, autoscaling behavior, and progressive rollout strategies, reducing deployment-related incidents by 31.4%.

•Built reusable self-service infrastructure modules for engineering teams using Terraform, YAML pipelines, RBAC templates, secrets integrations, and standardized service blueprints, reducing environment onboarding time from 11 days to 3.8 days.

•Expanded observability and incident response capabilities through Prometheus, Grafana, CloudWatch, Datadog, Fluentd, and centralized alert correlation, lowering mean time to detect by 37.6% and mean time to recover by 29.8%.

•Improved platform resilience by automating rollback validation, node remediation, workload recovery, and drift correction using Python, Bash, Lambda, and Kubernetes controllers, eliminating 42.7% of recurring operational toil.

•Optimized AWS cloud spend through compute rightsizing, storage lifecycle tuning, Savings Plans adoption, and cluster utilization improvements, reducing annual infrastructure cost by $2.46M while maintaining performance targets.

•Partnered with security, platform, and product leaders to define engineering standards, mentor senior contributors, and scale paved-road adoption across 28 globally distributed teams.

Senior Infrastructure Engineer, ATLASSIAN

02/2016 – 11/2021

•Led infrastructure engineering for Atlassian’s global SaaS platform across AWS- primary, Azure/GCP secondary, and legacy on-prem environments, improving provisioning consistency, deployment reliability, and platform stability by 23.9%.

•Standardized multi-environment Infrastructure as Code by consolidating legacy provisioning processes into Terraform modules, reusable CI/CD pipelines, policy automation, and shared service templates, reducing release lead time for 46 teams by 34.2%.

•Modernized shared container platforms supporting cloud products by strengthening Kubernetes clusters, Helm delivery models, runtime governance, autoscaling, and service networking, increasing deployment success rate to 98.6%.

•Improved production observability and incident response through centralized Splunk, Prometheus, Grafana, log pipelines, alert tuning, and operational runbooks, reducing Sev-1 and Sev-2 incident duration by 27.5%.

•Optimized infrastructure spend across cloud and legacy estates through capacity planning, resource rightsizing, storage cleanup, and workload rebalancing, saving

$1.34M annually while supporting continued growth.

•Collaborated with developers, SREs, and service owners to improve internal platform usability, documentation, and rollout processes, accelerating adoption of shared engineering standards across 52+ squads.

EDUCATION

University of Washington, Master's of

Science, Mechanical Engineering

2010 – 2013

University of Oregon,

Bachelor's Degree, Mathematics

2006 – 2010

DevOps Engineer, AMAZON WEB SERVICES

10/2013 – 01/2016

•Supported cloud infrastructure and deployment automation for AWS platform services by improving template-driven provisioning, release workflows, and operational visibility across core environments, reducing provisioning time by 18.7%.

•Automated repeatable build and deployment tasks using CloudFormation, Jenkins, Python, Bash, Linux scripting, and configuration management tooling, decreasing manual release effort by 36.4% while improving change consistency.

•Enhanced production operability through CloudWatch monitoring, alerting, log aggregation, Route 53 health checks, EC2 autoscaling, and incident runbooks, reducing response time to critical events by 24.1%.

•Worked cross-functionally with developers, support teams, and service stakeholders to resolve infrastructure issues, improve release readiness, and document support procedures, increasing delivery efficiency by 19.3%. SUMMARY

Senior Infrastructure Engineer with 12+ years of experience designing, automating, and securing large-scale cloud platforms across AWS, with additional exposure to Azure and GCP. Proven track record building highly available Kubernetes-based infrastructure, Infrastructure as Code solutions, CI/CD pipelines, observability platforms, and cost-optimized production environments for global technology organizations including Netflix, Atlassian, and Amazon Web Services. Strong expertise in reliability engineering, cloud security, automation using Python/Go/Bash, and leading cross-functional initiatives that improve scalability, operational excellence, and engineering velocity. Adept at driving modern platform strategy in fast-paced remote environments with a strong focus on business impact.

Contact this candidate