Isaac Chandler
Senior DevOps Engineer • Site Reliability Engineer • Cloud Infrastructure
Plano, TX 75074, USA • +1-914-***-**** • *************.***@*****.*** • https://www.linkedin.com/in/isaac-chandler/
SUMMARY
Results-driven DevOps & Site Reliability Engineer with over 9 years of hands-on experience designing, automating, and scaling secure cloud-native infrastructure across AWS, Azure, and hybrid environments. Proven track record in building and optimizing CI/CD pipelines, implementing Infrastructure as Code, and managing containerized applications using Docker and Kubernetes. Adept at enhancing system reliability, observability, and deployment velocity through modern DevOps and SRE practices. Skilled in cross-functional collaboration, cloud security, and performance tuning to support fast-paced, high-availability engineering environments. Committed to continuous improvement, automation, and delivering robust, scalable infrastructure solutions.
SKILLS
DevOps & CI/CD
• CI/CD Tools: Jenkins, GitLab CI, GitHub Actions, Travis CI, CircleCI, Bamboo, Azure DevOps
• DevOps Practices: Infrastructure as Code (IaC), GitOps, Agile/Scrum methodologies
• Version Control: Git, GitHub, Bitbucket
• Build & Artifact Management: Maven, Gradle, Nexus, Artifactory, NPM
Infrastructure as Code & Configuration Management
• IaC Tools: Terraform, AWS CloudFormation, Pulumi
• Configuration Management: Ansible, Chef, Puppet, SaltStack
Containerization & Orchestration
• Docker, Kubernetes, Helm, OpenShift, AWS ECS
Cloud Platforms
• Amazon Web Services (AWS): EC2, S3, Lambda, RDS, EKS, ECS, IAM, VPC, CloudWatch, Route 53, CloudFormation, Secrets Manager
• Microsoft Azure: AKS, Azure DevOps, App Services, Monitor, Key Vault
• Google Cloud Platform (GCP): GKE, Cloud Functions, Cloud Build, Stackdriver
Monitoring, Logging & Observability
• Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, New Relic, Splunk
Security & Compliance
• Security Tools: SonarQube, Snyk, Aqua, Trivy, Checkov
• Secrets Management: HashiCorp Vault, AWS Secrets Manager, SOPS
• Compliance Standards: HIPAA, SOC 2, PCI-DSS, CIS Benchmarks, DevSecOps practices
Scripting & Programming
• Languages: Python, Bash, Go, Groovy, PowerShell, SQL
• Automation: Shell scripting, Python-based orchestration and tooling
Operating System
• Linux (Ubuntu, CentOS, RHEL), Windows Server
EXPERIENCE
Costco IT Remote Jun 2021 – Present
Lead DevOps Engineer
• Deployed 3 new microservices per week through optimized CI/CD pipelines built on GitLab CI, GitHub Actions, and Jenkins, improving delivery pipeline efficiency.
• Led infrastructure provisioning with Terraform and AWS CloudFormation, managing resources across AWS (EC2, ECS, EKS, RDS, Lambda) and Azure (AKS, App Services).
• Built and maintained Kubernetes clusters using Istio for traffic management and service mesh capabilities, reducing inter-service communication latency by 30% and enhancing overall system resilience.
• Built custom Grafana dashboards for critical application metrics, which surfaced the three biggest causes of crashes and led to fixes that improved the user experience.
• Applied security best practices using SonarQube, Trivy, and HashiCorp Vault, supporting PCI-DSS and SOC 2 compliance across environments.
• Automated operational tasks with Python and Bash scripts and tooling, reducing manual operations work by over 40%.
IET CONSULTING Nov 2018 – Jun 2021
Senior DevOps Engineer
• Spearheaded the creation of Infrastructure-as-Code (IaC) using Ansible and Terraform, cutting down infrastructure provisioning time by 60% across AWS and GCP for healthcare applications.
• Constructed a self-healing Kubernetes cluster for containerized workloads, reducing incident response time by 60% and improving the security posture of patient data across hybrid cloud setups.
• Established self-service CI/CD pipelines with Jenkins and Azure DevOps, integrating SonarQube, Snyk, and Checkov for continuous security validation, which cut newly introduced vulnerabilities by 45% and reduced security incidents.
• Integrated ELK Stack with existing monitoring tools, creating a unified dashboard for real-time insights into system performance and security, enabling faster troubleshooting and reducing mean time to resolution (MTTR) by 40%.
• Designed a custom anomaly detection system integrated with Datadog and CloudWatch that uncovered 10+ performance bottlenecks affecting application responsiveness, leading to an improved user experience.
• Accelerated Agile workflows by creating standardized Jira workflows and Confluence templates, leading to a 35% improvement in sprint completion rates and better cross-team alignment.
Redix Code Mar 2016 – Nov 2018
Site Reliability Engineer
• Ensured high availability, scalability, and reliability of production systems by designing and maintaining fault-tolerant infrastructure across hybrid cloud environments.
• Developed and maintained automation scripts and infrastructure-as-code (IaC) using tools such as Ansible, Terraform, and Python to streamline deployments and reduce human error.
• Monitored system performance and service-level indicators using Prometheus, Grafana, and ELK Stack, proactively identifying and resolving reliability bottlenecks.
• Collaborated cross-functionally with development and operations teams to implement robust CI/CD pipelines using Jenkins, Git, and Docker, accelerating delivery cycles.
• Optimized cloud infrastructure on AWS by implementing auto-scaling, load balancing, and cost-control measures, resulting in a 25% reduction in operational expenses.
• Implemented robust security controls, including IAM policies, network firewalls, and automated patching to enforce infrastructure compliance and protect sensitive data.
• Mentored junior engineers on SRE principles, fostering a culture of reliability, automation, and continuous improvement within the operations team.
PROJECTS
Cloud-Native CI/CD Platform with GitOps & Kubernetes
Lead DevOps Engineer Tools: AWS, EKS, ArgoCD, Terraform, Helm, GitHub Actions
• Designed and implemented a fully automated CI/CD platform for 20+ microservices deployed on EKS using GitOps practices.
• Built reusable Terraform modules for AWS infrastructure provisioning, integrated with ArgoCD and Helm charts to streamline deployment pipelines.
• Implemented automated rollbacks, canary deployments, and custom health checks, reducing deployment-related incidents by over 70%.
• Established unified observability using Prometheus, Grafana, and Alertmanager, cutting mean time to detection and resolution by 45%.
Digital Intakes
Lead DevOps Engineer Tools: React, DevOps, AWS, Terraform, GitHub Actions
• Led the end-to-end DevOps strategy for a secure, cloud-native digital intake platform, enabling seamless form management without disrupting client software ecosystems.
• Architected and implemented highly scalable infrastructure on AWS using Terraform, promoting infrastructure consistency, reusability, and compliance across multiple environments.
• Designed robust CI/CD pipelines using GitHub Actions to automate build, test, and deployment workflows for the React-based frontend, ensuring rapid and reliable delivery to S3 and CloudFront.
EDUCATION
Bachelor's in Computer Science