Post Job Free
Sign in

Senior DevOps / SRE Engineer with 6+ Years Experience

Location:
Dallas, TX, 75203
Salary:
95000
Posted:
May 07, 2026

Contact this candidate

Resume:

MATTHEW OLADIPUPO

Senior Site Reliability/DevOps Engineer

Location: Dallas, TX Email: ************@*****.*** Phone: +1-469-***-****

Professional Summary:

As a results-driven Senior Site Reliability and DevOps Engineer with 6+ years of experience building, automating, and managing scalable cloud environments across AWS, Azure, and GCP. Skilled in Infrastructure as Code using Terraform and CloudFormation, and in designing efficient CI/CD pipelines that improve deployment speed and reliability. Experienced in Kubernetes, containerization, and implementing monitoring and observability solutions to quickly identify and resolve issues. Strong background in SRE practices, including incident management, performance optimization, and maintaining high system uptime. Focused on DevSecOps and cloud security best practices to ensure secure, compliant, and cost-effective systems, while collaborating with cross-functional teams to deliver reliable, high-performing applications.

Professional Expertise:

Expertise in multi-cloud platforms including AWS, Azure, and GCP with strong focus on high availability, fault tolerance, and cost optimization

Proficient in Infrastructure as Code (IaC) using Terraform, AWS CloudFormation, Pulumi, Ansible, and Packer for scalable automation

Strong experience in CI/CD pipeline design and automation using Jenkins, GitHub Actions, GitLab CI/CD, Azure DevOps, ArgoCD, and Tekton

Skilled in containerization and orchestration using Docker, Kubernetes (K8s), Helm, OpenShift, Istio, and container security best practices

Advanced knowledge of monitoring and observability tools including Prometheus, Grafana, ELK Stack, Datadog, Splunk, New Relic, and AWS CloudWatch

Hands-on experience in Site Reliability Engineering (SRE) practices including SLA, SLO, SLI, incident management, RCA, and chaos engineering

Strong programming and scripting skills in Python, Bash, Shell, Go, YAML, and JSON for automation and system integration

Expertise in Linux systems and networking fundamentals including TCP/IP, DNS, load balancing, VPN, firewalls, and HTTP/HTTPS protocols

Deep understanding of security and DevSecOps practices including IAM, HashiCorp Vault, OAuth, SSL/TLS, SAST/DAST, and compliance standards (SOC 2, HIPAA)

Experience with cloud-native architectures including microservices, REST APIs, serverless computing, event-driven systems, and database technologies like MySQL, PostgreSQL, MongoDB, DynamoDB, and Redis

Skills:

Cloud Platforms & Multi-Cloud:

Infrastructure as Code (IaC):

CI/CD & DevOps Automation:

Containers & Orchestration:

Monitoring, Observability & Logging:

Site Reliability Engineering (SRE):

Programming & Scripting:

Version Control & Collaboration:

Operating Systems & Networking:

Security & DevSecOps:

Cloud-Native & Microservices:

Configuration Management & Automation:

Database & Storage:

Agile, SDLC & Collaboration:

Emerging & Advanced Practices:

AWS (EC2, S3, RDS, Lambda, VPC, IAM, EKS, ECS, CloudFront), Microsoft Azure, Google Cloud Platform (GCP), Hybrid and Multi-Cloud Architecture, Cloud Cost Optimization, High Availability, Fault Tolerance

Terraform, AWS CloudFormation, Pulumi, Ansible, Packer, Infrastructure Automation, Immutable Infrastructure

Jenkins, GitHub Actions, GitLab CI/CD, Azure DevOps, ArgoCD, Tekton, CI/CD Pipeline Design, Continuous, Integration, Continuous Delivery

Docker, Kubernetes (K8s), Helm, OpenShift, Container Security, Service Mesh (Istio)

Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Datadog, Splunk, New Relic, AWS CloudWatch, Distributed Tracing, Alerting, Incident Response

SLA, SLO, SLI, Error Budgets, Incident Management, Root Cause Analysis (RCA), Chaos Engineering, Reliability Engineering

Python, Bash, Shell Scripting, Go, YAML, JSON, API Automation

Git, GitHub, GitLab, Bitbucket, Branching Strategies, Code Review Practices

Linux (RHEL, Ubuntu, CentOS), TCP/IP, DNS, Load Balancing, Firewalls, VPN, HTTP, HTTPS, Networking Fundamentals

IAM, Secrets Management (HashiCorp Vault), OAuth, SSL/TLS, Container Security, Static and Dynamic Code Analysis, Compliance (SOC 2, HIPAA)

Microservices Architecture, REST APIs, Serverless Computing, Event-Driven Architecture, API Gateway

Ansible, Chef, Puppet, System Automation, Configuration Drift Management

MySQL, PostgreSQL, MongoDB, DynamoDB, Redis, Data Backup, Disaster Recovery

Agile, Scrum, Kanban, Jira, Confluence, DevOps Culture, Cross-Functional Collaboration

GitOps, Platform Engineering, AIOps, MLOps Fundamentals, Observability Engineering, FinOps, Internal Developer Platforms (IDP)

Education:

Master of Science in Data Science

University of Salford

Bachelor of Technology in Electronic & Electrical Engineering

Ladoke Akintola University of Technology

Certifications:

AWS Certified Solutions Architect – Associate

AWS Certified Cloud Practitioner

Certified Kubernetes Administrator

Certified Kubernetes Security Specialist

Certified Kubernetes Application Developer

Kubernetes and Cloud Native Security Associate

Kubernetes and Cloud Native Associate

HashiCorp Terraform Associate

Kubestronaut

Microsoft Certified Trainer

Microsoft Power BI Data Analyst – Associate

Experience:

Dell Technologies

Role: Senior Site Reliability/DevOps Engineer Feb 2023 - Current

Responsibilities:

Engineered and maintained 99.99% system uptime by implementing SLOs, SLIs, and proactive reliability engineering practices across distributed cloud environments

Automated infrastructure provisioning using Terraform and Infrastructure as Code (IaC), reducing environment setup time by 65%

Designed and optimized CI/CD pipelines (Jenkins, GitHub Actions), accelerating deployment frequency by 3x and minimizing release failures

Managed large-scale Kubernetes (EKS) clusters, improving container orchestration efficiency and reducing resource utilization costs by 30%

Implemented end-to-end observability solutions (Prometheus, Grafana, ELK Stack), decreasing incident detection time by 45%

Led incident response and root cause analysis (RCA), reducing Mean Time to Recovery (MTTR) by 40% through automation and runbook standardization

Enhanced cloud security posture using IAM policies, secrets management, and compliance frameworks, achieving 100% audit readiness

Built scalable, fault-tolerant microservices architecture on AWS (EC2, Lambda, S3, RDS) supporting millions of user transactions daily

Optimized system performance through capacity planning and load testing, increasing application response time by 35%

Integrated DevSecOps practices, embedding security scans (SAST/DAST) into pipelines and reducing vulnerabilities by 50%

Collaborated with cross-functional engineering teams to implement blue-green and canary deployments, improving release stability by 60%

Developed automation scripts using Python and Bash, eliminating 70% of manual operational tasks and improving engineering productivity

The ITtern

Role: Data DevOps Engineer Jul 2019 – Jan 2023

Responsibilities:

Architected and maintained data pipelines processing 2TB+ daily data using modern ETL frameworks, ensuring high data availability and integrity

Automated deployment workflows for data platforms using CI/CD tools (GitLab CI, Jenkins), reducing release cycle time by 55%

Built scalable data infrastructure on AWS (S3, Glue, Redshift, Lambda), improving data processing efficiency by 40%

Orchestrated batch and streaming workflows with Apache Airflow, increasing pipeline reliability and reducing job failures by 35%

Optimized SQL queries and data models in Redshift and PostgreSQL, enhancing query performance by 50%

Implemented monitoring and alerting solutions for data systems using CloudWatch and Grafana, cutting downtime by 30%

Developed reusable automation scripts in Python, minimizing manual data handling efforts by 60%

Ensured data security and compliance through encryption, IAM roles, and access controls, achieving 100% policy adherence

Integrated version control and branching strategies for data engineering workflows using Git, improving collaboration efficiency by 45%

Designed fault-tolerant data ingestion frameworks, supporting millions of records per hour with zero data loss

Collaborated with analytics and engineering teams to deliver real-time data solutions, accelerating business insights by 3x

Conducted performance tuning and cost optimization strategies, reducing cloud expenditure by 25% while maintaining scalability



Contact this candidate