Farooq Ahmed
Senior DevOps Engineer Cloud Infrastructure & Reliability Specialist
215-***-**** ***********.*****@*****.*** Harrisburg, PA (17101), USA SUMMARY
Strategic and performance-driven Senior DevOps Engineer with over 10 years of experience designing, implementing, and optimizing secure, scalable, and high-availability cloud infrastructures across AWS, Azure, and GCP. Adept at modern DevOps methodologies including CI/CD automation, Infrastructure as Code (IaC), and container orchestration, delivering faster and more reliable software deployments. Skilled in Site Reliability Engineering (SRE) principles and cloud-native architecture, driving operational excellence, uptime, and cost efficiency across complex enterprise environments. Recognized for building automation frameworks, integrating DevSecOps practices, and promoting an observability-first culture to ensure security, compliance, and performance at scale. A strong collaborator and mentor, experienced in guiding cross-functional teams, optimizing delivery pipelines, and leading cloud modernization projects that align with business growth and compliance standards such as HIPAA and SOC2. SKILLS
CI/CD Pipelines (Jenkins, GitHub Actions, GitLab CI) Designed and maintained automated pipelines for
multiple teams, reducing release times and minimizing deployment errors.
Kubernetes (EKS, GKE, AKS)
Managed scalable Kubernetes clusters and implemented Helm charts to streamline deployments and improve
service uptime.
Cloud Platforms (AWS, Azure, GCP)
Delivered multi-region, cost-efficient, and secure cloud environments, achieving measurable savings through right-sizing and automation.
Monitoring & Observability (Prometheus, Grafana,
ELK, Datadog)
Built dashboards and alerting systems that improved visibility, reduced response times, and enhanced
reliability.
Security & Compliance (IAM, Vault, container
security)
Enforced access controls, secrets management, and
vulnerability scanning to protect infrastructure and data. Release Management & Feature Flags
Introduced feature flag frameworks for safer deployments and controlled rollouts.
Database & Storage Management (RDS, CosmosDB)
Automated database backups and scaling, maintaining strong recovery objectives and uptime.
Cost Optimization (FinOps)
Monitored cloud usage and implemented automated
scaling policies that reduced operational costs.
Containerization (Docker)
Standardized application packaging and environments by containerizing over 60 microservices, improving
consistency across development and production.
Infrastructure as Code (Terraform, Bicep, ARM)
Automated provisioning and configuration of
infrastructure, enabling repeatable and version-controlled environment setups.
Configuration Management (Ansible, Chef)
Automated routine setup and maintenance tasks,
improving consistency and reducing manual workload across 100+ servers.
Logging & Tracing (ELK Stack, Jaeger)
Implemented centralized logging and tracing solutions that simplified troubleshooting and root cause analysis. Automation & Scripting (Python, Bash, PowerShell)
Wrote automation scripts to manage deployments,
infrastructure, and incident response tasks.
Networking & Load Balancing (VPC, NGINX, CDN)
Designed secure network topologies and optimized traffic routing to ensure reliable user experiences.
Site Reliability Engineering (SRE)
Established SLOs, SLIs, and error budgets to guide reliability improvements and reduce downtime.
PROFESSIONAL EXPERIENCE
Gremlin
Senior DevOps Engineer
07/2021 – Present
•Led the design and implementation of scalable, secure CI/CD pipelines across hybrid multi-cloud infrastructures (AWS
& Azure), ensuring standardized deployments and reliable release cycles.
•Migrated legacy monoliths to microservices with Docker and Kubernetes, improving deployment agility and reducing downtime.
•Built modular Terraform and Ansible frameworks for environment provisioning and compliance automation.
•Integrated DevSecOps tools (Aqua, Trivy, Vault) into build pipelines, enforcing security and compliance.
•Developed observability stacks with Prometheus, Grafana, Loki, reducing mean time to recovery (MTTR) by forty percent.
•Provided architectural leadership, mentoring junior engineers, and promoting infrastructure automation culture.
•Enhanced compliance and security with automated scanning and compliance-as-code initiatives. ChaosSearch
DevOps & Cloud Infrastructure Engineer
01/2018 – 05/2021
•Designed and implemented HIPAA-compliant cloud-native infrastructure on AWS using Terraform, Ansible, and CloudFormation.
•Built automated CI/CD pipelines with GitHub Actions and Jenkins, accelerating deployments by thirty-five percent.
•Centralized monitoring with ELK Stack and Fluentd, improving root cause analysis and reducing incident resolution time by fifty percent.
•Automated IAM policy management and cloud cost optimization strategies, cutting infrastructure spend by twenty percent.
•Collaborated with QA and compliance teams to integrate Snyk and OWASP ZAP for proactive vulnerability detection. Lightstep
Site Reliability Engineer (SRE)
011/2015 – 12/2017
•Managed mission-critical healthcare platforms ensuring ninety-nine point nine nine percent uptime and full compliance with HIPAA and SOC Two.
•Built comprehensive monitoring and alerting using Datadog, Prometheus, and Grafana, improving visibility into SLIs and SLOs.
•Implemented blue-green and canary deployments via Kubernetes, enabling zero-downtime releases.
•Automated incident response workflows with PagerDuty and Python scripts, reducing response time by forty-five percent.
•Spearheaded migration from on-premises systems to AWS and GCP, enhancing scalability and cutting costs by twenty- five percent.
PROJECTS
Healthcare Microservices Infrastructure
Designed and deployed a HIPAA-compliant, containerized healthcare analytics platform using Docker, AWS ECS, RDS, and VPC-based secure networking. Automated provisioning with Terraform and integrated observability with Prometheus and Alertmanager.
Multi-Cloud CI/CD Framework
Built a unified GitLab CI/CD pipeline supporting deployments to both AWS and Azure, with integrated vulnerability scanning (SonarQube, Trivy, Aqua). Automated GitOps workflows with ArgoCD for secure multi-environment promotion. CERTIFICATES
AWS Certified Solutions Architect –
Associate
Certified Kubernetes Administrator
(CKA)
HashiCorp Certified: Terraform
Associate
EDUCATION
Bachelor of Science in Computer Science