Devanand Yerra
DevOps Engineer
United States +1-779-***-**** ******************@*****.*** LinkedIn SUMMARY
• DevOps Engineer with 5+ years of experience designing and implementing scalable, cloud-native infrastructure and automation solutions across AWS, GCP, and Azure.
• Proven expertise in building and optimizing CI/CD pipelines (Jenkins, GitHub Actions, GitLab, Spinnaker) to accelerate release cycles and improve developer productivity.
• Strong background in containerization and orchestration using Docker, Kubernetes (EKS, GKE, OpenShift), and Istio, ensuring high availability for mission-critical applications.
• Skilled in infrastructure automation with Terraform, CloudFormation, Ansible, and Chef, driving consistency, reducing provisioning time, and optimizing cloud costs.
• Adept at monitoring, logging, and security compliance (Prometheus, Grafana, ELK, Splunk, Datadog, PCI DSS, GDPR), ensuring system reliability, resiliency, and regulatory adherence. EXPERIENCE
DevOps Engineer, Expedia Aug 2023 – Present, TX
• Automated AWS (EKS, Lambda, S3) and GCP infrastructure using Terraform and CloudFormation, reducing provisioning delays by 45% and enabling faster go-to-market for travel applications.
• Built and optimized CI/CD pipelines with Jenkins, GitHub Actions, and Spinnaker, reducing build times by 30% and accelerating release cycles for global booking systems.
• Managed Kubernetes (EKS) clusters supporting high-traffic travel services, ensuring zero-downtime deployments and improving application reliability during seasonal traffic spikes.
• Implemented Datadog, Prometheus, Grafana, and Splunk dashboards to monitor booking and payment platforms, reducing Mean Time to Recovery (MTTR) by 40%.
• Conducted chaos engineering experiments to identify resilience gaps, strengthening fault tolerance and improving system availability for millions of daily user transactions.
• Partnered with product and developer teams to build self-service deployment pipelines, reducing operational bottlenecks and empowering teams to deploy code independently.
• Optimized auto-scaling strategies for search and booking microservices, ensuring performance stability during peak holiday traffic while cutting cloud costs by 18%.
• Secured infrastructure by enforcing PCI and GDPR compliance, applying system hardening, and embedding automated compliance checks in CI/CD workflows.
• Migrated legacy monolithic booking services into microservices on AWS EKS, reducing release complexity and improving scalability across multiple Expedia brands.
• Automated cost optimization scripts using Python and AWS SDK, identifying underutilized resources and saving ~20% in monthly cloud spend.
• Supported global on-call rotations, proactively resolving incidents across distributed environments, ensuring uninterrupted availability for travelers in 30+ countries.
• Improved team collaboration by documenting best practices in Confluence and managing sprints in Jira, streamlining delivery timelines and aligning DevOps goals with business objectives. DevOps Engineer, Accenture June 2019 – Oct 2022, India
• Automated multi-cloud deployments across AWS (EKS, Lambda, S3) and GCP (GKE, Compute Engine) using Terraform and Python, reducing provisioning time by 50% and accelerating client onboarding.
• Built and maintained CI/CD pipelines with Jenkins and GitLab CI/CD, enabling faster and more reliable releases for multi-service applications, which improved delivery speed for client projects.
• Managed Kubernetes clusters and implemented Istio service mesh, ensuring reliable inter-service communication and enhancing system resilience during peak booking traffic.
• Configured ELK Stack and Splunk for centralized logging, allowing proactive troubleshooting and reducing incident resolution times, which minimized downtime for business-critical applications.
• Developed automation scripts in Python, Ansible, and Chef, ensuring consistent configurations across hybrid environments and significantly reducing environment drift and human error.
• Deployed Prometheus and Grafana dashboards, providing real-time visibility into performance metrics and improving Mean Time to Recovery (MTTR) by 35% across client-facing platforms.
• Partnered with QA teams to integrate Selenium, JUnit, and TestNG into CI/CD pipelines, increasing test coverage and reducing production defects, resulting in more stable customer releases.
• Strengthened infrastructure security by applying hardening practices across Linux and Windows servers, ensuring compliance with client regulatory standards and reducing vulnerabilities.
• Optimized PostgreSQL and MongoDB databases, improving query performance and maintaining high availability during traffic spikes, which improved user experience for large-scale client applications.
• Designed and implemented disaster recovery strategies for cloud-native applications, ensuring uninterrupted service availability and meeting strict SLA requirements for enterprise clients.
• Collaborated using Jira and Confluence for agile project management and documentation, improving workflow transparency and enabling more efficient cross-team coordination.
• Improved system stability by fine-tuning Linux (Ubuntu, Debian) and Windows environments, reducing unplanned outages and cutting down repeat support tickets by operations teams. EDUCATION
Masters in Computer Technology
Eastern Illinois University
B. Tech in computer science and engineering
Institute of Aeronautical engineering
SKILLS
• Scripting & Programming Languages: Python, Bash, Shell Scripting, Perl, JSON, SQL
• Cloud Platforms: AWS (EKS, ECS, Lambda, S3, EC2, RDS, Redshift), Microsoft Azure, Google Cloud Platform
(GCP)
• Infrastructure as Code (IaC): Terraform, AWS CloudFormation, Azure Resource Manager (ARM), Ansible, Chef, Puppet, SaltStack
• Containerization & Orchestration: Docker, Kubernetes (EKS, GKE, OpenShift), Istio Service Mesh
• CI/CD & DevOps Tools: Jenkins, GitHub Actions, GitLab CI/CD, Spinnaker, Bitbucket, Jira, Confluence, Asana
• Monitoring & Logging: Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic
• Databases: PostgreSQL, MySQL, MongoDB, Cassandra
• Testing & QA Automation: Selenium, JUnit, TestNG, REST Assured, Postman, Unit Testing
• Security & Compliance: PCI DSS, GDPR, Infrastructure Hardening, Disaster Recovery Planning
• Operating Systems: Linux (Ubuntu, Debian, Kali), Windows, MacOS
• Methodologies & Soft Skills: Agile, Scrum, Lean, ITIL, Process Automation, Problem-Solving, Collaboration, Communication, Continuous Learning, Code Review, Best Practices