SHASHI VANGA
Mobile: +1-203-***-**** Email: **************@*****.***
Location: Texas, USA
PROFESSIONAL SUMMARY
Senior DevOps Engineer with over 8 years of experience designing and operating scalable cloud platforms on AWS and Azure. Adept at streamlining software delivery through automated pipelines, container orchestration, and infrastructure automation using tools like Terraform, Kubernetes, and Docker. Demonstrates consistent success in improving system reliability, tightening security practices across the development lifecycle, and reducing cloud infrastructure costs through smart resource management. Brings strong expertise in monitoring and observability, hybrid cloud networking, and leading incident response efforts that have significantly cut downtime and prevented recurring issues across enterprise environments. Skilled in architecting zero-downtime deployment strategies and driving cultural shifts toward site reliability engineering principles across cross-functional teams. Passionate about eliminating manual toil through intelligent automation and building resilient, self-healing infrastructure that scales confidently with business growth.
TECHNICAL SKILLS
Programming & Scripting Languages: Python, Java, JavaScript, Bash/Shell Scripting, PowerShell, YAML, JSON, XML
Cloud Platforms: Amazon Web Services (AWS), Microsoft Azure, OpenStack, Hybrid Cloud Architecture
AWS Services: EC2, ECS, EKS, Lambda, S3, RDS, DynamoDB, VPC, Route53, CloudFormation, CloudWatch, IAM, SNS, SQS, Auto Scaling, Elastic Load Balancing, EMR, Redshift, CodeDeploy, Trusted Advisor
Infrastructure as Code (IaC): Terraform, AWS CloudFormation, Infrastructure Automation, Modular Templates, State Management
Configuration Management: Ansible, Ansible Tower, Chef, Puppet, Puppet Master, Configuration Automation
CI/CD & Build Tools: Jenkins, Git, GitHub, Bitbucket, Maven, Gradle, Ant, Nexus, TFS, Bamboo, Hudson, AWS CodeDeploy, Pipeline Automation
Containerization & Orchestration: Docker, Kubernetes, Amazon EKS, AWS ECS, Kubernetes Manifests (Deployments, Services, HPA), Container Registry, Multi-stage Dockerfiles
Monitoring & Logging: ELK Stack (Elasticsearch, Logstash, Kibana), Prometheus, Grafana, CloudWatch, New Relic, Splunk, Nagios, SLO/SLA Tracking, Dashboard Design
Security & Compliance: Prisma Cloud, Snyk, Checkmarx, SonarQube, IAM Policies, Security Groups, Hardened AMIs, Vulnerability Scanning, Least Privilege Access
Databases: Amazon DynamoDB, MySQL, MongoDB, MSSQL, Amazon RDS, Amazon Redshift, Database Migration
Web & Application Servers: Apache Tomcat, IBM WebSphere, Oracle WebLogic, JBoss, Apache HTTP Server, Reverse Proxy, SSL Termination
Operating Systems: Red Hat Enterprise Linux (RHEL), Ubuntu, CentOS, CoreOS, Windows Server, Linux Troubleshooting, Performance Tuning
Additional Technologies: REST API, Confluent Kafka, Swagger, Boto3, DevOps Automation, VMware Cloud, Blue/Green Deployment, Canary Deployment
DevOps Practices: Continuous Integration, Continuous Deployment, Infrastructure as Code, Configuration Management, Disaster Recovery, Post-Mortem Analysis, RCA (Root Cause Analysis), Test-Driven Development (TDD), Cost Optimization
EDUCATION
Bachelor of Science in Computer Science
Jawaharlal Nehru Technological University Hyderabad (JNTUH), India
PROFESSIONAL EXPERIENCE
Verizon, Texas, USA
Senior DevOps Engineer October 2025 – Present
Key Responsibilities & Achievements:
•Reduced Mean Time to Resolution (MTTR) by 30% for high-traffic applications by implementing RCA-driven remediation strategies and proactive monitoring using CloudWatch and New Relic.
•Integrated vulnerability scanning tools (Prisma Cloud and Snyk) and hardened AMIs into Jenkins CI/CD pipelines reducing pre-production security vulnerabilities by 45% and improving overall security posture.
•Engineered centralized logging and monitoring solutions using ELK Stack (Elasticsearch, Logstash, Kibana) and New Relic reducing critical alert noise by 25% through intelligent threshold tuning and custom dashboards.
•Developed Python and Shell scripts to automate log rotation and service self-healing mechanisms eliminating 10+ hours of manual toil per month and improving system reliability.
•Eliminated configuration drift across Development, Staging, and Production environments by implementing strict Infrastructure as Code (Terraform) standards with version control and code reviews.
•Orchestrated Blue/Green and Canary deployments on AWS enabling zero-downtime releases, lowering deployment-related failure rates, and ensuring seamless user experience during updates.
•Maintained high-scale Jenkins pipelines implementing automated approval gates and parallelized build stages to increase deployment velocity and reduce build times by 40%.
•Leveraged CloudWatch metrics and AWS Trusted Advisor recommendations to right-size EC2 instances and RDS clusters achieving 15% reduction in monthly cloud spend while maintaining performance SLAs.
•Spearheaded quarterly disaster recovery (DR) drills and automated backup/restore validation processes significantly reducing Recovery Time Objective (RTO) and Recovery Point Objective (RPO) for mission-critical services.
•Led weekly post-mortem meetings for high-severity incidents documenting root causes and implementing preventive actions that reduced repeat outages by 50%.
Environment: AWS (EC2, ECS, RDS, CloudWatch, Trusted Advisor), Terraform, Jenkins, Python, Shell Scripting, ELK Stack, New Relic, Prisma Cloud, Snyk, Blue/Green Deployment, Canary Deployment, Disaster Recovery
Equifax, Georgia, USA
DevOps Engineer October 2024 – October 2025
Key Responsibilities & Achievements:
•Architected secure hybrid-cloud network between on-premise data centers and AWS VPCs ensuring compliance with strict financial data residency laws and regulatory requirements including PCI-DSS and SOC 2.
•Developed custom Python utilities to parse complex JSON logs and optimize Elasticsearch mappings improving search performance by 40% and reducing query response time for log analysis.
•Implemented Test-Driven Development (TDD) practices for infrastructure scripts using Python unit tests to reduce CI pipeline regressions by 20% and improve infrastructure code quality.
•Optimized Apache and Tomcat configurations for reverse proxy and SSL termination enhancing application security, request throughput, and overall system performance.
•Developed modular Terraform templates and Ansible playbooks to provision standardized AWS stacks across global regions ensuring consistency and reducing infrastructure deployment time.
•Streamlined S3 data movements and backups using Boto3 Python scripts ensuring data durability, reducing manual storage management, and automating disaster recovery processes.
•Integrated Checkmarx and SonarQube into CI/CD pipelines to enforce code quality standards and identify open-source vulnerabilities early in the development lifecycle.
•Architected Kubernetes manifests (Deployments, Services, Horizontal Pod Autoscaler) to migrate monolithic services to Amazon EKS improving horizontal scalability and resource utilization.
•Standardized CI/CD runtime by migrating Jenkins build agents to Kubernetes pods reducing build latency and eliminating environment inconsistencies across development teams.
•Designed high-fidelity Grafana dashboards to visualize Prometheus metrics improving accuracy of SLO/SLA tracking for product teams and enabling data-driven decision making.
Environment: AWS (VPC, EC2, S3), Terraform, Ansible, Python, Boto3, Elasticsearch, Kubernetes, Amazon EKS, Jenkins, Checkmarx, SonarQube, Prometheus, Grafana, Apache, Tomcat, TDD
CLS Bank, Dallas, Texas, USA
DevOps and Cloud Engineer January 2023 – September 2024
Key Responsibilities & Achievements:
•Designed AWS Auto Scaling groups and Launch Templates with custom hardened AMIs to ensure 99.99% availability for core banking applications serving millions of daily transactions.
•Enforced "Least Privilege" access control by designing granular IAM roles and policies ensuring 100% compliance during annual security audits and regulatory assessments.
•Tuned CloudWatch alarms and metric filters reducing false-positive alerts by 35% while accelerating anomaly detection and improving incident response time.
•Built AWS Lambda functions to automate EMR cluster lifecycle management reducing idle resource costs by $2,000 per month and optimizing big data processing workflows.
•Implemented end-to-end log pipelines from AWS CloudWatch Logs to Elasticsearch enabling real-time analytics for security operations team and improving threat detection capabilities.
•Facilitated Business Intelligence (BI) reporting by automating data ingestion from on-premise SQL databases into Amazon Redshift enabling advanced analytics and data warehousing.
•Developed Ansible and Terraform scripts to manage hybrid environments across OpenStack and AWS reducing setup time by 60% and ensuring infrastructure consistency.
•Developed Proof-of-Concepts (POCs) for containerizing monolithic banking applications using Docker facilitating smoother cloud-native transition and modernization strategy.
•Authored optimized multi-stage Dockerfiles reducing container image sizes by 50% and accelerating image pull times in production improving deployment efficiency.
•Re-engineered legacy ANT build scripts into Maven modules standardizing build lifecycle across diverse Java-based modules and improving build consistency.
Environment: AWS (EC2, Lambda, EMR, Redshift, CloudWatch, Auto Scaling, IAM), Terraform, Ansible, Docker, OpenStack, Elasticsearch, Python, Maven, Ant
Cyient Ltd, Hyderabad, India
Senior DevOps/AWS Engineer June 2019 – January 2022
Key Responsibilities & Achievements:
•Designed and deployed multiple applications using AWS services including EC2, Route53, S3, RDS, DynamoDB, SNS, SQS, and IAM with high availability and fault tolerance architecture.
•Authored CloudFormation templates to create VPCs, subnets, NAT gateways, and security groups for scalable, secure application and database deployments across multiple availability zones.
•Automated continuous deployment, application server setup, and stack monitoring using Ansible playbooks integrated with Jenkins enabling rapid and consistent deployments.
•Demonstrated use of Ansible and Ansible Tower to standardize and automate software delivery processes across teams improving deployment consistency and reducing manual errors.
•Implemented automated deployments on AWS by creating IAM roles and policies and integrating Jenkins with AWS CodeDeploy for seamless application delivery.
•Deployed applications on application servers including Apache Tomcat, JBoss, IBM WebSphere, and Oracle WebLogic to support diverse enterprise workloads and application requirements.
•Used Jenkins with AWS CodeDeploy and Chef for unattended instance bootstrapping and application deployment in AWS ensuring consistent environment configuration.
•Designed distributed private cloud solutions using Kubernetes with Docker on CoreOS to host containerized applications enabling microservices architecture.
•Operated Kubernetes as platform-as-a-service on private and public cloud running on VMware Cloud infrastructure providing scalable container orchestration.
•Wrote Python scripts using Boto3 to push data from DynamoDB to MySQL databases to support downstream processing and reporting requirements.
Environment: AWS (EC2, S3, RDS, DynamoDB, SNS, SQS, Route53, IAM, CodeDeploy), CloudFormation, Ansible, Ansible Tower, Jenkins, Chef, Docker, Kubernetes, CoreOS, VMware Cloud, Python, Boto3, Tomcat, JBoss, WebSphere, WebLogic
Origin IT Solutions, Hyderabad, India
DevOps Engineer September 2017 – May 2019
Key Responsibilities & Achievements:
•Implemented EC2 Auto Scaling and Elastic Load Balancing to handle 3x traffic spikes during seasonal peak periods ensuring application availability and optimal user experience.
•Designed reusable CloudFormation templates to provision tiered environments (Web, Application, Database) ensuring networking and security parity across all environments.
•Enforced cloud security best practices by configuring IAM roles, policies, and Security Group rules for public and private subnets implementing defense-in-depth strategy.
•Configured Route53 latency-based and failover routing policies to optimize global application access and ensure high availability across multiple regions.
•Established organization's first automated CI/CD pipeline using Jenkins and Maven reducing manual release time by 70% and enabling continuous delivery.
•Managed 50+ Red Hat Enterprise Linux (RHEL) and Windows Server instances via central Puppet Master ensuring consistent patch levels and configuration management across fleet.
•Created reusable Puppet module skeleton that became internal standard for all server configuration code improving automation consistency.
•Performed deep-level Linux troubleshooting and performance tuning utilizing Nagios for real-time health monitoring and alerting ensuring system stability.
Environment: AWS (EC2, Auto Scaling, ELB, Route53, CloudFormation, IAM, Security Groups), Jenkins, Maven, Puppet, Puppet Master, Nagios, RHEL, Windows Server, CI/CD