Senior DevOps Engineer Cloud, Automation, SRE

Location: Forney, TX
Posted: June 15, 2026

Resume:

SHASHI VANGA

Mobile: +1-203-***-**** Email: **************@*****.***

Location: Texas, USA

PROFESSIONAL SUMMARY

Senior DevOps Engineer with over 8 years of experience designing and operating scalable cloud platforms on AWS and Azure. Adept at streamlining software delivery through automated pipelines, container orchestration, and infrastructure automation using tools like Terraform, Kubernetes, and Docker. Demonstrates consistent success in improving system reliability, tightening security practices across the development lifecycle, and reducing cloud infrastructure costs through smart resource management. Brings strong expertise in monitoring and observability, hybrid cloud networking, and leading incident response efforts that have significantly cut downtime and prevented recurring issues across enterprise environments. Skilled in architecting zero-downtime deployment strategies and driving cultural shifts toward site reliability engineering principles across cross-functional teams. Passionate about eliminating manual toil through intelligent automation and building resilient, self-healing infrastructure that scales confidently with business growth.

TECHNICAL SKILLS

Programming & Scripting Languages: Python, Java, JavaScript, Bash/Shell Scripting, PowerShell, YAML, JSON, XML

Cloud Platforms: Amazon Web Services (AWS), Microsoft Azure, OpenStack, Hybrid Cloud Architecture

AWS Services: EC2, ECS, EKS, Lambda, S3, RDS, DynamoDB, VPC, Route53, CloudFormation, CloudWatch, IAM, SNS, SQS, Auto Scaling, Elastic Load Balancing, EMR, Redshift, CodeDeploy, Trusted Advisor

Infrastructure as Code (IaC): Terraform, AWS CloudFormation, Infrastructure Automation, Modular Templates, State Management

Configuration Management: Ansible, Ansible Tower, Chef, Puppet, Puppet Master, Configuration Automation

CI/CD & Build Tools: Jenkins, Git, GitHub, Bitbucket, Maven, Gradle, Ant, Nexus, TFS, Bamboo, Hudson, AWS CodeDeploy, Pipeline Automation

Containerization & Orchestration: Docker, Kubernetes, Amazon EKS, AWS ECS, Kubernetes Manifests (Deployments, Services, HPA), Container Registry, Multi-stage Dockerfiles

Monitoring & Logging: ELK Stack (Elasticsearch, Logstash, Kibana), Prometheus, Grafana, CloudWatch, New Relic, Splunk, Nagios, SLO/SLA Tracking, Dashboard Design

Security & Compliance: Prisma Cloud, Snyk, Checkmarx, SonarQube, IAM Policies, Security Groups, Hardened AMIs, Vulnerability Scanning, Least Privilege Access

Databases: Amazon DynamoDB, MySQL, MongoDB, MSSQL, Amazon RDS, Amazon Redshift, Database Migration

Web & Application Servers: Apache Tomcat, IBM WebSphere, Oracle WebLogic, JBoss, Apache HTTP Server, Reverse Proxy, SSL Termination

Operating Systems: Red Hat Enterprise Linux (RHEL), Ubuntu, CentOS, CoreOS, Windows Server, Linux Troubleshooting, Performance Tuning

Additional Technologies: REST API, Confluent Kafka, Swagger, Boto3, DevOps Automation, VMware Cloud, Blue/Green Deployment, Canary Deployment

DevOps Practices: Continuous Integration, Continuous Deployment, Infrastructure as Code, Configuration Management, Disaster Recovery, Post-Mortem Analysis, RCA (Root Cause Analysis), Test-Driven Development (TDD), Cost Optimization

EDUCATION

Bachelor of Science in Computer Science

Jawaharlal Nehru Technological University Hyderabad (JNTUH), India

PROFESSIONAL EXPERIENCE

Verizon, Texas, USA

Senior DevOps Engineer October 2025 – Present

Key Responsibilities & Achievements:

•Reduced Mean Time to Resolution (MTTR) by 30% for high-traffic applications by implementing RCA-driven remediation strategies and proactive monitoring using CloudWatch and New Relic.

•Integrated vulnerability scanning tools (Prisma Cloud and Snyk) and hardened AMIs into Jenkins CI/CD pipelines, reducing pre-production security vulnerabilities by 45% and improving overall security posture.

•Engineered centralized logging and monitoring solutions using ELK Stack (Elasticsearch, Logstash, Kibana) and New Relic, reducing critical alert noise by 25% through intelligent threshold tuning and custom dashboards.

•Developed Python and Shell scripts to automate log rotation and service self-healing mechanisms, eliminating 10+ hours of manual toil per month and improving system reliability.

•Eliminated configuration drift across Development, Staging, and Production environments by implementing strict Infrastructure as Code (Terraform) standards with version control and code reviews.

•Orchestrated Blue/Green and Canary deployments on AWS, enabling zero-downtime releases, lowering deployment-related failure rates, and ensuring seamless user experience during updates.

•Maintained high-scale Jenkins pipelines, implementing automated approval gates and parallelized build stages to increase deployment velocity and reduce build times by 40%.

•Leveraged CloudWatch metrics and AWS Trusted Advisor recommendations to right-size EC2 instances and RDS clusters, achieving a 15% reduction in monthly cloud spend while maintaining performance SLAs.

•Spearheaded quarterly disaster recovery (DR) drills and automated backup/restore validation processes, significantly reducing Recovery Time Objective (RTO) and Recovery Point Objective (RPO) for mission-critical services.

•Led weekly post-mortem meetings for high-severity incidents, documenting root causes and implementing preventive actions that reduced repeat outages by 50%.

Environment: AWS (EC2, ECS, RDS, CloudWatch, Trusted Advisor), Terraform, Jenkins, Python, Shell Scripting, ELK Stack, New Relic, Prisma Cloud, Snyk, Blue/Green Deployment, Canary Deployment, Disaster Recovery

Equifax, Georgia, USA

DevOps Engineer October 2024 – October 2025

Key Responsibilities & Achievements:

•Architected a secure hybrid-cloud network between on-premise data centers and AWS VPCs, ensuring compliance with strict financial data residency laws and regulatory requirements including PCI-DSS and SOC 2.

•Developed custom Python utilities to parse complex JSON logs and optimize Elasticsearch mappings, improving search performance by 40% and reducing query response time for log analysis.

•Implemented Test-Driven Development (TDD) practices for infrastructure scripts using Python unit tests to reduce CI pipeline regressions by 20% and improve infrastructure code quality.

•Optimized Apache and Tomcat configurations for reverse proxy and SSL termination, enhancing application security, request throughput, and overall system performance.

•Developed modular Terraform templates and Ansible playbooks to provision standardized AWS stacks across global regions, ensuring consistency and reducing infrastructure deployment time.

•Streamlined S3 data movements and backups using Boto3 Python scripts, ensuring data durability, reducing manual storage management, and automating disaster recovery processes.

•Integrated Checkmarx and SonarQube into CI/CD pipelines to enforce code quality standards and identify open-source vulnerabilities early in the development lifecycle.

•Architected Kubernetes manifests (Deployments, Services, Horizontal Pod Autoscaler) to migrate monolithic services to Amazon EKS, improving horizontal scalability and resource utilization.

•Standardized the CI/CD runtime by migrating Jenkins build agents to Kubernetes pods, reducing build latency and eliminating environment inconsistencies across development teams.

•Designed high-fidelity Grafana dashboards to visualize Prometheus metrics, improving the accuracy of SLO/SLA tracking for product teams and enabling data-driven decision making.

Environment: AWS (VPC, EC2, S3), Terraform, Ansible, Python, Boto3, Elasticsearch, Kubernetes, Amazon EKS, Jenkins, Checkmarx, SonarQube, Prometheus, Grafana, Apache, Tomcat, TDD

CLS Bank, Dallas, Texas, USA

DevOps and Cloud Engineer January 2023 – September 2024

Key Responsibilities & Achievements:

•Designed AWS Auto Scaling groups and Launch Templates with custom hardened AMIs to ensure 99.99% availability for core banking applications serving millions of daily transactions.

•Enforced "Least Privilege" access control by designing granular IAM roles and policies, ensuring 100% compliance during annual security audits and regulatory assessments.

•Tuned CloudWatch alarms and metric filters, reducing false-positive alerts by 35% while accelerating anomaly detection and improving incident response time.

•Built AWS Lambda functions to automate EMR cluster lifecycle management, reducing idle resource costs by $2,000 per month and optimizing big data processing workflows.

•Implemented end-to-end log pipelines from AWS CloudWatch Logs to Elasticsearch, enabling real-time analytics for the security operations team and improving threat detection capabilities.

•Facilitated Business Intelligence (BI) reporting by automating data ingestion from on-premise SQL databases into Amazon Redshift, enabling advanced analytics and data warehousing.

•Developed Ansible and Terraform scripts to manage hybrid environments across OpenStack and AWS, reducing setup time by 60% and ensuring infrastructure consistency.

•Developed Proof-of-Concepts (POCs) for containerizing monolithic banking applications using Docker, facilitating a smoother cloud-native transition and supporting the modernization strategy.

•Authored optimized multi-stage Dockerfiles, reducing container image sizes by 50% and accelerating image pull times in production to improve deployment efficiency.

•Re-engineered legacy Ant build scripts into Maven modules, standardizing the build lifecycle across diverse Java-based modules and improving build consistency.

Environment: AWS (EC2, Lambda, EMR, Redshift, CloudWatch, Auto Scaling, IAM), Terraform, Ansible, Docker, OpenStack, Elasticsearch, Python, Maven, Ant

Cyient Ltd, Hyderabad, India

Senior DevOps/AWS Engineer June 2019 – January 2022

Key Responsibilities & Achievements:

•Designed and deployed multiple applications using AWS services including EC2, Route53, S3, RDS, DynamoDB, SNS, SQS, and IAM in a highly available, fault-tolerant architecture.

•Authored CloudFormation templates to create VPCs, subnets, NAT gateways, and security groups for scalable, secure application and database deployments across multiple availability zones.

•Automated continuous deployment, application server setup, and stack monitoring using Ansible playbooks integrated with Jenkins, enabling rapid and consistent deployments.

•Used Ansible and Ansible Tower to standardize and automate software delivery processes across teams, improving deployment consistency and reducing manual errors.

•Implemented automated deployments on AWS by creating IAM roles and policies and integrating Jenkins with AWS CodeDeploy for seamless application delivery.

•Deployed applications on application servers including Apache Tomcat, JBoss, IBM WebSphere, and Oracle WebLogic to support diverse enterprise workloads and application requirements.

•Used Jenkins with AWS CodeDeploy and Chef for unattended instance bootstrapping and application deployment in AWS, ensuring consistent environment configuration.

•Designed distributed private cloud solutions using Kubernetes with Docker on CoreOS to host containerized applications, enabling a microservices architecture.

•Operated Kubernetes as a platform-as-a-service across private and public cloud environments running on VMware Cloud infrastructure, providing scalable container orchestration.

•Wrote Python scripts using Boto3 to push data from DynamoDB to MySQL databases to support downstream processing and reporting requirements.

Environment: AWS (EC2, S3, RDS, DynamoDB, SNS, SQS, Route53, IAM, CodeDeploy), CloudFormation, Ansible, Ansible Tower, Jenkins, Chef, Docker, Kubernetes, CoreOS, VMware Cloud, Python, Boto3, Tomcat, JBoss, WebSphere, WebLogic

Origin IT Solutions, Hyderabad, India

DevOps Engineer September 2017 – May 2019

Key Responsibilities & Achievements:

•Implemented EC2 Auto Scaling and Elastic Load Balancing to handle 3x traffic spikes during seasonal peak periods, ensuring application availability and optimal user experience.

•Designed reusable CloudFormation templates to provision tiered environments (Web, Application, Database), ensuring networking and security parity across all environments.

•Enforced cloud security best practices by configuring IAM roles, policies, and Security Group rules for public and private subnets, implementing a defense-in-depth strategy.

•Configured Route53 latency-based and failover routing policies to optimize global application access and ensure high availability across multiple regions.

•Established the organization's first automated CI/CD pipeline using Jenkins and Maven, reducing manual release time by 70% and enabling continuous delivery.

•Managed 50+ Red Hat Enterprise Linux (RHEL) and Windows Server instances via a central Puppet Master, ensuring consistent patch levels and configuration management across the fleet.

•Created a reusable Puppet module skeleton that became the internal standard for all server configuration code, improving automation consistency.

•Performed deep-level Linux troubleshooting and performance tuning, utilizing Nagios for real-time health monitoring and alerting to ensure system stability.

Environment: AWS (EC2, Auto Scaling, ELB, Route53, CloudFormation, IAM, Security Groups), Jenkins, Maven, Puppet, Puppet Master, Nagios, RHEL, Windows Server, CI/CD
