Post Job Free
Sign in

DevOps Engineer - Kubernetes - AWS Cloud - Terraform Experts

Location:
Vijayawada, Andhra Pradesh, India
Posted:
April 20, 2026

Contact this candidate

Resume:

SHAIK HUSSAIN BASHA

DevOps Engineer · Kubernetes · AWS Cloud · Linux · Virtualization · IaC · CI/CD

+91-953**-***** ************@*****.*** Kurnool, AP Immediate Joiner PROFESSIONAL SUMMARY

Results-driven DevOps and Cloud Engineer with 6+ years of experience across AWS cloud infrastructure, Ku- bernetes (EKS) container orchestration, Linux/AIX system administration, and enterprise virtualization

(VMware, KVM, OpenStack, HMC/IBM AIX). Proven expertise in provisioning and managing production-grade environ- ments using Terraform, building robust CI/CD pipelines (Jenkins, GitHub Actions, ArgoCD), and deep troubleshoot- ing of Kubernetes workloads (CrashLoopBackOff, NodeNotReady, ImagePullBackOff, PVC latency, service-routing fail- ures). Strong advocate of automation, DevSecOps practices, high-availability architecture, and SRE operational models. TECHNICAL SKILLS

• Kubernetes & Container Orchestration: EKS, deployments, services, Ingress (NGINX/ALB), RBAC, NetworkPolicies, ConfigMaps, Secrets, HPA, VPA, Cluster Autoscaler, PV/PVC, StorageClass, Helm, ArgoCD (GitOps); troubleshooting: CrashLoopBackOff, ImagePullBackOff, NodeNotReady, service not routing traffic, storage latency, CNI/kube-proxy issues

• AWS Cloud: EC2, EKS, ECS (Fargate & EC2), S3, IAM, VPC, RDS, ALB/NLB, Auto Scaling, CloudWatch, Lambda, CloudFront, KMS, GuardDuty, Security Hub, Cost Explorer, Savings Plans, Reserved Instances; multi-account architecture, cross-account IAM federation

• Virtualization Technologies: VMware (vSphere/ESXi, vCenter, VM provisioning, snapshots, vMotion, HA/DRS clusters); KVM (libvirt, virsh, VM lifecycle management, bridge/NAT networking); OpenStack (Nova, Neutron, Cinder, Glance — instance provisioning and tenant management); HMC / IBM AIX (Hardware Management Console, LPAR management, VIO server, AIX patching and performance tuning)

• Infrastructure as Code: Terraform (modules, remote state, reusable components, multi-environment), Terragrunt, Ansible; AWS + Snowflake providers; peer-reviewed PR-based apply workflows; Checkov/tfsec IaC security scanning

• CI/CD & DevOps: Jenkins, GitHub Actions, ArgoCD, GitOps, AWS CodePipeline; Blue-Green & Canary deployments, automated approval gates, rollback workflows, SonarQube quality gates

• Monitoring & Observability: Datadog (metrics, logs, APM), Prometheus, Grafana, CloudWatch, ELK/EFK Stack; SLO/SLA alerting, pipeline latency monitoring, node/pod health dashboards

• Linux & Systems: RHEL, CentOS, Ubuntu, Amazon Linux, IBM AIX; OS patching, performance tuning, log monitoring, capacity planning, runbook authoring, on-call incident response

• Security & Secrets Management: HashiCorp Vault, AWS Secrets Manager, IAM least-privilege, OPA, Kyverno, Veracode, Qualys, CrowdStrike, Splunk, Delinea, branch protection, CODEOWNERS, credential rotation

• Scripting & Automation: Bash/Shell scripting, Python; automated patching, deployment validation, smoke tests, adminis- trative task automation

PROFESSIONAL EXPERIENCE

Senior Associate II, System Administrator May 2024 – Dec 2025 Kyndryl Solutions Pvt. Ltd. — Project: NPCI (National Payments Corporation of India) Hyderabad, IN

• Kubernetes Cluster Management (EKS): Designed, deployed, and operated production-grade Amazon EKS clusters for financial-grade workloads; managed cluster upgrades, node group scaling, pod networking, and multi-tenant namespace isolation with RBAC; configured NGINX and AWS ALB Ingress controllers for path-based and host-based routing.

• Kubernetes Autoscaling & Resource Optimization: Implemented HPA (CPU/memory and custom metrics via Prometheus adapter), VPA recommendations, and Cluster Autoscaler across multiple node groups — reducing over-provisioning by 20% while maintaining 100% SLA compliance.

• Advanced Kubernetes Troubleshooting: Led live production incident diagnosis and resolution: CrashLoopBackOff (mis- configured liveness/readiness probes, OOM limits, init container failures); ImagePullBackOff (ECR authentication, registry pull secrets, image tag drift); NodeNotReady (kubelet crashes, disk pressure, CNI misconfiguration, certificate expiry); service not routing traffic (endpoint slice drift, kube-proxy iptables rules, label selector mismatches); storage latency on PVC-backed stateful workloads (EBS IOPS throttling, StorageClass misconfiguration, ReadWriteOnce contention).

• AWS Cloud Infrastructure Management: Managed enterprise-scale AWS environments spanning EC2, ECS (Fargate & EC2 launch types), VPC, IAM, RDS, ALB/NLB, Auto Scaling, S3, KMS, GuardDuty, and Security Hub; administered 1,000+ EC2 instances with zero SLA breaches; architected multi-account AWS environments with cross-account IAM role federation.

• Infrastructure as Code (Terraform): Provisioned modular, reusable Terraform infrastructure (EKS, ECS, EC2, VPC, IAM, RDS, S3, ALB) with remote state backends and multi-environment support; enforced peer-reviewed PR apply workflows and integrated Checkov/tfsec security scanning into delivery pipelines; applied Terraform AWS and Snowflake providers across dev, staging, and production.

• CI/CD Pipeline Design (GitOps): Built and maintained GitOps-based delivery using ArgoCD, Jenkins, and GitHub Actions with automated build, test, and deployment workflows; implemented Blue-Green and Canary release strategies with automated approval gates, rollback procedures, and SonarQube quality gates.

• Monitoring, Observability & MTTR: Configured Datadog and Prometheus/Grafana for cluster health, ECS task health, pod metrics, and APM tracing; automated alerting on HPA scaling events, node health, and pipeline latency anomalies — improving MTTR by 25%.

• DevSecOps & Secrets Management: Integrated OPA/Kyverno admission webhooks for policy-as-code enforcement; archi- tected HashiCorp Vault and AWS Secrets Manager for secure credential handling with automated rotation and least-privilege IAM; enforced branch protection, CODEOWNERS, and required reviewer policies across all repositories.

• AWS Cost Optimization: Implemented rightsizing strategies for EC2/ECS workloads, Reserved Instances and Savings Plans recommendations, tagging for cost allocation, and budget alerting via Cost Explorer. 1

• SRE & Operational Excellence: Authored pre/post-deployment runbooks, smoke tests, and incident playbooks — cutting incident resolution time by 30%; participated in Agile/SRE sprints, architecture reviews; mentored junior DevOps engineers on Kubernetes and cloud-native best practices.

Linux & AIX System Administrator Sep 2022 – Apr 2024 Canopus GBS Private Limited — Client: Kyndryl Solutions Pvt. Ltd. Bangalore, IN

• Linux Administration (RHEL/CentOS/Ubuntu): Managed production Linux systems including OS patching, kernel tuning, filesystem management, user/group administration, and SLA-bound incident response; implemented automated patching workflows using Ansible and Bash scripts.

• KVM & OpenStack Virtualization: Provisioned and administered KVM virtual machines using libvirt/virsh — managing VM lifecycle, bridge and NAT networking, storage pools, and snapshots; operated OpenStack environments managing Nova compute instances, Neutron networking (tenant networks, floating IPs, security groups), Cinder block storage, and Glance images for internal development teams.

• Infrastructure Collaboration & Automation: Collaborated with DevOps teams on infrastructure migration, deployment automation, and release cycles; automated repetitive administrative tasks using Python and Bash scripting, improving team operational efficiency; supported Git-based branching strategies for deployment pipelines. Linux & AIX System Administrator Mar 2019 – Sep 2022 Netscape Software Limited — Client: Aditya Birla Group Bangalore, IN

• IBM AIX & HMC Administration: Administered IBM AIX servers managed via HMC (Hardware Management Console); managed LPAR (Logical Partition) creation, modification, and decommission; configured VIO (Virtual I/O Server) for shared storage and network resources; performed AIX system patching (TL/SP updates via NIM), performance monitoring with topas/nmon, and capacity planning.

• VMware Virtualization: Managed VMware vSphere/ESXi environments including VM provisioning, snapshot manage- ment, vMotion live migration, and vCenter HA/DRS cluster configuration for production application workloads; conducted regular DR testing and failover validation.

• Linux Production Operations: Administered RHEL/CentOS servers for mission-critical production environments; reduced incident resolution time by 30% through proactive monitoring, runbook documentation, and pre/post-deployment validation scripts; conducted capacity planning and performance tuning for high-availability workloads. KEY PROJECTS

Kubernetes EKS Platform — GitOps CI/CD (NPCI)

Designed and operated production EKS clusters with ArgoCD-based GitOps, Helm chart management, RBAC, multi- tenant namespacing, HPA/VPA/Cluster Autoscaler, and Datadog observability. Built dev staging prod promotion pipelines with automated validation and rollback.

Infrastructure Automation — Terraform Multi-Environment AWS Built modular Terraform infrastructure (EKS, ECS, VPC, IAM, RDS, S3, ALB) with remote state, multi-account archi- tecture, and peer-reviewed apply governance. Integrated Checkov/tfsec IaC security scanning and tagging for full cost visibility.

OpenStack & KVM Private Cloud Operations

Operated OpenStack private cloud (Nova, Neutron, Cinder, Glance) and KVM hypervisor environments to support development and staging workloads; automated VM provisioning and network configuration using Bash/Python scripting and Ansible playbooks.

KEY ACHIEVEMENTS

• Zero SLA breaches across 1,000+ AWS EC2 instances and containerized EKS/ECS workloads in production financial envi- ronments.

• 25% MTTR improvement through Datadog/Prometheus/Grafana dashboards and automated alerting on EKS cluster health, HPA events, and pipeline anomalies.

• 30% faster incident resolution via Kubernetes troubleshooting playbooks (CrashLoopBackOff, NodeNotReady, PVC la- tency), runbook automation, and smoke test frameworks.

• 40% reduction in manual infrastructure effort via Terraform, Ansible, Bash, and Python automation across multi- environment AWS and virtualization platforms.

• 20% cost reduction on EKS/ECS compute through HPA/Cluster Autoscaler rightsizing, Reserved Instance recommendations, and tagging-based cost allocation.

• GitOps-first delivery: Led ArgoCD + Helm implementation enabling fully auditable, peer-reviewed infrastructure changes with complete release traceability across all environments. EDUCATION

Bachelor of Technology (B.Tech) — Computer Science & Engineering 2012 Jawaharlal Nehru Technological University (JNTU), Anantapur 2



Contact this candidate