Senior DevOps Engineer - Azure, Kubernetes, Terraform

Location:
Queens, NY, 11368
Salary:
$140,000
Posted:
March 13, 2026

Resume:

KASHIF RASHID

DEVOPS ENGINEER AZURE CLOUD KUBERNETES TERRAFORM

Dallas, TX • 469-***-**** • **********@*****.***

PROFESSIONAL SUMMARY

DevOps / Infrastructure Engineer with 8+ years of experience designing, automating, and operating large-scale Linux and cloud infrastructure across enterprise environments. Specialized in Microsoft Azure, Terraform Infrastructure as Code, CI/CD pipeline automation, Kubernetes orchestration, and cloud observability. Proven history of improving deployment speed, strengthening security posture, and increasing system reliability through automation, scalable cloud architecture, and DevOps best practices. Experienced supporting high-availability production environments, multi-subscription cloud platforms, and containerized microservices.

TECHNICAL SKILLS

Cloud: Microsoft Azure, AWS

Infrastructure as Code (IaC): Terraform, ARM Templates, Bicep

CI/CD: Azure DevOps, GitHub Actions, Jenkins, GitLab

Containers: Docker, Kubernetes, AKS, Helm

Monitoring & Observability: Prometheus, Grafana, Dynatrace, Azure Monitor, CloudWatch

Configuration Management: Ansible, Puppet

Operating Systems: RHEL, CentOS, Ubuntu

Scripting: Bash, Python, PowerShell

Networking & Security: VNet, NSG, Private Endpoints, Load Balancers, RBAC

Platform Engineering: DevOps Automation, Infrastructure Automation, SRE, Cloud Architecture

PROFESSIONAL EXPERIENCE

COSTCO WHOLESALE — INFRASTRUCTURE ENGINEER

Dallas, TX Apr 2022 – Present

• Managed identity and access across enterprise environments using MS Entra ID including user provisioning, RBAC assignments, SSPR configuration, and external user onboarding.

• Enforced governance at scale by applying Azure Policy, tags, and resource locks, and by administering resource groups, subscriptions, and management groups.

• Designed and deployed secure Azure networking architecture including VNet, subnets, VNet peering, NSGs, private endpoints, and Azure Bastion.

• Deployed highly available computing infrastructure using Availability Zones, Availability Sets, and Virtual Machine Scale Sets supporting production workloads.

• Automated infrastructure provisioning using Terraform modules enabling consistent deployments across dev, staging, and production environments.

• Reduced environment provisioning time by 70% through Infrastructure as Code automation.

• Implemented remote Terraform state using Azure Storage and Key Vault, improving collaboration and security.

• Integrated Terraform with CI/CD pipelines to prevent configuration drift and standardize cloud infrastructure.

• Built and maintained CI/CD pipelines using Azure DevOps and GitHub Actions supporting automated infrastructure and application deployments.

• Automated validation, plan, and deployment workflows for Terraform improving release reliability.

• Integrated artifact management and code validation/testing within CI pipelines, improving deployment quality.

• Supported automated deployment processes across multiple environments and microservices platforms.

• Managed a production Kubernetes cluster (API Server, etcd, Scheduler, Controller Manager, Ingress) supporting 100+ microservices and 200+ pods, improving platform reliability and uptime.

• Implemented Pod Autoscaling using CPU, memory, and Prometheus metrics improving workload scalability.

• Configured Kubernetes networking, CNI policies and persistent storage integration using CSI drivers.

• Maintained cluster performance and node stability through Kubelet optimization and infrastructure tuning.

• Implemented monitoring and alerting solutions using Prometheus, Grafana, and Azure Monitor.

• Developed PromQL queries and dashboards to track application performance, latency, and system health.

• Configured alerts and action groups improving incident detection and operational response times.

• Monitored Kubernetes workloads and infrastructure metrics to maintain system reliability.

• Provisioned and secured Azure Storage Accounts with lifecycle policies and redundant configurations.

• Optimized storage performance using Hot, Cool, and Archive tiers aligned with enterprise data policies.

• Implemented backup and disaster recovery using Azure Backup and Site Recovery, ensuring business continuity.

• Enhanced data protection through snapshot management, versioning, and soft-delete policies.

• Implemented hybrid connectivity using Azure Arc, VPN, and ExpressRoute enabling unified management across environments.

• Troubleshot network connectivity using Azure Network Watcher and Connection Monitor.

• Configured internal and external load balancers ensuring high availability and traffic distribution.

• Improved network security posture using private endpoints and service endpoints.

WELLS FARGO — LINUX SYSTEM ENGINEER

NY Mar 2020 – Apr 2022

• Led migration to centralized identity using SSSD, Kerberos, and LDAP, implementing Sudo, HBAC, and MFA to strengthen access control across Linux servers.

• Automated provisioning using VMware templates and Ansible, improving deployment consistency and reducing manual build effort.

• Migrated GPFS file systems to NAS platforms improving scalability and reducing licensing costs.

• Developed Python automation using Azure SDK to improve infrastructure visibility across hybrid environments.

• Architected high-availability clusters using Pacemaker/Corosync with STONITH fencing, ensuring reliable failover for mission-critical workloads.

• Managed configuration automation across 1,000+ servers using Puppet with CIS/STIG compliance enforcement.

• Automated the deployment of infrastructure using Kickstart, templates, and AWS CloudFormation.

• Used Ansible ad-hoc commands, playbooks, and roles for orchestration, automated application deployment, and configuration management.

• Deployed Red Hat Insights Lightspeed for predictive analytics and proactive vulnerability remediation.

• Led OS migrations from RHEL 7 to RHEL 8 with structured validation procedures, minimizing operational downtime.

• Contributed to penetration testing remediation, resolving privilege-escalation and other vulnerabilities.

• Used Git for source code management and version control with tools such as GitLab.

• Created users and groups in AWS IAM and used IAM permissions to control access to resources.

• Created EC2 instances with various AMIs and configured applications on those instances.

• Created S3 buckets and managed policies, utilizing S3 and Glacier for storage and backup solutions.

• Built Docker images using Dockerfiles and managed snapshots, images, and Docker volumes.

• Managed Kubernetes manifest files to create reproducible builds and replicas for applications.

• Managed Dell PowerEdge and HP ProLiant servers using iDRAC and iLO interfaces.

• Patched Linux servers through Satellite, resolved system performance tickets with Red Hat support.

• Troubleshot latency and performed kernel tuning to achieve desired application behavior.

• Monitored network parameters and server resources and maintained maximum uptime using Nagios.

• Supported secure onboarding by producing technical documentation and facilitating KT sessions.

• Provided after-hours support, implemented on-call rotation to ensure timely incident response.

• Led disaster recovery planning and executed DR testing across multiple application environments.

WALMART — LINUX SYSTEM ADMINISTRATOR

Bentonville, AR Dec 2017 – Jan 2020

• Managed SAN infrastructure including LUN provisioning, multipath configuration, and FC zoning validation supporting enterprise storage environments.

• Created Bash scripts to manage users and groups, and implemented OpenSSH-based security for Red Hat servers.

• Deployed, monitored, and managed RHEL 6, 7 and CentOS-based infrastructure.

• Automated audit evidence collection, improving compliance readiness and reporting efficiency.

• Administered virtualization and implemented VMware provisioning and template building.

• Provided customer service support in a challenging environment, including hardware assembly, quality control, testing, and troubleshooting new applications.

• Resolved user-related issues, such as access and connectivity problems.

• Resolved tickets, incidents and change requests assigned through Jira.

• Set cron jobs to continuously monitor server health by checking load averages.

• Maintained and troubleshot system performance, running processes, and application issues.

• Configured RAID levels and managed storage with LVM to provide storage elasticity.

• Managed Ext3, Ext4, and XFS file systems, including error resolution.

• Addressed remote server issues through iLO and iDRAC interfaces.

• Used networking tools such as ping, tcpdump, and traceroute to diagnose interrupted data flows.

• Created, maintained, and synchronized YUM/RPM repositories.

• Managed network connections using iptables and firewalls, and monitored network traffic and issues.

• Troubleshot critical OS issues related to performance, networking, NFS shares, and file systems.

• Managed packages and patches, including adding/removing software and updating patches.

• Configured NIC teaming for redundancy and load balancing.

• Assigned and maintained ACLs and file permissions, and scheduled tasks using cron jobs.

• Delivered cost-effective, secure, and fit-for-purpose solutions tailored to organizational needs.

• Provided on-call and weekend support, resolving tickets via Remedy.

• Conducted system monitoring, verified resource integrity, and reviewed OS and application logs.

EDUCATION

Gomal University, D.I. Khan, Pakistan

Major: Master of Commerce, 2020 – 2022

Microsoft Certified: Azure Administrator Associate (757N9C-51B63B)

KEY ACHIEVEMENTS

• Reduced cloud infrastructure deployment time by 70% through Terraform automation.

• Built scalable Kubernetes environments supporting enterprise microservices platforms.

• Improved system monitoring and reliability using Prometheus and Grafana observability stack.

• Strengthened Azure governance and security across multiple cloud environments.
