Nida Asif
*********@*****.*** 469-***-****
Professional Summary
Over 8+ years of experience in DevOps, Site Reliability Engineering (SRE), and Cloud Engineering, designing and implementing scalable automation strategies across Linux and Windows environments. Strong expertise in multi-cloud platforms including AWS, Azure, and GCP, with automated infrastructure provisioning using Terraform (with modules and workspaces), AWS CloudFormation, Pulumi, and configuration management tools like Ansible, Chef, and Puppet.
Hands-on experience in building secure and scalable CI/CD pipelines using GitHub Actions, GitLab CI, Jenkins, Azure DevOps, and Argo CD; implemented progressive delivery strategies including blue/green deployments and canary releases with Kubernetes (EKS, AKS, GKE).
Strong expertise in scripting and automation using Python, Bash, Go, and YAML for Kubernetes Operators, GitOps workflows, and infrastructure pipelines; automated provisioning of containerized microservices using Helm, Kustomize, and Crossplane.
Proficient in system-level programming with C/C++ for optimizing infrastructure tools, debugging performance bottlenecks, and integrating low-latency automation into cloud-native environments.
Extensively used modern observability stacks including Datadog, Prometheus, Grafana, Loki, Tempo, OpenTelemetry, ELK, and AWS CloudWatch for real-time metrics, logs, tracing, and alerting; integrated APM and SLO dashboards into SRE practices.
Expert in source code management and collaboration using Git (GitOps), GitHub, Bitbucket, Azure Repos, and Atlassian suite (JIRA, Confluence); in-depth knowledge of policy-as-code (OPA/HashiCorp Sentinel), container security (Trivy, Aqua, Falco), and artifact management using JFrog Artifactory and GitHub Packages.
Professional Experience
HUB International, Austin, TX Aug 2023 - Present
Sr. DevOps & Site Reliability Engineer ML Infrastructure Engineer Multi-Cloud Environments
Engineered and automated multi-cloud infrastructure leveraging AWS, Azure, and GCP with Terraform, CloudFormation, and Python Boto3 SDK. Designed and provisioned secure, scalable environments with VPC, IAM roles, EC2 Auto Scaling Groups, EKS, Lambda, CloudFront, Route 53, S3, RDS, and EBS, ensuring industry-standard security and compliance using AWS Well-Architected Framework.
Implemented Infrastructure as Code (IaC) using Terraform modules and CloudFormation templates to deploy AWS Lambda, EC2, VPC, RDS, ALB, and EKS components, achieving a 90% reduction in manual configuration and speeding up resource provisioning.
Managed complex networking configurations including private/public subnets, NAT Gateways, VPC Peering, Security Groups, NACLs, and Transit Gateways to ensure secure inter-cloud and intra-cloud communication across AWS, GCP, and Azure.
Integrated and managed Ray Clusters to enable distributed computation for large-scale graph embeddings and semantic search workloads across cloud-native environments.
Designed and implemented end-to-end CI/CD pipelines using Jenkins, GitHub Actions, Maven, Nexus, Docker, and Helm for Kubernetes-based microservices deployments, ensuring seamless integration of GitOps workflows with ArgoCD and FluxCD.
Automated the artifact promotion process using Jenkins, integrating with Nexus and Sonatype, enabling automatic transitions across multiple AWS accounts (prod and non-prod), reducing release cycle time by 70%.
Built and managed Kubernetes clusters using Amazon EKS, Google GKE, and Azure AKS for container orchestration. Utilized Helm to package and deploy microservices while leveraging Kustomize for cluster-level configuration management. Secured Kubernetes environments by implementing RBAC, Network Policies, PodSecurityPolicies, and Secrets Management with Vault.
Implemented observability frameworks utilizing Prometheus for metrics scraping, Grafana for visualization, and Datadog APM for application monitoring. Developed real-time, customizable dashboards that improved team visibility into system performance, enabling proactive issue identification and resolution.
Integrated ELK (Elasticsearch, Logstash, Kibana) and EFK (Elasticsearch, Fluentd, Kibana) stacks for centralized logging, streamlining log aggregation and enabling advanced querying and alerting mechanisms. Set up Alertmanager and Prometheus Alerting to proactively monitor critical metrics, reducing alert fatigue and speeding up response time.
Optimized cloud monitoring by implementing AWS CloudWatch, X-Ray, and Datadog for end-to-end visibility into infrastructure and application performance, resulting in a 60% reduction in downtime through automated anomaly detection and real-time notifications.
Deployed OAuth2 and JWT-based authentication systems for secure API communication and user management. Automated token generation, validation, and expiration to enforce strict access control across distributed microservices and API gateways.
Implemented zero-trust security models using AWS IAM, Azure AD, Okta, and HashiCorp Vault for secrets management. Automated the rotation of credentials and management of access policies, enforcing principle of least privilege and reducing unauthorized access risks.
Adopted compliance-driven cloud security practices by integrating AWS Config, GuardDuty, AWS Inspector, and Azure Security Center to continuously monitor and enforce best security practices across multi-cloud environments.
Engineered and optimized data workflows with AWS Redshift, AWS Glue, Apache Airflow, and Apache Kafka for real-time data processing and ETL pipeline automation, improving data ingestion and processing speeds by 40%.
Automated the scaling of GPU workloads using NVIDIA CUDA, TensorFlow, and PyTorch in GCP and AWS environments, optimizing machine learning and data analytics workloads to run on GPU instances for faster computations and model training.
Enabled real-time debugging and root-cause analysis of high-volume ML traffic using structured logging (EFK) and distributed tracing, resulting in a 70% improvement in MTTD and MTTR.
Integrated infrastructure automation workflows for SAP Converged Cloud environments, aligning cloud-native deployment models with SAP’s hybrid cloud requirements.
Deployed and managed SUSE Linux-based SAP workloads, including OS-level configuration, security patching, and monitoring using Prometheus and ELK
Implemented data warehousing solutions with AWS Redshift and BigQuery for massive-scale data storage and querying, ensuring high availability and optimized performance for analytics workloads across cloud platforms.
Developed advanced automation scripts in Python, Shell, Bash, and Perl to automate deployment, configuration management, and system provisioning. Integrated Jenkins, Ansible, and Chef to build scalable and reproducible infrastructures.
Developed and optimized low-level system utilities in C/C++ for Linux environments to enhance infrastructure performance monitoring and resource management tasks.
Integrated C++ modules with existing DevOps pipelines for performance-critical components, enabling faster execution in automation workflows and reducing execution time by 30%.
Automated deployment pipelines to ensure seamless updates and application rollbacks, reducing downtime during deployment phases. Utilized AWS Lambda for event-driven automation and serverless workloads in the CI/CD process.
Implemented GitOps workflows using FluxCD, ArgoCD, and Helm, automating the deployment of Kubernetes resources directly from Git repositories and reducing manual intervention.
Medtronic, North Haven, CT Dec 2020 - Jul 2023
Senior Cloud DevOps Engineer Site Reliability Engineering (SRE) Cloud Infrastructure & Automation Specialist
Engineered and automated the deployment of containerized applications using Docker and Kubernetes on a multi-cloud infrastructure (AWS, Azure), implementing CI/CD pipelines for seamless deployment and scaling. Utilized Helm for managing Kubernetes manifests and Kustomize for declarative configuration management, reducing deployment time by 40%.
Integrated OAuth and JWT for secure authentication and authorization in microservices, ensuring token-based security for API communications. Designed robust SAML-based authentication for Azure Active Directory and RDS, enabling Single Sign-On (SSO) across enterprise environments, which improved security compliance.
Automated cloud infrastructure provisioning using Terraform and CloudFormation, managing AWS resources like EC2, RDS, VPC, IAM, and Lambda. Achieved full Infrastructure-as-Code (IaC) automation, reducing manual intervention by 90% and ensuring consistent environments across multiple regions.
Debugged and enhanced legacy C/C++ code used in backend services and monitoring agents, improving system reliability and memory efficiency in production environments.
Implemented TensorFlow Serving for model versioning and high-throughput predictions, integrating with feature stores and embedding layers in production pipelines.
Monitored system performance, logs, and resource utilization using CloudWatch and Nagios, proactively resolving issues and ensuring 99.9% uptime.
Implemented and managed observability using Prometheus, Grafana, and AWS CloudWatch for comprehensive monitoring and alerting, proactively identifying and resolving performance bottlenecks. Developed custom Datadog dashboards for real-time metrics visualization, improving incident response time by 30%.
Optimized Kubernetes clusters for high availability and performance using EKS on AWS, leveraging auto-scaling and self-healing mechanisms to ensure 99.99% uptime. Implemented CI/CD pipelines using Jenkins and GitLab, integrating with SonarQube for continuous code quality analysis and rapid delivery of production-ready features.
Collaborated with ML engineers to optimize feature computation and asset embedding pipelines using Kafka, AWS Lambda, and Python, reducing end-to-end inference time by 60%.
Led the migration of legacy applications to containerized environments, utilizing AWS Lambda, API Gateway, and ECS for serverless architectures. Improved resource utilization by 40% while reducing operational costs by 25%, and ensured smooth integration into the cloud-native ecosystem.
Ford Motor Company, Avon, OH Jan 2019 - Nov 2020
DevOps Engineer Linux Administrator Cloud Infrastructure Automation & CI/CD Expert
Designed and implemented AWS infrastructure, including VPC, Subnets, Gateways, and Load Balancers, to support enterprise-grade data warehouses. Automated backups, AMI creation, and resource provisioning using AWS CLI, Terraform, and AWS SDK for Java.
Built and optimized CI/CD pipelines using Jenkins, Bamboo, Git, Bitbucket, and Maven to automate build, test, and deployment processes for Java and containerized applications. Leveraged Jenkins Master/Slave configurations, Shell scripting, and integrated testing phases, with SonarQube for static code analysis.
Contributed to infrastructure automation scripts and tooling using C/C++ for tasks requiring low-latency execution and direct system-level interaction across Linux-based servers.
Administered and automated enterprise-grade Linux (RHEL, CentOS) infrastructure in AWS, supporting mission-critical systems for development and production.
Automated server provisioning, configuration, and patching with Bash, Ansible, and Terraform, ensuring consistency across hybrid cloud environments.
Maintained secure access control through IAM, SSH key rotation, and role-based permissions; implemented backup/restore strategies using AWS CLI and custom scripts.
Streamlined deployment pipelines for Java-based microservices using Jenkins, Docker, and Kubernetes, enhancing delivery speed and stability.
Configured IAM roles and policies for AWS, managing user permissions for secure environments. Integrated AWS Lambda with Java to implement serverless architectures and orchestrated workflows via API Gateway and AWS SDK for Java.
Developed Shell, Python, and Java scripts for automating build processes, system administration tasks, server provisioning, and container orchestration. Used tools like Ansible, Puppet, Chef, and Gradle for configuration management and deployment automation, with a focus on Java-based environments.
Built and migrated to serverless architectures, incorporating AWS Lambda, DynamoDB, and API Gateway. Integrated Java-based data processing tasks with AWS S3, Glacier, EMR, Neo-4j, and Zeppelin for optimized data flow and analysis pipelines, and leveraged Spring Cloud for cloud-native Java applications.
Exiger, Jersey City, NJ Sep 2016 - Dec 2018
Build and Release Engineer Linux Administrator
Designed and optimized build and release pipelines using Jenkins, Maven, Ant, and Terraform, automating build processes and deployments across multiple environments (Dev, QA, Staging, Production).
Leveraged Terraform, CloudFormation, Ansible, and Chef for automated infrastructure provisioning and configuration management, ensuring scalable and secure environments for applications.
Maintained and secured Linux-based build and application servers, managing services like Apache, NGINX, Tomcat, and WebLogic across multiple environments.
Developed and maintained Bash and Python scripts for automation of system tasks, builds, deployments, and monitoring routines.
Orchestrated infrastructure provisioning with Terraform and CloudFormation, optimizing deployment efficiency and reducing manual intervention.
Administered version control systems (Git, SVN), managed user roles, and integrated Jenkins with Ansible and Docker for automated CI/CD workflows.
Implemented centralized logging and monitoring with ELK Stack and CloudWatch, improving visibility and auditability across systems.
Managed artifact lifecycle and versioning using Maven, Ant, Git, SVN, and Artifactory, ensuring consistent and reproducible builds, and enforcing best practices in source code management.
Orchestrated automated deployments with zero-downtime updates using Docker, Kubernetes, AWS ECS, and Helm. Integrated monitoring solutions like ELK Stack, Prometheus, and CloudWatch to ensure stable production releases, and implemented rollback strategies to maintain system reliability.
Technical Skills
Cloud Platforms
AWS (EC2, S3, EBS, VPC, ELB, IAM, CloudFormation, Lambda, Route 53), Microsoft Azure (AKS, ADF, ARM, Azure DevOps, App Services, DNS Zones), GCP (GKE, Cloud Build, IAM, GCS)
CI/CD & Release Management
Jenkins (Declarative Pipelines, Shared Libraries), GitLab CI/CD (Runners, Variables), Bamboo CI, Azure DevOps Pipelines (YAML-based), Octopus Deploy (Lifecycles, Tenants)
Version Control Systems
Git (Branching Strategies, Gitflow, Submodules), GitHub Actions, Bitbucket Pipelines, GitLab Repos, SVN (Hooks, Merges), TFS (Work Item Integration)
Containerization & Orchestration
Docker (Dockerfiles, Multi-stage builds, Docker Compose), Kubernetes (Helm Charts, RBAC, Ingress Controllers, CRDs), VirtualBox, Vagrant (Provisioners)
Application/Web Servers
Apache HTTPD (mod_proxy, SSL/TLS), NGINX (Load Balancing, Reverse Proxy), Tomcat (WAR/EAR Deployments), JBoss EAP, WebSphere, WebLogic, IIS (App Pools, .NET Hosting)
Issue & Change Management
Jira (Custom Workflows, Automation Rules), Bugzilla, IBM Rational ClearQuest, BugHost (Defect Lifecycle Integration)
Infrastructure as Code (IaC)
Terraform (HCL2, Modules, Remote Backends), AWS CloudFormation (Nested Templates, StackSets), Azure ARM (Parameterization, Linked Templates)
Configuration Management
Ansible (Playbooks, Roles, Vaults), Puppet (Manifests, Hiera, PuppetDB), Chef (Cookbooks, Recipes, Test Kitchen), SaltStack (States, Grains, Pillars)
Build Tools
Apache Maven (POM Hierarchies, Profiles), Gradle (Groovy DSL, Multi-module Builds), Apache ANT (Build.xml, Targets), MSBuild, NANT (For .NET Builds)
Database Systems
PostgreSQL (psql, pgAdmin), MySQL (Replication, Dump & Restore), MongoDB (Sharding, Aggregation Framework), Cassandra (CQL, Data Modeling), SQLite
Monitoring & Observability
Prometheus (Custom Exporters, AlertManager), Grafana (Dashboards, Data Sources), AWS CloudWatch (Logs, Metrics, Alarms), Splunk (SPL Queries), New Relic, Sensu, Nagios
Scripting & Automation
Python (Boto3, Paramiko, Jinja2, Fabric), Bash (Shell Functions, Cron), PowerShell (Modules, DSC), Perl, Ruby, JSON/YAML (Parsing, Configuration Templates)
Operating Systems
Linux (RHEL, Ubuntu, CentOS – Systemd, SELinux, LVM), UNIX (AIX, Solaris – SMIT, NIM), Windows Server (Active Directory, Group Policy, PowerShell Remoting)
Certifications
AWS Certified Solutions Architect – Associate
AWS Certified DevOps Engineer – Professional
Certified Kubernetes Administrator (CKA)
Education
Bachelor’s Degree in Computer Science from University of Karachi