YASIR ARAFAT
California, USA 669-***-**** ***************@*****.*** Linkedin Github Website Summary Statement
Results-driven DevOps Engineer / SRE With over 5 years of experience in multi-cloud DevOps (AWS, Azure, GGP), I've led the rollouts of scalable infrastructure, orchestrated Kubernetes (EKS/AKS) deployments, and automated CICD using Jenkins, Python, Shell scripting, GiHub Actions, and ArgoCD. I built IaC frameworks in Terraform, streamlined deployment pipelines, and introduced observability via Prometheus and Grafana, cutting downtime and speeding up releases. I also have expertise in MLOps, Agentic Al and Al prompts. Skills
• Cloud Platforms: AWS, Azure, Google Cloud
• Scripting & Development: Python, Java, Bash, PowerShell, SQL
• Framework: Bootstraps, React.js
• Server: Apache Tomcat, Nginx
• Infrastructure as Code (IaC): Terraform, CloudFormation, Boto3
• Configuration Management: Puppet, Ansible, Chef
• Containerization & Orchestration: Docker, Kubernetes (EKS, AKS, GKE), Kafka
• CI/CD Tools: Jenkins, ArgoCD, GitHub Actions, GitLab, Bamboo, CloudBees
• Multi-Cloud Management: Anthos, Azure Arc, Terraform Cloud
• Security Tools: HashiCorp Vault, SonarQube
• SDLC & Tools: Agile, Scrum, Jira, Slack, Microsoft Teams, Okta
• Monitoring Tools: Prometheus, Grafana, CloudWatch
• Platforms & Networking: Linux, Windows, MacOS, Networking
• AWS Services: AWS CLI, S3, Auto Scaling, RDS, IAM, Lambda (Python), Elastic Load Balancers, CloudTrail, SNS CERTIFICATIONS
• Certified Kubernetes Application Developer (CKAD): Linux Foundation
• AWS Solutions Architect - Associate: AWS
• Microsoft Certified: DevOps Engineer Expert: AZURE 400
• Microsoft Certified: Azure Administrator Associate: AZURE 104 EXPERIENCE
i2Data Systems Inc Jun 2023 - Present
DevOps Engineer
• Designed and managed AWS and Azure cloud-based infrastructure using AWS CLI and Terraform for IaC, reducing manual provisioning by 80%.
• Designed and operated AWS infrastructure following the AWS Well-Architected Framework, improving reliability, security, and cost efficiency across environments.
• Built and optimized Kubernetes clusters (EKS/AKS), ensuring 99.99% uptime with auto-scaling (HPA/VPA) and monitoring via Prometheus & Grafana.
• Created reusable Terraform modules aligned with AWS Well-Architected Framework pillars for scalable and secure infrastructure.
• Built Infrastructure-as-Code pipelines using Terraform and Ansible to standardize cluster provisioning, networking, and node configuration.
• Led root-cause analysis of production incidents related to networking (CNI), storage (CSI), and ingress configurations.
• Integrated observability stack (Prometheus, Grafana, ELK) for full-stack monitoring, alerting, and log aggregation.
• Designed, deployed, and operated containerized applications on Kubernetes, ensuring high availability, scalability, and fault tolerance.
• Implemented ArgoCD for GitOps, automating deployment workflows and reducing errors by 40%.
• Integrated SCA scanning into CI/CD pipelines (GitHub, Jenkins) to enforce security gates and prevent vulnerable builds from reaching production
• Configured auto-scaling, rolling upgrades, and pod disruption budgets for Kafka clusters.
• Integrated Grafana with Prometheus, Loki, and CloudWatch for centralized log and metric visualization.
• Designed and implemented CI/CD pipelines for deployments using Jenkins, Github Actions, DevOps best practices for cloud deployments.
• Administered Unix/Linux environments, performing routine maintenance, patching, and performance tuning to ensure 99.9% uptime.
• Supported mission-critical AWS-hosted middleware platform enabling digital ordering channels, maintaining system reliability and minimizing downtime.
• Managed and optimized large-scale AWS RDS relational databases using advanced SQL queries, enhancing data retrieval performance by 30%.
• Istio Service Mesh practical experience installing and configuring Istio on Kubernetes/EKS.
• Used Splunk for tracking authentication events, correlating logs for intrusion detection, and supporting compliance needs (e.g., SOC2, HIPAA).
• Implemented end-to-end MLOps pipelines integrating CI/CD practices to automate model training, testing, and deployment using Jenkins, Argo Workflows.
• Designed and managed AWS API Gateway (REST & HTTP APIs) to securely expose backend microservices running on Lambda and Kubernetes (EKS)
• Optimized GPU-enabled infrastructure on AWS (Sagemaker, AKS, EKS, EC2) to support scalable AI/ML workloads with auto-scaling and spot instance savings.
• FinOps tools like Kubecost K8s cost monitoring by namespace/workload, Grafana Add cost + usage metrics into dashboards.
• Investigated security incidents end-to-end including detection, containment, eradication, recovery, and post-incident analysis.
• Perform RCA (Root Cause Analysis), improve system resilience, and reduce MTTR.
• Designed and implemented end-to-end vulnerability lifecycle management processes, aligning DevOps and SecOps to accelerate remediation timelines.
• Executed complex SQL queries, table joins, and stored procedures on Amazon Aurora MySQL RDS to investigate data inconsistencies and support issue resolution.
• Defined and monitored SLIs (availability, latency, error rate) using CloudWatch and Prometheus.
• Established SLOs aligned with business reliability targets (99.9% uptime). American International University-Bangladesh Jun 2021 - Dec 2022 Jr DevOps Engineer
• Designed and maintained GitOps-based pipelines using Jenkins and ArgoCD, enabling continuous delivery of containerized microservices across EKS and AKS clusters.
• Monitored Kafka cluster health using consumer lag metrics and broker monitoring.
• Built producer and consumer applications using Kafka to process streaming events with low latency.
• Implement GitLab CI/CD platform & provide hands-on engineering, technical support.
• Automated build, configuration, and deployment with well-structured Jenkins pipelines, integrating artifact management and automated notifications.
• Created and managed IAM profiles, group policies, and permission boundaries, conducted regular reviews for access compliance/audits
• Designed and implemented cloud infrastructures across AWS and Azure, ensuring they were highly available, faulttolerant, and scalable, meeting client requirements.
• Collaborated with production support to deliver timely resolution for high-impact incidents, leveraging logs, metrics, and cross service insights
• Fostered a collaborative knowledge-sharing environment by mentoring junior team members and introducing best practices in IaC and DevOps methodologies.
• Integrated API Gateway deployments into CI/CD pipelines to enable controlled releases and fast rollbacks. GrameenPhone Feb 2020 - May 2021
Jr. Cloud Engineer
• Managed AWS and Azure cloud infrastructure using AWS CLI, optimizing cloud resource usage and costs, and implemented GitLab CI/CD pipelines automating software deployment and testing.
• Authored and maintained operational runbooks for frequent AWS tasks, supporting Level 1/2 teams for faster issue triage and escalation.
• Automated build, configuration, and deployment with well-structured Jenkins pipelines, integrating artifact management and automated notifications.
• Performed daily operational tasks in AWS using AWS CLI: started/stopped EC2 instances to optimize resource usage, managed snapshots, and scheduled instance maintenance windows.
• Configured custom CloudWatch dashboards, defined alarm thresholds, and integrated SNS to proactively monitor key metrics and trigger automated scaling or remediation workflows.
• Administered user, group, and permission policies in IAM for least-privilege access, including automated provisioning, rotation, and auditing.
• Regularly analyzed AWS Cost Explorer reports to maintain budget targets, allocate shared costs, and enable costeffective system design.
• Investigated network-based anomalies and performed advanced networking analysis of lateral movement and suspicious protocol behavior.
• Ensured compliance with SLA commitments to customers. Projects & Achievements
• Optimized AWS EKS, Azure AKS workloads, reducing infrastructure costs by 30%.
• Developed serverless automation with AWS Lambda, improving system efficiency. Education
University of the Cumberlands May 2023 - May 2025
MSc, Information Technology
American International University-Bangladesh Sep 2017 - Dec 2021 BSc, Computer Science & Engineering