Post Job Free
Sign in

DevOps Engineer - SRE - Cloud & CI/CD Specialist

Location:
San Jose, CA
Posted:
March 06, 2026

Contact this candidate

Resume:

YASIR ARAFAT

California, USA 669-***-**** ***************@*****.*** Linkedin Github Website Summary Statement

Results-driven DevOps Engineer / SRE With over 5 years of experience in multi-cloud DevOps (AWS, Azure, GGP), I've led the rollouts of scalable infrastructure, orchestrated Kubernetes (EKS/AKS) deployments, and automated CICD using Jenkins, Python, Shell scripting, GiHub Actions, and ArgoCD. I built IaC frameworks in Terraform, streamlined deployment pipelines, and introduced observability via Prometheus and Grafana, cutting downtime and speeding up releases. I also have expertise in MLOps, Agentic Al and Al prompts. Skills

• Cloud Platforms: AWS, Azure, Google Cloud

• Scripting & Development: Python, Java, Bash, PowerShell, SQL

• Framework: Bootstraps, React.js

• Server: Apache Tomcat, Nginx

• Infrastructure as Code (IaC): Terraform, CloudFormation, Boto3

• Configuration Management: Puppet, Ansible, Chef

• Containerization & Orchestration: Docker, Kubernetes (EKS, AKS, GKE), Kafka

• CI/CD Tools: Jenkins, ArgoCD, GitHub Actions, GitLab, Bamboo, CloudBees

• Multi-Cloud Management: Anthos, Azure Arc, Terraform Cloud

• Security Tools: HashiCorp Vault, SonarQube

• SDLC & Tools: Agile, Scrum, Jira, Slack, Microsoft Teams, Okta

• Monitoring Tools: Prometheus, Grafana, CloudWatch

• Platforms & Networking: Linux, Windows, MacOS, Networking

• AWS Services: AWS CLI, S3, Auto Scaling, RDS, IAM, Lambda (Python), Elastic Load Balancers, CloudTrail, SNS CERTIFICATIONS

• Certified Kubernetes Application Developer (CKAD): Linux Foundation

• AWS Solutions Architect - Associate: AWS

• Microsoft Certified: DevOps Engineer Expert: AZURE 400

• Microsoft Certified: Azure Administrator Associate: AZURE 104 EXPERIENCE

i2Data Systems Inc Jun 2023 - Present

DevOps Engineer

• Designed and managed AWS and Azure cloud-based infrastructure using AWS CLI and Terraform for IaC, reducing manual provisioning by 80%.

• Designed and operated AWS infrastructure following the AWS Well-Architected Framework, improving reliability, security, and cost efficiency across environments.

• Built and optimized Kubernetes clusters (EKS/AKS), ensuring 99.99% uptime with auto-scaling (HPA/VPA) and monitoring via Prometheus & Grafana.

• Created reusable Terraform modules aligned with AWS Well-Architected Framework pillars for scalable and secure infrastructure.

• Built Infrastructure-as-Code pipelines using Terraform and Ansible to standardize cluster provisioning, networking, and node configuration.

• Led root-cause analysis of production incidents related to networking (CNI), storage (CSI), and ingress configurations.

• Integrated observability stack (Prometheus, Grafana, ELK) for full-stack monitoring, alerting, and log aggregation.

• Designed, deployed, and operated containerized applications on Kubernetes, ensuring high availability, scalability, and fault tolerance.

• Implemented ArgoCD for GitOps, automating deployment workflows and reducing errors by 40%.

• Integrated SCA scanning into CI/CD pipelines (GitHub, Jenkins) to enforce security gates and prevent vulnerable builds from reaching production

• Configured auto-scaling, rolling upgrades, and pod disruption budgets for Kafka clusters.

• Integrated Grafana with Prometheus, Loki, and CloudWatch for centralized log and metric visualization.

• Designed and implemented CI/CD pipelines for deployments using Jenkins, Github Actions, DevOps best practices for cloud deployments.

• Administered Unix/Linux environments, performing routine maintenance, patching, and performance tuning to ensure 99.9% uptime.

• Supported mission-critical AWS-hosted middleware platform enabling digital ordering channels, maintaining system reliability and minimizing downtime.

• Managed and optimized large-scale AWS RDS relational databases using advanced SQL queries, enhancing data retrieval performance by 30%.

• Istio Service Mesh practical experience installing and configuring Istio on Kubernetes/EKS.

• Used Splunk for tracking authentication events, correlating logs for intrusion detection, and supporting compliance needs (e.g., SOC2, HIPAA).

• Implemented end-to-end MLOps pipelines integrating CI/CD practices to automate model training, testing, and deployment using Jenkins, Argo Workflows.

• Designed and managed AWS API Gateway (REST & HTTP APIs) to securely expose backend microservices running on Lambda and Kubernetes (EKS)

• Optimized GPU-enabled infrastructure on AWS (Sagemaker, AKS, EKS, EC2) to support scalable AI/ML workloads with auto-scaling and spot instance savings.

• FinOps tools like Kubecost K8s cost monitoring by namespace/workload, Grafana Add cost + usage metrics into dashboards.

• Investigated security incidents end-to-end including detection, containment, eradication, recovery, and post-incident analysis.

• Perform RCA (Root Cause Analysis), improve system resilience, and reduce MTTR.

• Designed and implemented end-to-end vulnerability lifecycle management processes, aligning DevOps and SecOps to accelerate remediation timelines.

• Executed complex SQL queries, table joins, and stored procedures on Amazon Aurora MySQL RDS to investigate data inconsistencies and support issue resolution.

• Defined and monitored SLIs (availability, latency, error rate) using CloudWatch and Prometheus.

• Established SLOs aligned with business reliability targets (99.9% uptime). American International University-Bangladesh Jun 2021 - Dec 2022 Jr DevOps Engineer

• Designed and maintained GitOps-based pipelines using Jenkins and ArgoCD, enabling continuous delivery of containerized microservices across EKS and AKS clusters.

• Monitored Kafka cluster health using consumer lag metrics and broker monitoring.

• Built producer and consumer applications using Kafka to process streaming events with low latency.

• Implement GitLab CI/CD platform & provide hands-on engineering, technical support.

• Automated build, configuration, and deployment with well-structured Jenkins pipelines, integrating artifact management and automated notifications.

• Created and managed IAM profiles, group policies, and permission boundaries, conducted regular reviews for access compliance/audits

• Designed and implemented cloud infrastructures across AWS and Azure, ensuring they were highly available, faulttolerant, and scalable, meeting client requirements.

• Collaborated with production support to deliver timely resolution for high-impact incidents, leveraging logs, metrics, and cross service insights

• Fostered a collaborative knowledge-sharing environment by mentoring junior team members and introducing best practices in IaC and DevOps methodologies.

• Integrated API Gateway deployments into CI/CD pipelines to enable controlled releases and fast rollbacks. GrameenPhone Feb 2020 - May 2021

Jr. Cloud Engineer

• Managed AWS and Azure cloud infrastructure using AWS CLI, optimizing cloud resource usage and costs, and implemented GitLab CI/CD pipelines automating software deployment and testing.

• Authored and maintained operational runbooks for frequent AWS tasks, supporting Level 1/2 teams for faster issue triage and escalation.

• Automated build, configuration, and deployment with well-structured Jenkins pipelines, integrating artifact management and automated notifications.

• Performed daily operational tasks in AWS using AWS CLI: started/stopped EC2 instances to optimize resource usage, managed snapshots, and scheduled instance maintenance windows.

• Configured custom CloudWatch dashboards, defined alarm thresholds, and integrated SNS to proactively monitor key metrics and trigger automated scaling or remediation workflows.

• Administered user, group, and permission policies in IAM for least-privilege access, including automated provisioning, rotation, and auditing.

• Regularly analyzed AWS Cost Explorer reports to maintain budget targets, allocate shared costs, and enable costeffective system design.

• Investigated network-based anomalies and performed advanced networking analysis of lateral movement and suspicious protocol behavior.

• Ensured compliance with SLA commitments to customers. Projects & Achievements

• Optimized AWS EKS, Azure AKS workloads, reducing infrastructure costs by 30%.

• Developed serverless automation with AWS Lambda, improving system efficiency. Education

University of the Cumberlands May 2023 - May 2025

MSc, Information Technology

American International University-Bangladesh Sep 2017 - Dec 2021 BSc, Computer Science & Engineering



Contact this candidate