SANDEEP KUMAR
SENIOR CLOUD ENGINEER
Phone: +1-216-***-****
Email: *******.*****@*****.***
LinkedIn: www.linkedin.com/in/santorli
OBJECTIVE
A dedicated, organized, and results-driven IT professional with 11+ years of industry experience, including 6+ years as a Cloud/DevOps Engineer, specializing in AWS, Azure, SRE & GCP, and OpenStack. Adept at implementing Continuous Integration (CI), Continuous Deployment (CD), and Configuration Management to streamline development and operations. Strong expertise in Build/Release Engineering and Linux Systems Administration with 5 years of hands-on experience. Proven track record in optimizing cloud infrastructure, automating workflows, and improving system scalability using tools like Docker, Kubernetes, Terraform, Ansible, Chef, Jenkins, GIT, and Maven.
PROFESSIONAL SUMMARY
Experience with AWS Cloud services like EC2, VPC, ELB, Auto Scaling Group, Security Groups, Route53, IAM, EBS, AMI, EFS, RDS, S3, SNS, SQS, CloudWatch, CloudFormation, EKS, Lambda, and Direct Connect.
Experience with Azure Cloud services like Virtual Machine (VM), Virtual Network (VNET), App Service, Function App, Azure SQL, Cosmos DB, Azure Container Registry (ACR), Azure Front Door, Azure Active Directory (AD), and Azure Kubernetes Service (AKS).
Strong foundation in Site Reliability Engineering (SRE), ensuring reliability, scalability, and performance of systems, enhancing the overall efficiency of IT infrastructure.
Developed and maintained a repository of scripts and tools tailored to manage Google Cloud Platform(GCP) resources efficiently, enhancing operational efficiency and ensuring consistency across environments.
Worked on setting up Kubernetes Clusters (EKS and AKS), deploying microservices, setting up NGINX ingress controllers, Istio service mesh, and periodically upgrading Kubernetes versions.
Experienced in building and releasing Microservices Enterprise Applications for both Non-Prod and Prod Environments using CI/CD tools like Jenkins, Azure DevOps Pipelines, and GitHub Actions, deploying them onto AKS Clusters using HELM Charts.
Applied a broad understanding of networking concepts such as VPC, NAT, ACL, DNS, Proxies, Firewalls, VPC Endpoints, Direct Connect, and VPN.
Created Terraform Modules for AKS Clusters, integrated with Azure DevOps Pipelines and Jenkins.
Used PowerShell scripting for automation of deployments and Azure resources.
Experienced in migrating .NET Web Applications running in IIS from On-Prem Data Centers to Azure.
Migrated Java Tomcat containers from On-Prem to AWS Beanstalk, Lambda, and EKS.
Migrated SQL Databases and ETLs from On-Prem to Azure SQL using Azure Data Factory.
Extensive experience in deploying, configuring, and managing the Elastic Stack, including Elasticsearch, Logstash, Kibana, and Beats.
Implemented Ansible Playbooks to manage Linux and Windows Host servers and automate new server configurations.
Proficient in deploying container applications in OpenShift 3.9 across multiple clusters in On-Prem Private Clouds provisioned using VMware Tanzu.
Worked on Setting up SQL Server and MYSQL Database into On-Prem Virtual Servers.
Setting up Blue/Green Deployment Strategies for AppService, AKS/EKS Cluster Services.
Demonstrated proficiency in working with Ansible Tower, simplifying the deployment and management of playbooks, inventories, and roles.
Deployed Web Applications, Databases and Setup ETLs using SSIS working with Bare-Metal Servers in On-Prem Data Centre.
Experience in setting up code quality tools such as SonarQube and VeraCode by registering projects and integrating them into CI/CD pipelines.
Developed and implemented Kubernetes manifests for deployment of microservices and installation of Prometheus and Grafana monitoring pods into Kubernetes.
Demonstrated experience building software and computer systems using JavaScript and Ruby.
Strong hands-on experience in scripting languages like Groovy, Python, and Shell scripting.
Successfully executed the migration of Jenkins pipelines to GitLab, improving version control and facilitating a more collaborative development environment.
SKILLS
DevOps Tools
Git, Jenkins, Maven, Docker, Ansible, Chef, Puppet, Azure Repos, Azure Artifacts.
Cloud Platforms
Microsoft Azure, Aws Cloud, Google Cloud Platform (GCP).
AWS Services
EC2, VPC, RDS, IAM, CloudFormation, EBS, EKS, ECR, S3, ELB, Lambda, Auto Scaling, Cloud Trial, SQS, SNS, SWF, CloudWatch Logs.
Azure Services
VM, App Services, Key Vault, Function App, Blob Storage, Azure Active Directory (Azure AD), VNet, Service Bus, AKS, ACR, Azure SQL, Azure Cosmos DB.
Version Control Tools
Git, GitHub, GitLab’s, GitOps, Bitbucket.
CI/CD
Jenkins, Run Deck, Azure DevOps, TeamCity, Bamboo, Travis CI, Gitlab CI, Circle CI, Azure Pipelines.
Databases
Oracle, MySQL, Cassandra, PostgreSQL, Cosmos DB, DynamoDB, MongoDB, RDS.
Container Platforms
Docker, Docker Swarm, Kubernetes, Helm, OpenShift, AWS ECS.
Monitoring And Logging Tools
Prometheus, Grafana, Nagios, Splunk, ELK, CloudTrail, CloudWatch, Azure Monitor, Log Analytics, New Relic, Zabbix, Gray log, Datadog, AppDynamics.
Configuration Management
Ansible, Puppet, Chef, Salt Stack.
Languages
Python, Shell Scripting, Bash Scripting, TypeScript, Groovy, Perl.
IaaC
Cloud Formation, Terraform, Ansible, Puppet, ARM Templates, Chef.
Artifactory
JFrog and Nexus.
Documentation
Confluence, Markdown, Read the Docs.
Code Scanning
SonarQube, JFrog Xray, ECR Inspector.
Web Servers
Nginx, Apache, Tomcat, WebLogic, WebSphere, JBoss.
Operating Systems
Microsoft Windows XP/ 2000, iOS, Linux, Red Hat Enterprise Linux, CentOS, Ubuntu, UNIX.
Build Tools
Maven, Ant, Gradle.
Analytics
Redshift, Athena, EMR, Azure Data Lake Analytics, Azure Synapse Analytics.
Tracking Tools
Jira.
EXPERIENCE
Senior Cloud Engineer T. Rowe Price Owings Mills, MD OCT 2022 - PRESENT
Designed and implemented multi-cloud solutions across AWS and Azure, setting up Azure Data Platforms using Azure Data Factory, Databricks, and Data Lake Storage, while seamlessly integrating Snowflake SaaS Cloud Warehouse with Self-Hosted Integration Runtime to optimize data transformation and processing.
Architected and deployed AWS infrastructure using Terraform and CloudFormation, provisioning EC2 instances, S3 storage, IAM policies, Elastic Load Balancers (ELB), EKS clusters, Auto Scaling Groups, CloudFront CDN, and Route 53 for intelligent DNS management, ensuring high availability, security, and compliance with enterprise standards.
Led the migration of on-premises applications to Azure Cloud, optimizing cost and performance by leveraging Azure Virtual Machines, Managed Disks, and Storage Accounts. Integrated Azure Kubernetes Service (AKS) to manage containerized workloads, enhancing scalability, resilience, and operational efficiency.
Automated Web Application, Database, and Infrastructure Deployments using AWS Jenkins, Azure DevOps CI/CD Pipelines, and GitHub Actions. Integrated SonarQube and Trivy Container Scanning to enforce security and quality gates during deployments.
Developed robust CI/CD pipelines for microservices-based applications, leveraging Docker, Kubernetes, and OpenShift for container orchestration. Implemented Kubernetes Autoscaling, HPA, and Ingress Controller Rules for seamless traffic management.
Configured Azure Active Directory authentication, assigning role-based access controls (RBAC) for Azure resources. Managed application authentication through Service Principal and Managed Identities, ensuring secure API access.
Deployed Istio Service Mesh on Kubernetes clusters to enable secure service-to-service communication. Configured Sidecar Proxy, Init Containers, and mTLS authentication to enhance microservices security, observability, and traffic control.
Automated AWS Secrets Manager integration with Jenkins Credentials, ensuring secure storage and dynamic retrieval of application configurations. Implemented AWS Parameter Store for centralized management of sensitive data.
Set up Kubernetes monitoring with Prometheus, Grafana, and Alert Manager, defining custom scrape metrics to track cluster performance. Designed Grafana dashboards for real-time insights into application health, security compliance, and resource utilization.
Developed Unix Shell Scripts and Python automation tools to streamline AWS workload management, optimizing log analysis, backups, and data migration processes. Integrated Terraform and Ansible for end-to-end cloud automation, reducing manual interventions.
Configured Terraform State File Locking with DynamoDB to ensure collaborative infrastructure deployments. Provisioned and managed RDS PostgreSQL instances with automated schema and table role assignments in Snowflake.
Designed and implemented Ansible Playbooks for provisioning and configuring cloud infrastructure, integrating with Terraform for seamless orchestration. Deployed Java, Maven, Docker, and Nginx Web Server on EC2 instances using automated playbooks.
Worked extensively with Kubernetes Helm charts for deploying applications, managing upgrades, and ensuring consistent deployments across multiple environments. Established best practices for YAML-based configurations.
Built microservices using Spring Boot, Node.js, and OpenShift Container Platform (OCP), optimizing cloud-native applications for scalability and resilience. Managed OpenShift CI/CD pipelines for continuous integration and deployment.
Implemented CloudWatch Metrics and Grafana visualizations using Terraform, automating performance monitoring across AWS environments. Designed threshold-based alerts to proactively detect anomalies and performance bottlenecks.
Leveraged JIRA and Confluence to manage project workflows, track CI/CD pipeline enhancements, and document DevOps best practices. Created dashboards to report on deployment success rates, infrastructure health, and operational efficiency.
Spearheaded security best practices by integrating AWS GuardDuty, Security Hub, and IAM role-based policies into cloud environments, ensuring compliance with organizational security standards. Configured AWS WAF and Shield to enhance security against threats.
Worked closely with development, security, and operations teams to streamline deployment processes, troubleshoot issues, and continuously optimize infrastructure for performance, scalability, and cost-effectiveness.
Implemented centralized logging and monitoring solutions using ELK Stack (Elasticsearch, Logstash, Kibana) and Fluentd for enhanced observability, ensuring proactive troubleshooting and log analytics.
Designed and configured AWS EKS cluster autoscaling and AWS ALB Ingress Controller to balance workloads dynamically, ensuring high availability of Kubernetes-based applications.
Environment: AWS, Azure, Terraform, GCP, OpenShift, CloudWatch, PowerShell, Jenkins, CI/CD, Kubernetes, Docker, MySQL, Groovy, GitHub, Shell, Git, CircleCI, Python, Chef, Ansible, Prometheus, Grafana, Snowflake, Istio Service Mesh, Azure Data Factory, Azure Databricks, ELK Stack, Fluentd, AWS WAF, AWS Shield, Route 53, AWS Parameter Store, AWS ALB Ingress Controller, Azure Kubernetes Service (AKS).
Cloud Engineer FINRA Rockville, MD OCTOBER 2021 – SEPTEMBER 2022
Experience with Microsoft Azure, Azure Resource Management templates, Virtual Networks, Storage, Virtual Machines, and Azure Active Directory, Azure VM & Manage VM backups, azure files, Queue storage and blob storage using Terraform.
Developed and maintained complex Bash scripts to automate routine tasks, reducing manual workload.
Implemented robust error handling and logging in Bash scripts to ensure reliability and ease of debugging during automated processes.
Utilized agile methodologies for project management and conducted weekly and daily release management activities.
Worked on Azure DevOps services such as Azure Repos, Azure Boards, and Azure Test Plans to plan work and collaborate on code development, built and deployed application.
Managed and monitored Azure Cosmos DB instances troubleshooting issues and optimizing performance of SQL Managed Instances.
Created, configured, and managed AKS clusters in Azure, including node pool management, networking configuration, and load balancing setup integrated with Jenkins for CI/CD.
Managed servers on the Microsoft Azure Platform Azure Virtual Machines instances using Ansible Playbooks, tasks, and roles to automate.
Proficient in designing and optimizing SQL Server databases, including schema design, indexing strategies, and query optimization, to ensure efficient data storage and retrieval.
Automated the deployment of application patches by integrating Ansible with Jenkins to trigger updates when critical vulnerabilities were detected in the IT infrastructure.
Created Ansible roles in YAML and defined tasks, variables, files, handlers, and templates.
Worked with Terraform scripts to automate the Azure IAAS virtual machines using terraform modules and deployed virtual machine scale sets in production environment.
Worked with Terraform and Docker for automating VNET, NSG, AKS, ACR, VMs, and Storage accounts to replace the rest of our infrastructure.
Involved in setting up JIRA as defect tracking system and configured various workflows, customizations, and plugins for the JIRA bug/issue tracker.
Established HTTPS Ingress controller and use TLS certificate on AKS to provide reverse proxy, configurable traffic routing for individual Kubernetes services.
Designed and implemented CI/CD pipeline for Azure app services and responsible for monitoring the performance and availability of Azure App Services and troubleshooting any issues that arise.
Extended working knowledge in azure networking services, Configured VNETs and subnets as per the project requirement. created and maintained NSG rules.
Implemented and configured SPLUNK for log management, monitoring, and analysis of large-scale distributed systems, enabling real-time visibility into system performance and operational metrics, checking health checks
Developed and maintained Continuous Integration (CI) using tools in Azure DevOps (VSTS) spanning multiple environments, enabling teams to safely deploy code in Azure Kubernetes Services (AKS) using YAML scripts and HELM charts.
Built and maintained Python-based CI/CD pipeline automation, integrating with tools like Jenkins, GitLab, and Terraform for infrastructure management.
Implemented CI/CD pipelines using GitLab CI/CD, automating build, test, and deployment processes, resulting in significant time and cost savings while ensuring code quality and reliability.
Led migration projects to transition code repositories and CI/CD workflows from other version control systems and CI/CD platforms to GitLab, ensuring a smooth transition and minimal disruption to development workflows.
Integrated GitLab with other DevOps tools and services such as Kubernetes, Docker, and Jira, to create end-to-end automation and streamline the software development lifecycle, from code commit to production deployment.
Developed and implemented access controls in Azure repos to ensure that only authorized users have access to sensitive data and repositories.
Developed python and shell scripts for automation of build and release process.
Configuring Azure Key Vault services to development teams for handling secrets in dev, test, and production environments using both UI and CLI in Azure pipeline.
Strong command-line skills in Unix/Linux environments, including system navigation, file management, and user administration.
Collaborated with cross-functional teams to establish GitLab best practices and workflows, including branching strategies, code review processes, and release management, to optimize team productivity and code quality.
Environment: Azure, Azure Files, Queue Storage, Blob Storage, Azure Repos, Azure Boards, Azure Test Plans, Azure Cosmos DB, Terraform, Azure DevOps, AKS, Prometheus, Grafana, Datadog, Linux (for OS patching processes), Git, Azure Key Vault, Prometheus, Kubernetes, ELK, Grafana, TLS certificate for HTTPS Ingress controller.
DevOps Engineer / SRE General Motors Detroit, MI JULY 2019 - SEPTEMBER 2021
Worked as a DevOps Engineer for multiple development teams, system analysis team to establish a build schedule, provide a guideline for deployment in higher environments and with troubleshooting, build system failures.
Worked for 5 scrum teams (Java, AEM, Jenkins, Ant, Maven, SVN, git, Agile methodology, cucumber scripts, sonar, XL Deploy and XL Release, SharePoint, CI/CD automation from scratch, Docker)
Experience in AWS Cloud platform and its features which includes EC2, VPC, ELB, SQS, SNS, RDS, EBS, EMR, Cloud Watch, IAM, Route 53.
Deployed Active Directory domain controllers to Microsoft Azure using VPN gateway and integrated Kubernetes clusters for centralized authentication.
Created and configured data visualizations using Grafana and integrated Jenkins pipelines for automated dashboard updates, including graphs, tables, and alerts.
Automated processes with PowerShell scripts, Terraform for infrastructure provisioning, DNS changes, DC builds, and user management.
Used Continuous development, Continuous Integration (CI) and Continuous Deployment (CD)in runtime with VSTS.
Created and wrote shell scripts (Bash), Python and PowerShell for automating tasks.
Familiarity with Grafana API and scripting capabilities, and ability to use them to automate data collection and analysis tasks.
Configured Kibana dashboards to visualize Prometheus metrics such as latency and error rates, alongside logs from the ELK stack for unified monitoring.
Implemented Microservices on OpenShift based on Docker to achieve Continuous Delivery.
Created the Docker images using Jenkins and tagged them and pushed that image for all the promotional environments for all the applications.
Used Ticketing tool JIRA to track defects and changes for change management, monitoring tools like New Relic and CloudWatch in different work environments in real and container workspace.
Implemented Ansible Tower to dynamically scale infrastructure resources, including load balancers and auto- scaling groups, based on workload demands, ensuring optimal performance and resource utilization.
Knowledge of best practices for performance monitoring and optimization, and ability to use Dynatrace to drive continuous improvement in application performance.
Developed custom monitoring solutions using Ansible Tower APIs and Prometheus, providing real-time infrastructure and application insights.
Experience in creating alarms and notifications for EC2 instances using CloudWatch.
Designed, Installed and Implemented CI/CD automation system. Used Ant, Maven as a build tool on java projects for the development of build artifacts on the source code.
Implemented Release schedules, communicated the Release status, created Roll out Plans, tracked the Project Milestones, prepared the reports, chaired the Release calls, and worked for successful Releases.
Environment: Java, Maven, GCP, ANT, Gradle, groovy, GIT, Docker, SVN, Puppet, Jenkins, Ruby, Splunk, JMeter, Tomcat, SonarQube, Bugzilla, Shell, and Perl Scripts, Ansible, PowerShell, Nexus, RHEL 5.x/6.x.
Linux System Administrator Zensar Technologies India JAN 2015 - AUG 2018
Created and maintained user accounts in RedHat Enterprise Linux (RHEL)and other operating systems.
Troubleshooting and maintaining of TCP/IP, Apache HTTP/HTTPS, SMTP and DNS applications.
Configuration of NIS, DNS, NFS, SENDMAIL, LDAP, TCP/IP, Send Mail, FTP, Remote access Apache Services on Linux & UNIX Environment.
Implementing and maintaining user authentication mechanisms.
Configured and monitored distributed and multi-platform servers using chef.
Hardening system security by configuring firewalls, intrusion detection systems, and implementing security and involved in regular security audits and ensuring compliance with industry standards.
Migrated different projects from Perforce to SVN.
Deploying and configuring the Linux servers (like CentOS, Ubuntu) for various applications and services and Managing system resources, including CPU, memory, storage, and ensuring optimal performance.
Creating and maintaining documentation for system configurations, procedures, and troubleshooting steps.
Environment: RHEL, Windows, Shell Script, VMware servers, XEN, ESX, ESXi, WebSphere, Perforce, Splunk Enterprise Server 5.x, SVN, Windows 2003 server, Kick Start, Solaris.
Linux/Unix Administrator Reliance Industries India JULY 2013 – DEC 2014
Installation, configuration and administration of RedHat/SUSE Linux and Solaris Operating System.
Worked in server migration using P2P, V2V, and P2v conversion.
Server hardening. Remediating PCI/Audit requirements. Server Patching. Building out VM servers.
Opening Cases with vendors as needed. Providing Root cause analysis for problems as needed.
Moving Linux based applications to different VDC as required.
Updating DNS entries and server team wiki documentation as needed.
Creating/expanding NFS shares/add/remove SAN disk/manage LVM.
Providing on-call help as needed. Managing server inventory database.
Installing operating systems on new hardware or virtual containers.
Responsible for VM (Windows/Linux) Build Configuration including Storage.
Knowledge of POD (Windows/Linux) Blade Build Configuration.
Directed ticket tracking and workflow management using Jira.
Environment: TFS 2010(Team Foundation Server), GIT, JENKINS, Chef, Jira, PUPPET, OpenShift, REDHAT Linux 3,4.X,5,6, VMware ESX 3.5, Veritas Volume Manager.
EDUCATION
D.I.E.T - India 2013
Bachelors In Computer Science
CERTIFICATIONS
Microsoft Certified: Azure Administrator Associate
AWS Certified Developer – Associate
Certified Kubernetes Administrator