Kishore Kumar
Email: ****************@*****.***
Senior AZURE DevOps Engineer
Summary:
Senior Azure Engineer with over 8 years of extensive experience in Cloud (AWS), (Azure) Java applications, Configuration management, Infrastructure automation, Continuous Integration and Delivery (CI/CD). As SRE Engineer focused on Build and Deployment Automation, Writing and Managing CI/CD pipelines using AWS DevOps, Git, GitHub Actions, Jenkins, Containerization and Orchestration, and monitoring to help teams deliver a better, reliable Production environment.
Implemented Terraform Scripts to build AZURE Infrastructure.
Worked as a production support engineer and supported Java based P1/P2 applications which are hosted in Azure.
Responsible for managing all aspects of the Vulnerability Risk Management Program including vulnerability identification, analysis, remediation coordination and reporting.
Solid experience designing and building Continuous Integration/Continuous Deployment (CI/CD) pipelines and version control using tools like Jenkins, Argo CD, Azure Devops, Circles, Bitbucket, Gitlab, Git, Code Deploy, Code build, Code commit.
Functional knowledge and implementation experience of IT Service Management (ITSM) frameworks and demonstrated project management skills and experience working directly with customers and clients.
Migrated and resettled the applications and server instances from on-premise environment to AWS and GCP cloud.
My core competences are Infrastructure Management, Operations Management, Change Management, SLA Management, Application Availability Management, Application Cost Optimization, Customer Engagement
Experienced in working on DevOps/Agile operations process and tools area (Code review, unit test automation, Build & Release automation, Environment, Service, Incident and Change Management).
Provisioning of AWS resources like EC2, VPC, EBS, AMI, S3 buckets, creation of subnets and all other operational tasks
Experience in Shell scripting and extensively used Regular expressions in search string and data anonymization.
Progressive experience in Enterprise Vulnerability Management, Risk Assessment, penetration testing, generating reports, SQL Injection XSS and major hacking protection techniques.
Experience in Monitoring server performance with tools like Splunk, Datadog, Grafana, Prometheus, new relic and Cloud Watch, Big Panda, Dynatrace.
Administered Linux-based VMs (Ubuntu, RHEL, CentOS) in Azure, configuring disk partitions, LVM, network interfaces, SSH hardening, and kernel tuning for optimal performance.
Automated routine system tasks using Bash and PowerShell scripts, including log rotation, service restarts, scheduled jobs (cron), and disk cleanup.
Performed patch management and OS upgrades using package managers like yum, apt, and dnf with zero-downtime deployment strategies in production.
Managed systemd services for custom apps, ensuring proper service startup, failure recovery, and monitoring using systemctl and journalctl.
Managed Docker orchestration and Docker containerization using Kubernetes.
Automate using scripting languages such as PowerShell, Golang, Ruby, Bash, Python or similar.
Perform operation/production support, including incident management and root cause analysis of system and application failures and engineer solutions.
Configured custom dashboards in Dynatrace to provide real-time insights into application health and performance metrics.
Utilized Dynatrace alerts and notifications to proactively identify and resolve performance issues before they impact end-users.
Collaborated with development and operations teams to optimize application performance based on Dynatrace insights.
Experience in google cloud platform (GCP) cloud by provisioning compute engine, cloud load balancing, cloud storage, cloud SQL, stack driver monitoring components using the Terraform GCP Foundation modules
Experienced in setting up and configuring Splunk environments for various use cases.
Deployed Splunk for log aggregation and analysis across all my applications in Dev, UAT and Production environments.
Created and optimized Splunk queries and search strings to extract actionable insights from large datasets.
Experienced in building web pages and developed using JavaScript, shell script.
Defined the functional needs for our ITSM system, ServiceNow and designed teh specific implementation.
Experience in implementation and setting up the tools in high availability. (SVN, GIT, ARTIFACTORY, NEXUS,JENKINS, JIRA).
Managed environments DEV, QA, UAT and PROD for various releases and managed using Blue Green and Canary deployment strategies.
Highly organized, detailed oriented, able to plan, prioritize work and meet deadlines. work well under tight deadlines.
Troubleshooting the application issues using the Dynatrace Tool for finding the Root Cause Analysis of P1/P2 tickets.
Ability to work directly with all levels of Management to gather user requirements.
Experience of Jenkins, Apache Tomcat, JBoss, Subversion, Git, Maven.
Experience on implementation of SonarQube for Continuous static code analysis with CI/CD systems such as Jenkins & build tool Maven.
Monitoring profiles for AWS Services - EC2 Parameters (CPU, Memory, Disk, Response time, etc.)
Technical expertise in facilitating Cloud Infrastructure Management for entire org, Experienced in Amazon Web Services like Amazon EC2, EBS, ELK, ECS, S3, Glacier, RDS, ELB, VPC, Route 53,Cloud trail, Lambda, Code Deploy, Elastic Cache, SNS, SQS, SES, Cloud Formation, Cloud Front, Cloud watch, IAM, Import, Directory Service, Cognito.
In-depth knowledge of IAM security features such as password policies, identity federation, and role-based access control (RBAC), effectively configured and managed via the AWS CLI.
Experience in scripting languages Python and Bash.
Experience in Windows/ Linux Administration (Installation, Configuration and Upgrades of Linux (Red hat, Centos, Ubuntu, Suse).
Strong knowledge of OOAD Object Oriented Analysis and Development OOPS Object Oriented Programming and applying OO principles in full SDLC Software Development Life Cycle and extreme Programming.
Experience in creating, reimaging and cloning datacentres (VMs) on ESXI platform.
Experience on configuration management tool Ansible, Terraform.
Expertise in using Terraform for deploying Cloud Infrastructure in AWS/Azure.
Experience in writing Groovy & bash scripts for automation of build and infrastructure automation.
Experience is using Tomcat, JBOSS and Nginx servers for deployments.
In-depth understanding of the principles and best practices of Software Configuration Management (SCM) processes, which include compiling, packaging, deploying and Application configurations.
Excellent presentation skill and with experience in conducting ITSM functional workshops.
Worked on Container management using Docker by writing Docker files and setting up the
automated build on Docker, installing and configuring Kubernetes.
Experience in configuring and managing the Kubernetes Clusters to Deploy, Scale, Load Balance and manage Containers which includes creation of Pods, Replica Set, Labels, Deployments, Services, Ingress, Config Map, Secret and Health Checks using Liveness Probe, Readiness Probe by writing YAML Scripts.
Excellent communication, interpersonal and managerial skills.
Technical Skills:
Operating System
Windows, Linux, UNIX.
AWS Services
EC2, VPC, IAM, EBS, S3, ELB, API, Auto Scaling, EKS, ECR, AWS CLI, Route53, Lambda, CloudWatch AWS Database Migration Service (DMS), AWS Application Migration Service (AMS), Cloud Formation Templates, ArgoCD
Azure Services
Virtual machines, Storage Unit, VNET, Private endpoints, Azure Data Factory, Azure Databricks, Logic Apps, Azure Synapse Analytics, Azure Update Manger, Microsoft Defender For Cloud
Server
Apache Tomcat, JBoss, Nginx
Issue Tracking
JIRA, Service Now
Database
My SQL, Oracle, DynamoDB
Version Control
GitHub, Bit Bucket
CI Tools
Jenkins, Bamboo, GITLAB
Build Tools
Maven
Repository Tools
Nexus, Artifactory, ACR, ECR
Quality & Test Automation
SonarQube
Cloud
AWS, Azure
CM Tools
Ansible, Terraform
Containers
Docker
Orchestration
Docker swarm, Kubernetes
Monitoring Tool
Splunk, Dynatrace, Grafana, Prometheus, Nagios, AWS Cloud Watch, New relic
Programming & Scripting
Java, Groovy, Bash scripting, Python
Educational Qualification:
Bachelor of Technology in Electronics and Communication Engineering (BTECH), JNTU Kakinada, 2014
Projects Execution
Duration
August -2024 to Till Date
Client
CITI
Location
NEWYORK
Role
SRE DevOps Engineer
Monitoring Tools
Cloud Watch, Grafana & Kibana
Responsibilities:
Managing and supporting production releases with at most precision and delivered value. Maintained GIT workflows for production control. Provisioned servers and deployed playbooks for Linux servers patching. Kubernetes nodes, pods, config-maps, routes, and secrets.
Designed and managed end-to-end CI/CD pipelines in Azure DevOps, integrating GitHub, Jenkins, and SonarQube for automated testing, security, and deployment.
Architected and deployed Azure IaaS and PaaS solutions, including Virtual Machines, Azure Kubernetes Service (AKS), Azure SQL, App Services, and Key Vault.
Defined and implemented SRE principles—SLIs, SLOs, and error budgets—to measure service reliability and guide operational improvements.
Managed production-grade Docker/Kubernetes environments, including Helm-based deployments, ConfigMaps, Secrets, and readiness/liveness probes.
Built resilient, scalable cloud environments using Terraform, automating infrastructure deployment and maintaining version control via Git.
Implemented centralized logging and observability using Azure Monitor, Prometheus, Grafana, and Log Analytics, driving actionable insights and root cause analysis (RCA).
Set up alerting strategies and escalation workflows using Azure Monitor Alerts, Dynatrace, and ServiceNow integrations.
Configured self-hosted agents in Azure DevOps to handle large-scale builds and deployments with custom build dependencies.
Designed and automated incident response workflows, including runbooks and Logic Apps, for high-priority incident remediation.
Configured firewall rules (using iptables/firewalld) and enforced SELinux/AppArmor policies for security compliance across environments.
Implemented disk space monitoring, CPU/memory tracking, and custom alerting using top, htop, iostat, df, du, and integration with Azure Monitor/Prometheus.
Configured and maintained NFS mounts, SSH key-based access, and secure rsync/backup scripts for cross-environment file transfer and archiving.
Automated system patching, backups, and recovery using Azure-native services, including Backup Vaults and Update Management.
Deployed and integrated Logic Apps with Azure Functions, Event Grid, Service Bus, and external systems like SAP, Salesforce, and Dynamics 365.
Led Kubernetes cluster upgrades, node pool scaling, and security hardening initiatives to meet production-readiness standards.
Developed robust Bash and PowerShell scripts for automation, configuration, and monitoring tasks across hybrid environments.
Collaborated with cross-functional teams in Agile/Scrum settings, participating in daily stand-ups, sprint planning, retrospectives, and release cycles.
Experienced in disaster recovery (DR) strategies, including backup validation, environment replication, and failover testing.
Maintained 99.9%+ system uptime across critical applications, coordinating with support, development, and security teams to triage incidents and apply permanent fixes.
Project # 2
OM Soft Solutions UK Limited
Duration
Nov-2022 to April-2024
Client
JPMPC, UK
Location
Central London
Role
Senior SRE DevOps Engineer
Responsibilities:
Designed and provisioned Azure cloud infrastructure using Terraform, managing scalable resources across multiple subscriptions.
Provisioned and managed scalable Azure infrastructure using Terraform, aligning with Infrastructure-as-Code best practices to support multi-environment deployments (DEV, QA, PROD).
Architected and administered Azure Kubernetes Service (AKS) clusters with secure ingress using Application Gateway Ingress Controller (AGIC), achieving high availability and application performance.
Built and maintained reusable CI/CD pipelines using Azure DevOps YAML templates and integrated with Jenkins for legacy compatibility, ensuring smooth deployment workflows across cloud-native and hybrid environments.
Developed core Kubernetes manifests (Deployments, Services, ConfigMaps, Secrets, Ingress) and enforced RBAC and network policies to meet enterprise security and compliance requirements.
Automated Linux server administration tasks including patch management, service monitoring, and performance tuning. Resolved high-priority incidents (P1/P2) involving resource bottlenecks and kernel-level issues.
Created custom build agents using Docker and integrated tools like Terraform, Python SDKs, Helm, kubectl for development and operations teams.
Integrated logging and telemetry using Azure Monitor, Log Analytics, and Application Insights. Implemented automated alerting and proactive diagnostics for improved system observability.
Maintained and secured Azure Container Registry (ACR), managing image lifecycle and deploying hardened containers to AKS.
Leveraged Python scripting for automation tasks including cloud resource cleanup, backup routines, deployment validation, and operational checks.
Supported API integration with third-party services via REST using secure token-based authentication and service principals.
Adhered to cloud security best practices by implementing IAM role-based access, policy enforcement, certificate management, and endpoint protection across all services.
Worked collaboratively in Agile teams using JIRA, Azure Boards, and conducted sprint planning, defect triage, and backlog grooming.
Enforced DR and backup strategies, implemented vulnerability remediation, and maintained compliance posture across cloud environments.
Project # 1
OM Soft Solutions UK Limited
Duration
April-2018 to Nov-2022
Client
Maximus, UK
Location
Reading, UK
Role
DevOps SRE Engineer
DevOps Tools
Git, Docker, Ansible and Jenkins.
Technology
Java Application
Monitoring Tools
Monit, Grafana, Pormethoes, Elasti Search
Responsibilities:
Interacting with partners to capture requirements.
Led the design, deployment, and automation of Azure infrastructure (IaaS/PaaS) using Terraform, enabling scalable and repeatable resource provisioning across environments and subscriptions.
Administered and monitored Azure Kubernetes Service (AKS) clusters, managing node pools, deployments, ConfigMaps, Secrets, Ingress, and Helm-based app rollouts.
Created and managed CI/CD pipelines using Azure DevOps, integrating with GitHub, Jenkins, and SonarQube for secure, quality-controlled deployments to App Services and AKS.
Implemented Application Gateway Ingress Controller (AGIC) for secure traffic routing and SSL termination in AKS workloads.
Designed and optimized Azure Logic Apps and Functions for workflow automation and integration with SAP, Salesforce, Service Bus, and Event Grid.
Built proactive monitoring and alerting frameworks using Azure Monitor, Log Analytics, Prometheus, and Grafana, aligned to SRE best practices (SLIs/SLOs/Error Budgets).
Conducted root cause analysis (RCA) for critical incidents using Dynatrace, Application Insights, and custom telemetry in production environments.
Provisioned and managed Azure Container Registry (ACR), built secure Docker images, and pushed them for consumption in AKS and App Services.
Implemented self-hosted Azure DevOps agents to optimize build/deployment efficiency with custom tools and dependencies.
Managed secure access using Azure Key Vault, RBAC, and Azure AD Integration for authentication and secret management.
Supported disaster recovery (DR) readiness with region-level failover configurations, storage replication, and infrastructure snapshots.
Used JIRA and Azure Boards for sprint planning, backlog grooming, and incident tracking within Agile/Scrum environments.
Hands-on with EC2, VPC, IAM, S3, CloudWatch, CloudTrail, RDS, Route 53 and CloudFormation for parallel AWS-hosted environments.
Applied DevOps principles in hybrid cloud setups, automating resource deployments, and enforcing security best practices using IAM and tagging.
Certifications:
AWS Solution Architect Associate [SAA-CO2]
Terraform Certified Associate 002
Azure Fundamentals [AZ-900]
Azure Administrator Associate [AZ-104]
Azure DevOps Expert [AZ-400]
Kishore Kumar