KIRAN K. SHARMA
Senior Platform & Automation Engineer Cloud AI & GenAI Advocate
Silver Spring, MD 703-***-**** ️ *****.**********@*****.***
linkedin.com/in/kirankumarsharma github.com/kiransharma2016 Blog: talkingwithmachines.blogspot.com
Career Objective
Drive enterprise transformation through the strategic adoption of Generative and Agentic AI, embedding AI agents into development and operational workflows to build autonomous, intelligent, and self healing platforms. Passionate about leveraging AI driven automation to enhance efficiency, resilience, and innovation — with the goal of doubling or tripling operational performance through scalable, intelligent platform design and continuous modernization.
Professional Summary
Accomplished Platform & Automation Engineering Leader with 15+ years of experience modernizing enterprise infrastructure and enabling AI driven operations. Expert in Kubernetes, OpenShift, Terraform, Ansible, and AWS, with a proven record of transforming manual workflows into autonomous, self healing systems through Generative AI and intelligent automation. Recognized for delivering measurable business impact — higher uptime, faster recovery, and lower costs — by integrating DevOps, Cloud, and AI into secure, resilient platforms.
Core Competencies
Cloud & Infrastructure: AWS (EKS, Lambda, CloudWatch), Azure, GCP multi cloud integration
Automation / IaC: Terraform, Ansible, Jenkins, GitLab CI/CD, GitHub Actions
Containerization & Orchestration: Kubernetes (CKA certified), OpenShift, Docker, Podman
Observability & AIOps: Prometheus, Grafana, ELK, Splunk, DataDog, CloudWatch
Generative AI Integration: LLM based ops automation, AI agents for RCA & predictive maintenance
Security & Compliance: DevSecOps, Security+, ITIL 4, FISMA, Prisma Cloud
Leadership & Collaboration: Cross functional mentoring, innovation advocacy, continuous improvement
Professional Experience
Senior Platform Engineer Freddie Mac – McLean, VA
Sept 2019 – Oct 2025
Architected and maintained enterprise-grade Kubernetes (EKS/OpenShift) platforms serving 200+ microservices with 99.9% uptime, supporting mission-critical mortgage operations and ensuring zero-downtime deployments
Pioneered GenAI-powered operational intelligence by integrating LLM-based agents for real-time anomaly detection, automated root cause analysis, and intelligent incident summarization, reducing MTTR by 40% and accelerating problem resolution
Engineered comprehensive infrastructure automation framework covering certificate lifecycle management, resource tagging governance, ELB/security group optimization, and multi-region DR orchestration, eliminating 70% of manual operational overhead
Designed and deployed reusable Infrastructure-as-Code patterns using Terraform and Ansible, establishing security-compliant provisioning standards that reduced deployment time from days to hours while ensuring FISMA and SOX compliance
Built multi-layered observability ecosystem integrating Prometheus, Grafana, DataDog, and CloudWatch with custom alerting rules and dashboards, enabling proactive incident detection and reducing false-positive alerts by 60%
Transformed platform operations through comprehensive documentation, self-service portals, and knowledge base creation, reducing tier-2 support tickets by 35% and empowering development teams with autonomous capabilities
Senior Engineer – Cloud & Platform Automation CSC / CSAR
Jun 2017 – Sep 2019
Led cloud migration initiative for 50+ legacy applications to AWS, executing strategic mix of refactor, replatform, and lift-and-shift approaches that improved scalability, reduced infrastructure costs by 30%, and accelerated time-to-market
Designed and implemented production-grade multi-cloud Kubernetes and OpenShift clusters with standardized CI/CD pipelines, configuration management patterns, and security controls, establishing enterprise container orchestration standards
Developed serverless automation solutions leveraging AWS Lambda, Step Functions, and EventBridge for event-driven workloads including log processing, compliance scanning, and resource optimization, improving operational efficiency by 45%
Created AI-enhanced cost optimization platform using machine learning for workload analysis and rightsizing recommendations, delivering 15% compute cost reduction while maintaining performance SLAs
Established center of excellence through technical workshops and hands-on training on Docker, Kubernetes, Terraform, and AI-driven DevOps practices, upskilling 40+ engineers and accelerating technology adoption
Implemented comprehensive monitoring and alerting framework with Prometheus and Grafana, providing real-time visibility into cluster health, resource utilization, and application performance
Middleware SME – Messaging & Application Hosting CSC – Lanham, MD
Jan 2013 – Jun 2017
Designed and implemented high-availability/disaster recovery architectures for enterprise middleware platforms supporting REST/SOAP APIs, message queuing, and service integration across multiple federal agencies, ensuring 99.95% service availability
Automated end-to-end deployment pipelines and environment provisioning workflows using Python, Bash, and Ansible, reducing deployment time by 70% and eliminating configuration drift across 100+ middleware instances
Administered and optimized enterprise application server fleet including IBM WebSphere, Oracle WebLogic, JBoss EAP, Apache Tomcat, and IBM DataPower gateways, tuning JVM parameters and connection pools for optimal performance under high-load conditions
Developed comprehensive monitoring and alerting automation for IBM MQ message brokers, DataPower XML appliances, and WebSphere Application Server environments, achieving proactive issue detection and 50% reduction in P1 incidents
Spearheaded middleware platform modernization initiatives including containerization proof-of-concepts, API gateway consolidation, and infrastructure standardization, achieving 99.95% SLA compliance and 40% reduction in incident volume
Served as escalation point for critical production issues, performing root cause analysis and implementing preventive measures that improved platform stability and reduced recurring problems by 60%
Earlier Roles (Pre 2013)
Infrastructure / Production Engineer – Verizon Wireless Systems Engineer III – Advance Auto Parts Sr. Systems Administrator – Nordstrom Integration Consultant – McKesson, Electrolux, Discover, Union Bank Systems Director – United Mission to Nepal
Education
M.S. Computer Science – Maharishi International University (2008)
Diploma in Management – Henley Institute of Management (2002)
M.A. Mathematical Economics – Tribhuvan University (1997)
Certifications
Certified Kubernetes Administrator (CKA) – Linux Foundation (2024)
AWS Certified Cloud Practitioner (2023)
CompTIA Security+ (2024)
ITIL 4 Foundation (2023)
DevSecOps Foundation (2023)
Docker Certified Associate (DCA) (2019)
Achievement Highlights
Drove AI powered incident response automation, reducing MTTR 40% and boosting reliability
Spearheaded migration of 50+ apps to AWS, cutting operational overhead and improving deployment speed
Championed AIOps and LLM adoption, mentoring engineers and promoting automation culture
Delivered 99.9% uptime and multi region resilience for mission critical platforms
Future Focus
Advancing next generation enterprise platforms with Agentic AI and self healing architectures — integrating LLMs for real time operational reasoning, predictive automation, and AI driven governance to achieve autonomous cloud operations.