Post Job Free
Sign in

Senior Platform & AI Automation Leader

Location:
Silver Spring, MD
Salary:
250000
Posted:
December 08, 2025

Contact this candidate

Resume:

KIRAN K. SHARMA

Senior Platform & Automation Engineer Cloud AI & GenAI Advocate

Silver Spring, MD 703-***-**** ️ *****.**********@*****.***

linkedin.com/in/kirankumarsharma github.com/kiransharma2016 Blog: talkingwithmachines.blogspot.com

Career Objective

Drive enterprise transformation through the strategic adoption of Generative and Agentic AI, embedding AI agents into development and operational workflows to build autonomous, intelligent, and self healing platforms. Passionate about leveraging AI driven automation to enhance efficiency, resilience, and innovation — with the goal of doubling or tripling operational performance through scalable, intelligent platform design and continuous modernization.

Professional Summary

Accomplished Platform & Automation Engineering Leader with 15+ years of experience modernizing enterprise infrastructure and enabling AI driven operations. Expert in Kubernetes, OpenShift, Terraform, Ansible, and AWS, with a proven record of transforming manual workflows into autonomous, self healing systems through Generative AI and intelligent automation. Recognized for delivering measurable business impact — higher uptime, faster recovery, and lower costs — by integrating DevOps, Cloud, and AI into secure, resilient platforms.

Core Competencies

Cloud & Infrastructure: AWS (EKS, Lambda, CloudWatch), Azure, GCP multi cloud integration

Automation / IaC: Terraform, Ansible, Jenkins, GitLab CI/CD, GitHub Actions

Containerization & Orchestration: Kubernetes (CKA certified), OpenShift, Docker, Podman

Observability & AIOps: Prometheus, Grafana, ELK, Splunk, DataDog, CloudWatch

Generative AI Integration: LLM based ops automation, AI agents for RCA & predictive maintenance

Security & Compliance: DevSecOps, Security+, ITIL 4, FISMA, Prisma Cloud

Leadership & Collaboration: Cross functional mentoring, innovation advocacy, continuous improvement

Professional Experience

Senior Platform Engineer Freddie Mac – McLean, VA

Sept 2019 – Oct 2025

Architected and maintained enterprise-grade Kubernetes (EKS/OpenShift) platforms serving 200+ microservices with 99.9% uptime, supporting mission-critical mortgage operations and ensuring zero-downtime deployments

Pioneered GenAI-powered operational intelligence by integrating LLM-based agents for real-time anomaly detection, automated root cause analysis, and intelligent incident summarization, reducing MTTR by 40% and accelerating problem resolution

Engineered comprehensive infrastructure automation framework covering certificate lifecycle management, resource tagging governance, ELB/security group optimization, and multi-region DR orchestration, eliminating 70% of manual operational overhead

Designed and deployed reusable Infrastructure-as-Code patterns using Terraform and Ansible, establishing security-compliant provisioning standards that reduced deployment time from days to hours while ensuring FISMA and SOX compliance

Built multi-layered observability ecosystem integrating Prometheus, Grafana, DataDog, and CloudWatch with custom alerting rules and dashboards, enabling proactive incident detection and reducing false-positive alerts by 60%

Transformed platform operations through comprehensive documentation, self-service portals, and knowledge base creation, reducing tier-2 support tickets by 35% and empowering development teams with autonomous capabilities

Senior Engineer – Cloud & Platform Automation CSC / CSAR

Jun 2017 – Sep 2019

Led cloud migration initiative for 50+ legacy applications to AWS, executing strategic mix of refactor, replatform, and lift-and-shift approaches that improved scalability, reduced infrastructure costs by 30%, and accelerated time-to-market

Designed and implemented production-grade multi-cloud Kubernetes and OpenShift clusters with standardized CI/CD pipelines, configuration management patterns, and security controls, establishing enterprise container orchestration standards

Developed serverless automation solutions leveraging AWS Lambda, Step Functions, and EventBridge for event-driven workloads including log processing, compliance scanning, and resource optimization, improving operational efficiency by 45%

Created AI-enhanced cost optimization platform using machine learning for workload analysis and rightsizing recommendations, delivering 15% compute cost reduction while maintaining performance SLAs

Established center of excellence through technical workshops and hands-on training on Docker, Kubernetes, Terraform, and AI-driven DevOps practices, upskilling 40+ engineers and accelerating technology adoption

Implemented comprehensive monitoring and alerting framework with Prometheus and Grafana, providing real-time visibility into cluster health, resource utilization, and application performance

Middleware SME – Messaging & Application Hosting CSC – Lanham, MD

Jan 2013 – Jun 2017

Designed and implemented high-availability/disaster recovery architectures for enterprise middleware platforms supporting REST/SOAP APIs, message queuing, and service integration across multiple federal agencies, ensuring 99.95% service availability

Automated end-to-end deployment pipelines and environment provisioning workflows using Python, Bash, and Ansible, reducing deployment time by 70% and eliminating configuration drift across 100+ middleware instances

Administered and optimized enterprise application server fleet including IBM WebSphere, Oracle WebLogic, JBoss EAP, Apache Tomcat, and IBM DataPower gateways, tuning JVM parameters and connection pools for optimal performance under high-load conditions

Developed comprehensive monitoring and alerting automation for IBM MQ message brokers, DataPower XML appliances, and WebSphere Application Server environments, achieving proactive issue detection and 50% reduction in P1 incidents

Spearheaded middleware platform modernization initiatives including containerization proof-of-concepts, API gateway consolidation, and infrastructure standardization, achieving 99.95% SLA compliance and 40% reduction in incident volume

Served as escalation point for critical production issues, performing root cause analysis and implementing preventive measures that improved platform stability and reduced recurring problems by 60%

Earlier Roles (Pre 2013)

Infrastructure / Production Engineer – Verizon Wireless Systems Engineer III – Advance Auto Parts Sr. Systems Administrator – Nordstrom Integration Consultant – McKesson, Electrolux, Discover, Union Bank Systems Director – United Mission to Nepal

Education

M.S. Computer Science – Maharishi International University (2008)

Diploma in Management – Henley Institute of Management (2002)

M.A. Mathematical Economics – Tribhuvan University (1997)

Certifications

Certified Kubernetes Administrator (CKA) – Linux Foundation (2024)

AWS Certified Cloud Practitioner (2023)

CompTIA Security+ (2024)

ITIL 4 Foundation (2023)

DevSecOps Foundation (2023)

Docker Certified Associate (DCA) (2019)

Achievement Highlights

Drove AI powered incident response automation, reducing MTTR 40% and boosting reliability

Spearheaded migration of 50+ apps to AWS, cutting operational overhead and improving deployment speed

Championed AIOps and LLM adoption, mentoring engineers and promoting automation culture

Delivered 99.9% uptime and multi region resilience for mission critical platforms

Future Focus

Advancing next generation enterprise platforms with Agentic AI and self healing architectures — integrating LLMs for real time operational reasoning, predictive automation, and AI driven governance to achieve autonomous cloud operations.



Contact this candidate