NAVANEETHA CHINTHAKUNTA
+1-470-***-**** Alpharetta,GA
*************@*****.*** https://www.linkedin.com/in/navaneethach/ OBJECTIVE
Senior DevOps / Site Reliability Engineer with a professional with 12+ years of experience in infrastruc- ture such as code (IaC), CI/CD pipelines, monitoring, and security best practices to ensure high availability and system reliability. Strong expertise in GCP,AWS cloud platforms, GKE container orchestration and performance optimization.
CERTIFICATIONS
AZ-305: Designing Microsoft Azure Infrastructure Solutions. AWS Certified SysOps Engineer
TECHNICAL SKILLS
Cloud Platforms GCP, AWS, Azure
Infrastructure as Code (IaC) Terraform, CloudFormation, Ansible Containerization and Orchestration Docker, Kubernetes, Helm CI/CD Tools Jenkins, GitHub Actions, GitLab CI/CD, ArgoCD Monitoring and Logging Dynatrace, GCM, Prometheus, Grafana, ELK Stack, Datadog Version Control Git, GitHub, GitLab
Scripting and Automation Python, Bash, YAML, yml
EXPERIENCE
Senior Site Reliability Engineer Jan 2024 - till date Client Name - MACY’s Alpherreta, GA
• Containerized Application Management: Successfully implemented a scalable multi-cluster GKE deploy- ment, reducing operational costs by 30%.
• Deployed and managed containerized applications on Google Kubernetes Engine (GKE) to ensure high avail- ability and scalability.
• Configured and automated GKE cluster operations, including upgrades, monitoring, and logging, using Ter- raform and Helm. Worked on GKE cluster multi-tenancy to optimize resource utilization across projects.
• Developed Terraform modules for infrastructure provisioning, improving deployment speed by 50%.
• CI/CD Pipelines and Deployment: Automated CI/CD workflows, accelerating deployment cycles by 50%. Deployed applications using Helm Charts and automated redeployment using Cloud Functions and Cloud Run.
• Automated Rollbacks and Syncing: Configured automated rollbacks, progressive deployments (blue-green, canary), and self-healing features with ArgoCD.
• Deployed Jenkins on Kubernetes with dynamic Jenkins agents using Docker and Kubernetes Pod Templates.
• Real-Time Data Streaming: Designed a fault-tolerant Pub/Sub system processing millions of events per second with 99.99% uptime.
• Configured Pub/Subtopics and subscriptions to enable seamless data ingestion and message distribution across distributed systems.
• Participated in 24/7 on-call rotation, leading the resolution of critical incidents and ensuring rapid service recovery, resulting in a 30% reduction in Mean Time to Restore (MTTR). Was part of holiday support working with 70% of incidence and change requests for Dec 2024.
• Application Security and Reliability: Ensured application security by managing the K8 Role-based Access control (RBAC), network policies and secrets in GKE. Worked on cluster autoscaling (horizontal and vertical) and node auto-provisioning to optimize resource allocation. Created node taints and tolerations to control workload deployments, ensuring efficient use of cluster resources.
• Self-Healing Pipelines and Auto-Scaling: Designed self-healing Jenkins agents using Kubernetes and Docker to dynamically provision and scale CI/CD workloads.
• Environment Management: Configured and optimized deployment settings for the production, performance, deployment and QA environments. Such as set minimum and maximum pod counts for workload deployments.
• Built automated SLO monitoring pipelines using GCP Cloud Monitoring, Prometheus, and Alertmanager, re- ducing false positives by 20%.
• Conducted SLO retrospectives and adjustments, optimizing service thresholds based on real-world usage patterns and business impact.
• Established progressive rollouts with canary analysis to maintain SLO targets while deploying new services.
• Implemented blameless postmortems, driving continuous improvement in incident management and root cause analysis (RCA).
• Built auto-scaling policies using GCP Autoscaler and Cloud Functions, optimizing cloud costs by 35% while maintaining SLO adherence.
Lead DevOps Engineer Jan 2024 - July 2022
Client Name - Equifax Alpherreta, GA
• Demonstrated proficiency in Agile development methodologies and CI/CD principles within Java environ- ments, fostering iterative development cycles.
• Skilled in configuring and deploying instances across GCP environments and Datacenters, with expertise in Compute, Kubernetes Engine, Stack driver Monitoring, Elastic Search, and security group man- agement.
• Facilitate daily Scrum meetings, remove obstacles, protect the team from distractions, and ensure adherence to Scrum processes and practices.
• Coach the team on Agile principles, help the Product Owner refine the backlog, and facilitate sprint planning, review, and retrospectives.
• Proficiently maintained user accounts (IAM), Cloud SQL, Cloud DNS, VPC, RDB, Cloud Datastore, Cloud Bigtable, SQS, and Cloud Pub/Sub services within the Google Cloud Platform.
• Enforced compliance by integrating security scans (SAST/DAST), automated vulnerability assessments, and compliance audits into Jenkins workflows.
• Designed and configured a virtual data center within the Google Cloud Platform to support Enterprise Data Warehouse hosting, encompassing Virtual Private Cloud (VPC), Public and Private Subnets, Security Groups, Route Tables, and Google Cloud Load Balancing.
• Proficient in managing GCP resources using tools like Terraform and Cloud Deployment Manager. Implemented a robust Continuous Delivery pipeline utilizing Docker, Jenkins, GitHub, and GCP, where Jenkins, the Contin- uous Integration server, automatically initiates building a new Docker container whenever a new GitHub branch is created.
• Implemented CI/CD methodologies utilizing Jenkins pipeline and Helm for efficient Docker containerization and Kubernetes orchestration.
• Automated task scheduling in GKE Cluster through the configuration of Cronjobs using Helm charts, enhancing operational efficiency.
• Monitoring and Observability: Integrated Jenkins with Prometheus, Grafana, and ELK Stack for real-time build monitoring and performance tracking.
• Security and Compliance: Enforced RBAC, Pod Security Policies, Network Policies, and secrets management
(Vault/KMS) for secure Kubernetes environments.
• Cluster Autoscaling and Cost Optimization: Configured Cluster Autoscaler, Horizontal and Vertical Pod Autoscaling (HPA/VPA) to optimize cost and resource utilization.
• Service Mesh and Networking: Deployed Istio/Linkerd for service discovery, observability, and traffic man- agement in Kubernetes.
• Monitoring and Logging: Integrated Prometheus, Grafana, ELK Stack, and Loki for real-time Kubernetes cluster monitoring and logging.
• Storage and Stateful Workloads: Managed Persistent Volumes (PV), CSI Drivers, and StatefulSets for databases and critical workloads on Kubernetes.
AWS DevOps Engineer Oct 2017 - Dec 2020
INFOSYS Bengaluru, India
• Implemented AWS solutions using EC2, EKS, S3, RDS, ELK, EMR, Elastic-Cache, Kinesis, EBS, Elastic Load Balancer, DynamoDB, Lambda, API Gateway, Redshift, RDS, Route53, Cloud For- mation, Cloud Foundry, Auto Scaling groups. Strengthening security by implementing and maintaining Network Address Translation in the company’s network.
• Worked with Terraform to create stacks in AWS from scratch and regularly updated the Terraform per the irregular nation’s requirements.
• Provisioned the high availability EC2 Instances using Terraform and cloud formation and wrote new plugins to support new functionality in Terraform.
• 24/7 production support for the application. Provide production support for clients in the incident and issue resolution overnight using monitoring and dashboard platforms such as Grafana and Prometheus.
• Understand Kubernetes’ authentication model, create users and groups in Kubernetes, and use role-based access control for authorization. Configured network policies to control pod communication.
• Used pod and container security contexts to harden our PROD, DEV, QA and UAT environments. Securely store sensitive information using Kubernetes secrets
• Deployed and managed containerized applications on Google Kubernetes Engine (GKE) to ensure high avail- ability and scalability.
• Promoted collaboration, communication, and Agile adoption while fostering a self-organizing, cross-functional team environment.
• Identified and addressed impediments, monitor team progress, and continuously improve processes and practices. System Engineer – Build and Release Oct 2016 - Oct 2017 Thrikasa Technologies Hyderabad, India
• Extensive working experience with different SDLC methodologies such as Agile and Waterfall, with an ability to be creative and to take self-initiatives to execute/manage multiple projects in parallel during time-critical situations.
• Managed deployment automation and implement a Continuous Delivery framework for new projects using Jenk- ins, Ansible, Code Deploy, Docker, and Git.
• Utilized Kubernetes and Docker for the runtime environment of the CI/CD system to build, test deploy. Launched Docker containers on EC2 instances, managing them in Kubernetes and Docker.
• Experienced in Docker and Ansible to build automation pipelines and Continuous Deployment for code using Jenkins and wrote Ansible Playbooks to automate Ansible Servers using YAML scripting.
• Used Bash scripting for automating network configurations, firewall rules, and VPN setup.
• Created Bash scripts to monitor system health, disk usage, and process management with ps, awk, sed, and grep.
• Automated deployment rollbacks and version control processes using Git hooks and Bash scripting. AWS Cloud Engineer Mar 2012 - Sep 2016
Tech Mahindra Hyderabad, India
• Designed and implemented serverless functions in AWS Lambda using Python for event-driven automation.
• Utilized Docker for the runtime environment of the CI/CD system to build, test deploy. Launched Docker containers on EC2 instances, managing them in Docker.
• Responsible for orchestrating CI/CD processes by responding to GITHUB Git triggers, human input and dependency chains, and environment setup and deployed CI/CD Pipelines.
• Used Docker Swarm for automating deployment, scaling, and operations of application containers across clusters of hosts, providing container-centric infrastructure.
• Designed and implemented CI (Continuous integration) system: Configuring Jenkins servers, and Jenkins nodes, creating required scripts (Python), and creating/configuring in VMs (Windows/Linux).
• Automatic build and deployment through Jenkins and and deployment tools using image or version created by Jenkins.
• Migrated applications to Dev, QA, UAT, and Production environments.
• Optimized Bash scripts for better performance, error handling, and debugging with set -e, -x, and trap signals.
• Created Bash scripts to manage CI/CD pipelines Integrated Bash scripts with cloud CLI tools (AWS CLI, gcloud, az) for infrastructure automation.
• Developed shell scripts for user management, SSH key rotation, and security hardening.
• Installed configured and Managed Monitoring tools such as Splunk and Nagios for Resource Monitoring/Network monitoring/Log Trace Monitoring.
PROJECTS
MACY’s PFL and IVS projects. Built the stable infrastructure using Google cloud platform for the Supply Chain Management to search and identify the PFL Preferred Fulfillment Locations and IVS Inventory Management System to automate the UPC- Unique code assigned to products. The aim was to drastically reduce the cost of the inventory stock and the shipping time.
K8s Deployment Manager Developed an open source tool to automate Kubernetes deployments using Helm and ArgoCD. This tool simplifies deployment processes by integrating Helm charts and ArgoCD workflows, ensuring consistent and repeatable Kubernetes application rollouts. It includes advanced features like rollback strategies, automated health checks, and multi-cluster support. The project has been widely adopted and contributed to by the open-source community.
Terraform AWS Modules. Maintained reusable Terraform modules for VPC, EC2, and RDS provisioning. These modules streamline infrastructure deployment by enforcing best-practices and modularizing components for scal- ability. Contributed features like automated security group configurations, customizable IAM roles, and cost optimization strategies. These modules have been leveraged in enterprise-grade cloud environments, significantly improving deployment efficiency.
EDUCATION
Master of Computer Science, New England College, NH, USA 2022 Bachelor of Electronics and Communication Engineering, JNTU - HYD, INDIA 2008 - 2012