Avinash J
DevOps / SRE
Phone Number: 331-***-****
Email ID: ************@*****.***
LinkedIn: https://www.linkedin.com/in/avinash-janjarla-39a3a4211 PROFESSIONAL SUMMARY
• Experienced DevOps & Site Reliability Engineer with 8+ years of experience designing, automating, and managing multi-cloud infrastructure across AWS, Azure, and GCP, delivering high-availability and secure platforms for enterprise and healthcare systems.
• Skilled in building and optimizing CI/CD pipelines using Jenkins, GitHub Actions, Azure DevOps, and Spinnaker, automating end-to-end delivery lifecycles and improving release efficiency by 30 %.
• Strong hands-on expertise with Infrastructure-as-Code (IaC) using Terraform, Ansible, Azure Bicep, and Packer to standardize provisioning, enforce compliance, and accelerate infrastructure deployment.
• Experienced in deploying and managing Kubernetes clusters (EKS/AKS/GKE) for microservices, implementing GitOps (FluxCD), Service Mesh (Linkerd), and policy-as-code (OPA) for secure, consistent multi-cluster operations.
• Proficient in observability and monitoring using Prometheus, Grafana, ELK, CloudWatch, Azure Monitor, Dynatrace, and GCP Operations Suite, providing actionable insights into performance, cost, and reliability.
• Automated synthetic data pipelines and LLM-driven API validation frameworks leveraging Aurora & Aries databases, reducing manual QA effort by 40 % and improving test scalability.
• Focused on security and DevSecOps using IAM, RBAC, Vault, MFA, encryption, SonarQube, and Trivy, ensuring HIPAA-aligned cloud operations and compliance across environments.
• Skilled in containerization and artifact management with Docker, JFrog Artifactory, Nexus, and version control systems (Git, GitLab, Azure Repos, SVN), driving consistent and traceable release management.
• Experienced in FinOps and cost optimization using CloudHealth, GCP Billing APIs, and Cost Explorer to track budgets, optimize workloads, and enforce spending governance.
• Adept at application performance tuning on Tomcat, JBoss, and Nginx, leveraging Bash, Python, and Groovy scripting for automation, validation, and continuous improvement.
• Collaborative and proactive in SRE practices such as chaos testing, incident response, backup automation, and post-mortem analysis, maintaining 99.9 % uptime and operational resilience.
• Recognized for mentoring teams, promoting an automation-first culture, and aligning DevOps maturity with business outcomes to enhance delivery velocity and reliability. TECHNICAL SKILLS
Cloud Platforms: AWS (EC2, S3, RDS, EKS, IAM, CloudWatch, Lambda), Azure (AKS, AD, API Management, Service Fabric, Cosmos DB, Monitor, Log Analytics), GCP (GKE, Cloud SQL, Compute Engine, Cloud Storage, Cloud Monitoring, IAM)
Infrastructure as Code: Terraform, Ansible, Azure Bicep, ARM Templates, CloudFormation, Packer CI/CD & Automation: Jenkins, GitHub Actions, Azure DevOps, Spinnaker, FluxCD (GitOps), Maven, Gradle, Groovy, Bash, Python, PowerShell
Configuration Management: Ansible, Puppet
Containers & Orchestration: Docker, Kubernetes (EKS, AKS), Helm, Linkerd, ECS, Blue/Green Deployments Monitoring & Observability: Prometheus, Grafana, ELK Stack, CloudWatch, Azure Monitor, GCP Operations Suite (Cloud Monitoring & Logging), Application Insights, Dynatrace, Nagios, OpenTelemetry Logging & Incident Response: ELK, Splunk, Logstash, CloudWatch Logs, Alert Manager, Slack ChatOps Artifact & Version Control: JFrog Artifactory, Nexus, Azure Artifacts, Git, GitHub, GitLab, Azure Repos, SVN Security & Compliance: IAM, RBAC, Vault, MFA, SSL/TLS, OPA, SonarQube, Trivy, HIPAA, DevSecOps Networking & Web Servers: VPC, VPN, Route 53, Load Balancers, Nginx, Apache, JBoss, Tomcat, WebSphere Databases & Pipelines: Aurora, Aries, RDS, MySQL, PostgreSQL, Cosmos DB, Cloud SQL, Synthetic Data Pipelines (LLM Testing)
FinOps & Reliability: CloudHealth, GCP Billing APIs, Cost Explorer, Autoscaling, Rightsizing, Backup & Recovery, Chaos Testing
Operating Systems: Linux (RHEL, Ubuntu), Unix, Windows Collaboration Tools: Jira, Confluence, ServiceNow, Slack PROFESSIONAL EXPERIENCE
Client: AZTRA - Columbus, OH August 2024 - Present Role: DevOps Engineer / Site Reliability Engineer (SRE)
• Designed and maintained end-to-end CI/CD pipelines using Terraform, GitHub Actions, and Jenkins, improving deployment consistency and reducing release cycles by ~30 %.
• Automated provisioning and scaling of Aurora & Aries databases on AWS to support high-volume synthetic data pipelines used for LLM-based API testing.
• Managed Kubernetes (EKS) environments for microservices, optimizing autoscaling policies and cluster performance during intensive QA workloads.
• Built observability dashboards with Prometheus, Grafana, and CloudWatch to monitor data-flow latency, API health, and resource utilization in real time.
• Partnered with development and QA teams to integrate LLM-driven validation models, cutting manual test efforts by 40 % and accelerating defect detection.
• Strengthened platform security by implementing IAM, RBAC, and Vault-based secret management, ensuring HIPAA-compliant infrastructure operations.
• Deployed centralized logging (ELK stack) and incident-response automation that reduced mean-time-to- resolution for test failures and system issues.
• Championed resilience and SRE best practices through chaos testing, backups, and proactive monitoring, helping sustain 99.9 % platform uptime.
Client: Kroger - Dallas, TX Oct 2023 to July 2024
Role: DevOps / Cloud Engineer
• Migrated services from on-prem Kubernetes to Azure AKS and Google Kubernetes Engine (GKE), implementing FluxCD GitOps pipelines for automated multi-cloud deployments.
• Designed and implemented Azure AD, API Management, Cosmos DB, and GCP Cloud SQL workloads using Terraform and Azure Bicep, ensuring infrastructure consistency across environments.
• Integrated Ansible + Jenkins for cross-cloud provisioning, patching, and configuration automation, improving infrastructure scalability and reducing manual interventions.
• Built reusable Terraform modules for AKS, GKE, VNet/Cloud VPC peering, and CosmosDB/Cloud SQL replication.
• Created multi-cloud CI/CD pipelines using GitHub Actions, Azure DevOps, and Spinnaker, integrating Policy-as-Code (OPA) for governance and compliance checks.
• Deployed Service Mesh (Linkerd) for secure microservice communication across Azure and GCP clusters.
• Configured Azure Monitor, GCP Cloud Monitoring, and OpenTelemetry for unified observability, with Application Insights dashboards for latency and resource optimization.
• Deployed custom golden images using Packer + Terraform for pre-configured hybrid environments.
• Automated identity and access provisioning with Azure AD RBAC and GCP IAM, strengthening compliance and access governance.
• Implemented CloudHealth + GCP Billing APIs to consolidate FinOps KPIs and monitor cost optimization across both clouds.
Tata Consultancy Services (TCS) - Hyderabad, India April 2022 - July 2023 Role: AWS Developer / DevOps Engineer
• Built and optimized CI/CD pipelines using Jenkins and Ansible to streamline app deployments across staging and production.
• Automated provisioning of EC2, RDS, and networking components using Ansible and AWS CLI, reducing setup time by 60%.
• Configured and deployed enterprise applications on Apache Tomcat and IBM WebSphere servers.
• Collaborated with security and compliance teams to implement IAM roles, policies, and VPC network segmentation.
• Integrated ELK Stack and AWS CloudWatch for real-time log analytics and system observability.
• Led environment readiness for quarterly releases, ensuring zero downtime and rollback strategy testing.
• Authored technical runbooks, improving team onboarding and reducing support handoffs.
• Created Ansible playbooks to automate configuration drift detection and correction across environments.
• Acted as primary deployment engineer during UAT and production cutovers.
• Contributed to cost optimization by analyzing CloudWatch metrics and rightsizing EC2 instances. Amazon Development Centre - Hyderabad, India Oct 2018 – April 2022 Role: AWS Engineer
• Developed scalable Jenkins pipelines integrated with Maven and Git for e-banking applications.
• Containerized microservices using Docker and deployed them on Kubernetes clusters (EKS).
• Designed reusable Ansible roles for provisioning consistent environments across dev/test/prod.
• Standardized image creation using Dockerfiles and implemented image scanning via Trivy.
• Established central logging using ELK for over 20 applications, integrated alerting via Logstash filters.
• Worked closely with QA to automate nightly regression suites within CI workflows.
• Used Git and SVN for source control management, helped migrate legacy apps from SVN to Git.
• Conducted cloud resource hardening and enabled MFA, encryption, and IAM roles for DevSecOps compliance.
• Participated in release planning, workload balancing, and environment refreshes during sprint cycles.
• Mentored two junior DevOps engineers on Kubernetes resource design and pipeline development. Kapil IT Solutions - Hyderabad, India May 2017 – Sep 2018 Role: Build and Release Engineer
• Automated CI/CD pipelines using Jenkins, Maven, Gradle, and Git to accelerate build and deployment processes.
• Migrated legacy builds from ANT to Maven to enable modular dependency management and consistent artifact generation.
• Managed binary repositories using JFrog Artifactory and Nexus for artifact version control and integrity verification.
• Automated server provisioning with Puppet and Ansible, integrated Vault for secure secrets rotation across environments.
• Deployed Dockerized applications on ECS and Kubernetes, implementing Blue/Green deployment strategies for zero downtime releases.
• Configured log aggregation and monitoring using Splunk, Nagios, and Grafana to enable proactive system health tracking.
• Developed automation scripts in Groovy and Bash to streamline packaging, validation, and deployment workflows.
• Administered Git branching strategies, release tagging, and versioning to maintain consistency across environments.
• Integrated SonarQube and Trivy into build pipelines to ensure code quality, vulnerability detection, and image integrity.
• Collaborated with infrastructure teams to fine-tune Tomcat, JBoss, and Nginx configurations for optimized application performance.
• Configured Slack ChatOps integrations to provide real-time pipeline alerts and deployment notifications.
• Monitored CI/CD pipelines using Dynatrace AIOps to identify performance degradation and trigger early issue resolution.
EDUCATION
• Masters in Computer Information Technology Elmhurst University – May 2025
• Bachelor of Technology (B.Tech) Jawaharlal Nehru Technological University Hyderabad – 2017