Post Job Free
Sign in

Senior DevOps & SRE Engineer /n

Location:
Fremont, CA, 94538
Salary:
80000
Posted:
December 15, 2025

Contact this candidate

Resume:

Sashank Sainath Reddy Allugunti

****************@*****.*** 908-***-**** LinkedIn

SUMMARY

DevOps & Site Reliability Engineer with 3.5+ years of experience at Greenhouse Software and IBM, specializing in cloud automation, CI/CD pipelines, and containerized deployments using AWS and Kubernetes. Experienced in building resilient infrastructure, streamlining monitoring with Prometheus, Grafana, CloudWatch, and ELK, and enhancing system reliability in high-availability environments. Skilled in Terraform, Ansible, Docker, and cross-functional collaboration to deliver scalable and efficient solutions. PROFESSIONAL EXPERIENCE

DevOps Engineer, Greenhouse Software 08/2024 – Present NY, USA

•Automated multi-environment deployments using Jenkins and GitHub Actions, reducing deployment errors by 5% and decreasing average release time from six hours to two hours per cycle.

•Containerized 20+ legacy and microservices applications with Docker and orchestrated them on Kubernetes, improving uptime from 92% to 99.5% across multiple production clusters.

•Implemented Terraform and Ansible pipelines on AWS (Lambda, EC2, S3, VPC, IAM, API Gateway), reducing infrastructure provisioning time from three days to three hours.

•Configured Prometheus, Grafana, and CloudWatch dashboards for real-time monitoring, reducing incident response time by 60% and preventing production outages for critical customer-facing services.

•Collaborated with development teams to optimize rollback processes and troubleshoot production issues, improving recovery time objective (RTO) from two hours to thirty minutes during high-severity incidents. DevOps Engineer, IBM 01/2022 – 07/2023 India

•Built automated infrastructure using AWS CloudFormation and Terraform, supporting 15+ high-availability applications with 99.9% uptime and reducing configuration errors by 0%.

•Configured ELK Stack (Elasticsearch, Logstash, Kibana) for centralized logging, accelerating root-cause analysis by 50% and reducing mean time to resolution (MTTR) for critical incidents.

•Conducted API testing and performance benchmarking with Postman and JMeter, identifying bottlenecks and improving average response times from 900ms to 350ms across key endpoints.

•Standardized Git and Bitbucket workflows, reducing merge conflicts by 65% and improving team delivery speed on multiple parallel development streams by 35%. Site Reliability Engineer, IBM 05/2021 – 12/2021 India

•Automated system monitoring using AWS CloudWatch and Python/Bash scripts, reducing on-call alert fatigue by 0% and decreasing incident detection time from 5 minutes to 15 minutes.

•Developed incident response playbooks in Confluence, cutting mean time to acknowledge (MTTA) from 30 minutes to under 10 minutes during production system failures.

•Deployed fixes and patches using Ansible, reducing manual intervention by 80% and improving overall system uptime from 9% to 99% during peak production cycles.

•Analyzed recurring production errors through Kibana logs, resolving six major issues that reduced incident recurrence by 70% over a three-month period.

EDUCATION

Master in Computer Science, Pace University, Seidenberg School of Computer Science and Information Systems 06/2025 New York, NY

Bachelor in Computer Science,

BMS Institute of Technology and Management

06/2022 Bangalore, India

TECHNICAL SKILLS

Cloud Platforms & Infrastructure

AWS (Lambda, API Gateway, SES, DynamoDB, EC2,

S3, VPC, IAM), Microsoft Azure

CI/CD & Automation

Jenkins, GitHub Actions, GitLab CI, Terraform,

Ansible, AWS CloudFormation

Configuration & Version Control

Git, GitHub, Bitbucket, SVN

Web & Application Development

React, .NET, Django, Flask, HTML, CSS

Collaboration & Agile Tools

Jira, Confluence, GitHub Copilot

Containerization & Orchestration

Docker, Kubernetes, OpenShift

Monitoring & Logging

AWS CloudWatch, Prometheus, Grafana, ELK Stack

(Elasticsearch, Logstash, Kibana)

Scripting & Programming

Python, Java, C++, C#, JavaScript, SQL, Bash/Shell Scripting, PowerShell

Testing & API Tools

Postman, Selenium, JMeter

Productivity Tools

MS Office Suite (Word, Excel, PowerPoint, Outlook), Google Suite (Docs, Sheets, Drive, Gmail)

PROJECTS

Bug Tracking System, [.NET, React]

•Built a bug tracking system using .NET, React, and SQL Server, enabling 100+ users to log issues, with admins assigning tasks and employees resolving them through role-based workflows.

•Designed MVC-based architecture with responsive Razor views, improving user experience by 35% and ensuring secure CRUD operations for bug management while scaling to support multiple teams in real-time. Wi-Fi Controlled Car, [Python, Arduino]

•Engineered a Wi-Fi-controlled car using Arduino, Python, and ESP8266, achieving 95% response accuracy for directional commands and enabling real-time remote operation over a wireless network.

•Integrated L298N motor driver and battery system, delivering 6+ hours of reliable performance while supporting forward, backward, left, and right commands with smooth maneuverability across multiple test environments.

CERTIFICATIONS

Microsoft Certified: Azure Data

Scientist Associate

Python Programming

Certification

Initiating and Planning Projects

Certification



Contact this candidate