Post Job Free
Sign in

Cloud & Site Reliability Engineer in Training

Location:
Los Angeles, CA
Posted:
December 11, 2025

Contact this candidate

Resume:

Samruddhi More

Los Angeles, CA (Totally down for relocation) +1-213-***-**** ****************@*****.*** EDUCATION

University of Southern California, Los Angeles Jan 2024 - Dec 2025 Masters, Computer Science Coursework: Analysis of Algorithms, Databases, Advanced Data Stores, Information Retrieval, Security Systems Walchand Institute of Technology, Solapur, India Aug 2017 - Sep 2021 Bachelors, Information Technology Coursework: Object Oriented Programming, Algorithms, Data Structures, Computer Architecture, Operating Systems, Computer Networks

WORK EXPERIENCE

Oracle May 2025 - Aug 2025

Site Reliability Intern Redwood Shores, CA

•Designed, tested, and deployed Go-based microservices that ingested Prometheus data for server telemetry analysis, accelerating incident triage pipelines across Kubernetes-hosted workloads by over 35%.

•Developed interactive Grafana dashboards for real-time visibility into Kubernetes cluster health and container metrics, which enhanced observability and reduced detection time of anomalies by approximately 40%.

•Built a lightweight AI-assisted prototype that integrated OCTO tooling with Kubernetes and Prometheus alerts to recommend optimized server sizing strategies based on historical usage and failure patterns.

•Delivered SRE on-call support by resolving urgent issues across OCI production services through log correlation, service health checks, Python automation scripts, and incident coordination with global teams. Tata Consultancy Services Jun 2021 - Dec 2023

Systems Engineer Pune, India

•Managed and documented over 300 AWS EC2 instances, automated WebLogic health checks and restart operations using custom Bash scripts, provisioned infrastructure with Terraform, and containerized applications with Docker to streamline patching workflows across production environments.

•Led on-call incident response for customer-facing services by analyzing system logs, integrating Prometheus and Grafana alerts, mitigating performance issues, initiating war rooms, and coordinating rapid resolution with application and infrastructure teams.

•Conducted weekly and monthly deployments using Jenkins pipelines while serving as the main customer liaison during recurring standups, communicating progress updates, outage reports, and root cause summaries to enterprise clients.

•Suggested and implemented infrastructure automation ideas, authored detailed SOPs for patching, deployment, and escalation handling, and contributed to improving release stability and team onboarding time. PROJECT WORK

Three-Tier Web App with Kubernetes

•Built a modular three-tier application with a React frontend, Node.js backend, and MongoDB database, enabling seamless data flow and component isolation using Docker containers.

•Deployed the full-stack app on Kubernetes with custom YAML configurations, managing service discovery, networking, and scaling of individual pods across the cluster.

•Improved deployment efficiency by 60% through containerized workflows and Kubernetes orchestration, while ensuring system observability with integrated logging and monitoring. Python Calculator CI/CD Pipeline with GitOps Deployment on AWS EKS

•Built and deployed a FastAPI-based calculator application with a Jinja2 UI and PostgreSQL backend, implementing full CI automation in Jenkins with virtualenv builds, unit testing, SonarQube static analysis, and Trivy image scanning to ensure secure, high-quality releases.

•Containerized the application using Docker and implemented automated Docker Hub versioning for reproducible builds, enabling rapid promotion of tested artifacts across environments with zero manual intervention.

•Developed a GitOps-driven CD workflow using ArgoCD and Kubernetes, where Jenkins updated deployment manifests and ArgoCD synchronized changes into an EKS cluster, delivering fully automated rollouts, observable deployments, and consistent release integrity.

TECHNICAL SKILLS

•Programming Languages: Python, Go, Bash scripting, JavaScript, TypeScript, SQL, C++

•DevOps & SRE Tools: Kubernetes, Prometheus, Grafana, Docker, Jenkins, Ansible, OCTO, AWS (EC2, S3, IAM), Helm, Git, CI/CD Pipelines, Cloud Logging, Cloud Monitoring, Alerting Systems, Infrastructure Scripting, Custom Alerting Rules, Server Patching Automation, WebLogic Health Checks, Apache Logs, Infrastructure as code, Terraform, Linux

•Web Development: HTML5, CSS3, React.js, Node.js, Express.js, REST APIs, Firebase, MongoDB, Next.js

•Version Control & Collaboration: GitHub, Bitbucket, Confluence, JIRA, Agile/Scrum



Contact this candidate