SHAILJA ROY
Sunnyvale, CA 408-***-**** *******.***@****.*** LinkedIn: Shailja Roy
TECHNICAL SKILLS
Programming Languages: C++, Python, Bash/Shell Scripting DevOps: Jenkins (CI/CD), Docker, Kubernetes, Ansible, Git, GitHub, Terraform (Infrastructure as Code) Cloud Platforms: Amazon Web Services (AWS) - EC2, S3, RDS, VPC, CloudFormation Databases: MySQL, Oracle, SQL
Monitoring/Networking: Prometheus, Grafana, Network Security (VPN, Firewalls, TCP/IP) Machine Learning Tools: Scikit-Learn, TensorFlow
Operating Systems: Linux
PROFESSIONAL EXPERIENCE
DevOps Engineer Amdocs Pune, India 10/2021 - 11/2022
• Improved product stability by 90% through infrastructure planning and optimization to ensure high reliability and uptime.
• Designed, deployed, and maintained Kubernetes clusters to orchestrate containerized applications, ensuring scalable and resilient infrastructure.
• Managed OpenShift clusters and implemented Helm charts for streamlined deployment, version control, and lifecycle management of microservices.
• Utilized Docker to containerize applications and integrated Kubernetes with Jenkins CI/CD pipelines to automate builds, testing, and deployment workflows—reducing deployment time by 30%.
• Automated system workflows with Python and Bash scripts, enhancing team efficiency and minimizing manual intervention.
• Set up multiple AWS environments for development purposes, successfully migrating the virtual setup to AWS Cloud, leading to a 10% reduction in resource utilization costs.
• Set up and used Prometheus to monitor the entire infrastructure, including cluster health, resource usage, and application metrics and built real-time Grafana dashboards to visualize metrics, track system performance, and enable proactive incident response.
Site Reliability Engineer (SRE) Comviva Technologies Bengaluru, India 08/2018 - 10/2021
• Led end-to-end deployment processes and ensured seamless post-deployment operations for telecom platforms, including upgrades, hotfixes, and new feature rollouts, improving production reliability.
• Diagnosed and resolved critical infrastructure and application issues in real-time, reducing downtime and improving incident response time across customer sites.
• Implemented and maintained monitoring solutions using Nagios to track key metrics such as latency, error rates, and system health for proactive alerting.
• Applied Ansible to manage infrastructure as code (IaC), ensuring consistent provisioning, configuration, and deployment of environments.
• Conducted system performance tuning and root cause analysis to ensure SLA adherence and reduce recurring issues. EDUCATION
Master of Science in Engineering Technology
San Jose State University San Jose, CA Expected Graduation: May 2025 Bachelor of Engineering in Telecommunication Engineering Dayananda Sagar College of Engineering Bengaluru, India 2014 - 2018 CERTIFICATIONS
• AWS Certified Solutions Architect – Associate: Credential Link
• HashiCorp Certified: Terraform Associate (003): Credential Link PROJECTS
Quality Assurance for Semiconductor Wafers using Deep Learning: I used RetinaNet to predict wafer alignment using labelled image data, improving QA precision. – I implemented the model using TensorFlow and Python, achieving high accuracy. Network Intrusion Detection System using ML and DL for IoT Networks: Researched and implemented ML and Neural network models to enhance IoT security by detecting attack traffic. To make the NIDS accessible for real-time predictions, the best-performing models were deployed as a RESTful API using Flask.