Site Reliability Engineer

Location:
Atlanta, GA
Posted:
January 18, 2024

Resume:

Neeharika Tenneti

Sr. GCP DevOps Site Reliability Engineer

Contact: +1-650-***-****

Email: ad2v2m@r.postjobfree.com

LinkedIn Profile

Summary:

9+ years of experience in DevOps, Cloud, and SRE, implementing and maintaining applications in cloud environments (GCP), with exposure to all phases of delivery: analysis, configuration management, build, packaging, deployment, automation, testing, source code management, and environment management.

Configured and managed cloud Infrastructure on GCP services such as Google Kubernetes Engine, GCE, VPC, Dataflow, PubSub, Composer, GCS, IAM, GCR, Cloud Build, BigQuery, Cloud Armor, Cloud Functions, Cloud SQL etc.

Experienced in writing Infrastructure-As-Code in HashiCorp Terraform.

Experienced in logging and monitoring services in GCP, Site 24x7, Dynatrace and ELK.

Experienced in various deployment strategies such as canary and blue/green.

Experienced in GKE: reserving public IPs, creating VPCs and load balancers, creating front-end and back-end configs, creating Network Endpoint Groups, configuring master and worker nodes, and configuring Pods.

Developed and maintained Continuous integration (CI) and Continuous Deployment (CD) using Jenkins, Harness, and Nexus.

Worked with Compute services, Storage buckets, and IAM roles; defined auto-scaling, VPC networks, and patching schedules; performed rolling updates.

Experienced in Microservices based Applications & Services using container technologies Docker, GKE.

Proficient in creating Docker images from Dockerfiles and publishing them to GCR.

Experienced in branching, merging, and maintaining versions using SCM tools such as Git, Bitbucket, and GitHub on Windows and Linux platforms.

Performed and deployed builds for various environments, including DEV, QA, UAT, and Production.

Hands-on experience with JIRA: configured workflows, customizations, and plug-ins for the JIRA issue tracker, and integrated Jenkins with Jira/GitHub to track change requests and bug fixes and to manage tickets for the corresponding sprints.

Strong analytical and problem-solving skills and can work either independently with little or no supervision or as a member of a team.

Good written and verbal communication skills, strong organizational skills, and a hard-working team player, well-practiced in attending phone calls and answering business team queries.

Certifications:

Google Cloud Certified Professional DevOps Engineer 2023

Google Cloud Certified Associate Cloud Engineer 2022

TECHNICAL SKILLS:

Cloud Services : GKE, GCE, VPC, CloudRun, AppEngine, BigQuery, Dataflow, PubSub, Cloud Functions, Airflow etc

Database : MySQL

Scripting Languages : Python, Bash

Infrastructure Tools : Terraform, Packer

Monitoring Tools : Google Cloud Logging and Monitoring, Dynatrace, Site 24x7 and ELK

Container Tools : Docker, Kubernetes

CICD Tools : Jenkins, Harness, Nexus

Source Code Repository: GitHub, BitBucket

Operating Systems : Windows, Linux

Education: Bachelor's in Electronics Engineering from Kakatiya University.

Professional Experience:

EPAM Systems July 2022 – Present

SRE/DevOps Engineer

Responsibilities:

Responsible for building the data governance platform at Pearson, which includes functional components such as a data warehouse, data marts, data pipelines, and data services. The platform serves the data needs of different Pearson business units.

Responsible for building and managing CI/CD pipelines in Jenkins for code deployments to CloudRun, App Engine, Dataflow, Composer (Airflow), etc.

Experience in handling Composer Airflow services.

K8s: Experience with Namespaces, Deployments, Services, ConfigMaps, Secrets, Vault Secret Manager, DaemonSets, Network Policies on Control Plane, autoscalers (HPA, VPA, CPA), sidecar injection, and Ingress Controllers.

Ensured production applications stayed within their SLOs/SLAs and allocated error budgets, and led dev teams in minimizing application downtime that could otherwise cost revenue.

Responsible for balancing feature development speed and reliability with well-defined service-level objectives.

Experienced in creating and configuring Jenkins jobs for build and release pipelines using Groovy and DSL.

Work with GCP IAM – define custom roles and permissions, work with service accounts (SA).

Responsible for reliability (Git/CloudLab/Docker/K8s): built infrastructure via IaC tools, containerized applications, and maintained and upgraded the application deployment platforms.

Deployed and managed complex GKE (Google Kubernetes Engine) clusters for critical applications, ensuring optimal performance, scalability, and reliability.

Experienced in Monitoring and troubleshooting Kubernetes workloads using tools like ELK.

Expertise in Terraform as a GCP DevOps engineer, orchestrating infrastructure as code to provision, manage, and scale cloud resources with precision and consistency.

Managed routine system backups, scheduled jobs (including enabling and disabling cron jobs), and enabled system and network logging on servers for maintenance, performance tuning, and testing.

Utilized Packer to streamline and automate the creation of custom machine images.

Automation enabled via Jenkins and its multi-branch pipelines – provision Jenkins, configure jobs, install plugins, set timeouts, define build and post-build.

Handle data based on data classification, handle encryption of data, manage certificates, and patch vulnerabilities for services/infrastructure - Vault/Secrets Manager and Certificates Manager.

Worked as part of a team to keep Linux OS systems maintained and optimized through daily patching, upgrading, and tuning when system maintenance was required.

Administered and maintained Red Hat operating systems, and resolved application, network, and performance issues.

Wrote Python and Bash scripts for automated backups and cron jobs.

Diagnosed and resolved problems associated with Firewalls.

Responsible for any infrastructure issues in the GCP environment.

Developed Infrastructure as Code (IaC) to deploy highly scalable, portable applications using Helm and Kubernetes.

Containerized the application by creating a Dockerfile and pushed the image to the GCR container registry for app deployment.

Develop, maintain and deploy jobs for application code across all environments using automated tools (Jenkins, GitHub, Docker, and Nexus).

Experienced in Automating the continuous integration and continuous deployment (CICD) using Jenkins, GitHub, Nexus, Docker and GCR.

Resolved server and client vulnerabilities found by Qualys scans.

Environment: Windows, Linux, Google Cloud, Docker, Terraform, Packer, Qualys, Vault, Kubernetes, Dataflow, Composer, Airflow, BigQuery, CloudRun, AppEngine, GCE, MySQL, CloudArmor, ServiceNow, JIRA, CI-CD, Bash scripting, Groovy, Python, Jenkins, GitHub, Bitbucket, Stackdriver, PagerDuty, ELK, Helm

NCR Corporation Oct 2020–July 2022

SRE/DevOps Engineer

Responsibilities:

Automated deployment of applications into the cloud using Harness CI/CD and Python scripting.

Implemented an error budget in Data Studio (GCP) using BigQuery to measure the availability of applications.

Ensured production applications stayed within their SLOs/SLAs and allocated error budgets, and led dev teams in minimizing application downtime that could otherwise cost revenue.

On-call production support.

Monitoring and maintenance of live production applications hosted in GCP using Site 24x7 and Dynatrace.

Assisting dev teams in deploying their applications using Kubernetes.

Involved in the GitOps POC for CI/CD implementation with GitHub Actions.

Coordinated with developers to establish and apply appropriate branching, labeling, and naming conventions using Git; standardized the Gitflow process around pull request creation, review, and merge.

Automated the continuous integration and deployments (CI/CD) using Harness.

Experience in monitoring, analyzing and troubleshooting application issues with Dynatrace.

Resolve incidents, perform blameless post-mortems for RCA, and come up with short-term (stop-gap) and long-term solutions.

Ensure High Availability and develop backup strategies.

Responsible for resolving network issues using network tools such as ping, TCP traceroute, and tcpdump.

Managed security of servers.

Expert in setting up and adjusting Dynatrace thresholds, alerting, and integrations with multiple notification channels.

Implemented process for release management, automated code deployment, configuration management, and monitoring.

Managed and optimized Kubernetes clusters to ensure seamless deployment, scaling, and management of applications.

Document solutions for any issues that have not been discovered previously.

Deployed and managed GKE clusters for critical applications, ensuring optimal performance, scalability, and reliability.

Experienced in Monitoring and troubleshooting Kubernetes workloads using tools like Dynatrace.

Performed general administration duties including permissions, upgrades, and patches on Google cloud.

Created managed Kubernetes clusters on GCP using GKE.

Automated scheduling, rollouts, and rollbacks with Kubernetes to launch containers on particular cluster nodes and bring them to the desired state.

Implemented a CI/CD pipeline that bridges the gap between development and operations teams by automating the building, testing, and deployment of applications using Git, Kubernetes, Jenkins, and Terraform.

Created Clusters using Kubernetes and worked on creating many pods, replication controllers, services, deployments, labels, health checks and ingress by writing YAML files.

Supported 24x7 production computing environments, providing on-call and weekend support.

Environment: Windows, Linux, Google Cloud, Terraform, Kubernetes, Git, Harness, GCE, BigQuery, CloudBuild, CloudFunctions, Datastudio, CloudArmor, Python, Docker, Helm, Shell Scripts, Stackdriver, ServiceNow, Dynatrace, Site24x7, JIRA and PagerDuty.

TCS, Ericsson June 2015–Dec 2019

DevOps Engineer

Responsibilities:

Ericsson BID CI-E provides best-in-class tools to development units, supporting Ericsson's Continuous Integration, Delivery, and Deployment strategy.

Maintained the CI/CD system to ensure continuous delivery of Ericsson 4G products to customers.

Designed Jenkins job flows to automate build and test loops as a continuous integration system for developing a product.

Designed and developed CI/tool machinery and build systems for building products in 5G CI Flow.

Worked as a critical support driver for the BID CI-E First-line Support team and trained colleagues to build competency in 24/7 critical support of the 4G and 5G flows.

Responsible for maintaining and improving the build scripts.

Environment: Windows, Linux, Gerrit, Git, Groovy, DSL, Erlang, Nexus, Jenkins, Python, Docker, Zipline, Shell Scripts, JIRA

Consultant Developer at TCS(Ericsson): Stockholm.

Ericsson, NFVI Solution enables operators to deploy virtual telecom applications.

NFVI (Network Function Virtualization Infrastructure) is a cloud platform for deploying virtual network functions and applications. NFVI CI provides a pre-integrated, verified, and tested solution for the latest cloud.

Automated the deployment process of Ericsson’s CEE (Cloud Execution Environment), SDI (Software Defined Infrastructure), and SDN (Software Defined Networking), and their integration with Ericsson Orchestrator.

Developed Auto deploy tool (ADT) staging environment with unit test cases for NFVI CI to automate the source code changes to production.

Integrate build and tests using Jenkins.

Designed and developed CI flow from scratch using the zipline framework for NFVI test automation.

Led the team for the Jenkins Pipeline POC for the NFVI CI flow.

Responsible for documentation of the basic business logic and processes including applications and services involved.

Responsible for reviewing all open tickets and resolving and closing existing tickets.

Environment: Windows, Linux, Gerrit, Git, Groovy, DSL, Nexus, Jenkins, Python, Docker, Zipline, Shell Scripts, JIRA

Consultant Developer at TCS(Ericsson): Budapest

Ericsson, CIME Minecraft provides continuous integration and continuous delivery platform to integrate at node level for all the Ericsson products in 4G and 5G.

Trained colleagues in Budapest on 24/7 critical support of MIB, one of the front-end visualization tools of the BID CI-E 4G and 5G flows.

Automation enabled via Jenkins and its multi-branch pipelines – provision Jenkins, configure jobs, install plugins, set timeouts, define build and post-build.

Environment: Windows, Linux, Gerrit, Git, Groovy, DSL, Nexus, Jenkins, Python, Docker, Zipline, Perl, Shell Scripts, JIRA

Sapeare Technologies - Hyd Feb 2014 – May 2015

Software Intern

Responsibilities:

Involved in analyzing user requirements for application enhancements and preparing testing specifications.

Involved in creating test cases based on use cases and updating them frequently.

Executed functional and system tests manually.

Actively participated in Bug Track meetings.

Submitted a weekly coverage status report of tests performed.

Environment: Manual Testing, SQL, UAT


