Post Job Free
Sign in

Site Reliability Devops Engineer

Location:
Cumming, GA
Posted:
May 10, 2023

Contact this candidate

Resume:

Radhika

Phone: 678-***-****

Email: *****.****@*****.***

Site Reliability and DevOps Engineer

Professional Summary

● 7 years of experience in Site Reliability Engineer (SRE), DevOps, Con iguration Management, Build and Release management involving extensive work towards code compilation, packaging, building, and debugging, automating, managing and deploying code across multiple environments. A major focus in the areas of Build and Release Engineering and Infrastructure setup for both Cloud(GCP and AWS) and on-premises.

● 3 years of experience in Google Cloud Platform services like (Google Kubernetes Engine (GKE), GCE, Cloud Storage, Pub/Sub, IAM service account roles, Stack driver, cloud SQL, Redis, add health rules in log monitor, ISTIO, Datastudio and BigQuery.

● 3 years of experience in AWS services such as EC2, Load Balancer, Lambda, S3, RDS, DynamoDB, VPC, Route53, IAM, WAF, CloudFormation, CloudWatch, CloudTrail, Code Build, Code Commit, Code Deploy, Code Pipeline, EC2 Container Service, EBS, Elastic Beanstalk, Security Groups, VPCs.

● Design cost effective, highly available, and fault tolerant systems using multiple AWS EC2 instances, Auto Scaling, AWS Elastic Load Balance, DNS mapping with subdomain using Route 53. Pro icient with AWS Identity Access Management

(IAM) for creating roles and groups for users and resources and implementing Multi-Factor Authentication (MFA) on externally available servers and on AWS Console, Monitoring and alerting leveraging Cloud Watch and Cloud Trail. Implement AWS lambda to run servers without managing them and to trigger run code by S3 and SNS.

● Deploy Google cloud Platform Solutions using Google services like compute engine, cloud storage buckets, persistent disks, Bigquery, Redis, Data studio, GKE, Google Compute Engine(GCE), Data low jobs, cloud load balancer, auto scaling groups, cloud deployment manager, Identity and Access Management ( IAM ), stack driver, cloud IAM and cloud DNS.

● Hands-on experience with AWS Lambda work low implementation using python interact with application deployed on EC2 instance and S3 bucket.

● Design and Deploy Google cloud Platform Solutions using Google services like compute engine, cloud storage buckets, persistent disks, cloud load balancer, auto scaling groups, cloud deployment manager, stack driver, cloud IAM, and cloud DNS.

● Well experience in Disaster recovery exercise every year to make sure application will support during one region not available, having a recovery plan helps to reduce the potential damage and quickly restore operations when one occurs.

● Created Kubernetes cluster with objects like Pods, Deployments, and Services created. Extensive experience in Creating Pods and managing using Kubernetes through automated Jenkins jobs, scale down and up GKE workload pod replication based on customer transaction volume.

● Cloud cost saving by stopping/terminating GCE instances and GKE pods on the GCP cloud during non-business hours.

● Automated the shell scripts for noti ications, certs renewal in application servers, deployments, restarts, irewall testing, and Database event alerts.

● Experience on DevOps tools such as Jenkins, Maven, SVN, GIT, and Docker. Exposed to all aspects of Software Development Life Cycle (SDLC) such as Analysis, Planning, Developing, Testing and Implementing, and Post-production analysis of the projects, also experience in Agile Project Lifecycle and Sprints. DevOps/Agile operations process and tools area (Code review, unit test automation, Build & Release automation, Environment, Service, Incident, and Change Management).

● Deployment and con iguration of Microservices into Kubernetes Clusters using Jenkins Pipeline. Extensively worked with Scheduling, deploying, and managing container replicas onto a node cluster using Kubernetes. And worked with building Kubernetes run time environment of the CI/CD system to build, test and deploy in an open-source platform and Managed releases of Helm packages..

● Design and Deploy Google cloud Platform Solutions using Google services like compute engine, cloud storage buckets, persistent disks, cloud load balancer, auto-scaling groups, cloud deployment manager, stack driver, cloud IAM, and cloud DNS.

● Administered GIT Source code tools and ensured the reliability of the application as well as designed the Branching strategies for GIT. Experienced in GIT forks, tagging, handling merge requests and noti ications

● Designed and developed Cloud Service projects and deployed to Web Apps and IaaS.

● Good understanding of Openshift platform in managing Docker containers and Kubernetes Clusters, Created Docker images using Docker ile and deployed in docker registry.

● Designed and Implemented CI/CD DevOps systems for Linux on-prem and Cloud environments.

● Using build automation tools like Ant and Maven. Implementing monitoring tools Appdynamics.

● Deploying artifacts such as JAR, WAR using build automation tools like Maven and Gradle to write pom.xml and build.xml and using MAVEN repository using NEXUS Tool to share snapshots and releases.

● Experienced in Build and Deployment of Java applications on to different environments like Development, QA, UAT, and Production, which includes designing, development and implementation.

● Little knowledge in AWS Services like CloudWatch, S3 bucket EC2 instances, Worked 6 months in AWS cost-saving and support to verify logs in Cloud watch.

Technical Skills

Operating System Redhat-Linux ( 6x& 7x), CentOS (5x, 6x and 7x), Windows Server 2003/2008/2012 Cloud Technologies GCP and AWS

CI/CD Maven, Jenkins,,SVN, Nexus, Artifactory, SonarQube, Junit, Webhooks with Git/SVN, Con iguration

Management

Terraform

Containerization/Virtual Docker, Apache Mesos and Kubernetes Database MySQL, MongoDB, Cassandra, PostgreSQL, SQL Server Web/App Servers Apache, IIS, HIS, Tomcat, WebSphere Application Server, IBM HTTP Server Scripting Shell scripting, Python and Bash

Web Technologies HTML, CSS, and XML, Servlets, JSP Monitoring APPDynamics, DataDOG, Grafana, Prometheus, Elastic ELK, CloudWatch, Shell script alerts from application certi icates and databases.

Tracking JIRA and Con luence .

Methodologies Agile

Education

Master of Computer science

Work Experience

Project #1

Project Name: Bakkt loyalty solutions

Company: Bakkt Holdings, Inc

Employment type : Full-time

Role: Site Reliability and DevOps Engineer

Duration: Nov 2022 to March 9th 2023

Bakkt® Loyalty Solutions powers loyalty commerce across the full range of loyalty currencies and experiences. Deliver real value to your members through lexible, modular options that drive engagement.With the Apple Rewards storefront they can use loyalty currencies, split-pay, or card to own the latest Apple products. Deliver more options. Responsibilities:

● Deployed and con igured Elasticsearch, Logstash, and Kibana (ELK) for log analytics, and application monitoring in integration with AWS Lambda and CloudWatch. Then store that logs and metrics into an S3 bucket using Lambda

● function.

● De ined AWS Lambda functions for making changes to Amazon S3 buckets and updating Amazon DynamoDB table.

● Worked on Power Shell scripts to automate the AWS Cloud system in creation of Resource groups, Web Applications, security groups, irewall rules.

● Worked on google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud SQL, stack driver monitoring and cloud deployment manager.

● Setup GCP Firewall rules to allow or deny traf ic to and from the VM's instances based on speci ied con iguration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically improving user experience and latency.

● Integrated AWS Dynamo DB using AWS lambda to store the values of items and backup the Dynamo DB streams and Utilized Cloud Watch to monitor resources such as EC2, Amazon RDS DB services, Dynamo DB tables, EBS volumes, to set alarms for noti ication or automated actions, and to monitor logs for a better understanding and operation of the system. Integrated AWS Dynamo DB using AWS lambda to store the values of items and backup the Dynamo DB streams.

● Deployed and con igured Elasticsearch, Logstash, and Kibana (ELK) for log analytics, full-text search, application monitoring in integration with AWS Lambda and CloudWatch. Then store that logs and metrics into an S3 bucket using Lambda function.

● Designed an ELK (Elastic Logstash Kibana) system to monitor and search enterprise alerts and con igured ELK stack in conjunction with AWS and using Logstash to output data to AWS S3. Terminated SSL on to Nginx to remove overhead on API servers.

● Installing Tomcat and Apache Web server binaries for different versions and related components across all environments.

● Continuous Deployment pipeline with Jenkins and application servers like Apache and WebSphere.

● Writing Kubernetes yml iles, checking the logs, troubleshooting and deploying the application

● Attending daily meetings, collaborating with team members, and business partners.

● Performing upgrades, automating processes & procedures, identifying & mitigating technical problems, and developing documentation.

● Following Agile scrum methodologies (SCRUM, Kanban, etc.)

● Managing Docker containers and Kubernetes Clusters and installing Docker using the Docker toolbox. Created Docker images using Docker ile.

● Installing and con iguring tomcat, automating processes & procedures, tuning & troubleshooting java applications running on Linux.

● Working on Jira, Git, Artifactory, Jenkins, and scripting experience using Shell Perl or Python.

● Writing Docker iles, creating images and Docker containers, and troubleshooting.

● Resolving connectivity issues and work low problems.

● Working on HTTPD Webserver, Nginx, Con luent Ka ka, Oracle Java, Oracle Weblogic, Apache Tomcat, Kubernetes, and Docker.

● Implementing Tomcat Instances by using the Docker engine for running several Containerized AppServers.

● Contributed to establishing a process to share knowledge documents in process and procedures,

● technical documents via con luence/share point

● Experience in Server infrastructure development on GCP Cloud, Worked on Creation of Pods, and managed using Kubernetes. Utilized Kubernetes for the runtime environment of the CI/CD system to build, test and deploy.

● Implementing a CI/CD (Continuous Integration /Continuous Delivery) framework using Jenkins, Maven & Artifactory in Linux environment and Integration of Maven/Nexus, Jenkins, Git, Con luence, and Jira.

● Installed and con igured Jenkins for Automating Deployments and providing a complete automation solution and involved in the development of test environments on Docker containers and con iguring the Docker containers using Kubernetes.

● Managed Kubernetes charts using Helm. Created reproducible builds of Kubernetes manifests, provide a set of con iguration parameters to customize the deployment, and Managed releases of Helm packages.

● Used Kubernetes for automated deployments, scaling, and management of containerized applications across clusters of hosts for application deployments of docker images.

● Used Jenkins pipelines to drive all microservices builds into Docker registry and then deployed to Kubernetes, Created Pods, and managed using Kubernetes. Used Kubernetes to orchestrate the deployment, scaling and management of Docker Containers.

● Worked with the development team to migrate Ant scripts to Maven and Worked on authoring pom.xml iles, performing releases with the Maven release plugin, and managing Maven repositories.

● Created monitors, alarms, and noti ications for EC2 hosts using Cloud watch.

● Hands on experience on building cloud infrastructure using EC2, ASG, ELB, Route53, Dynamo and VPC & provisioning tool along with Ansible.

● Worked on AWS and Open stack cloud to provision new instances, S3 Storage Services, AWS EC2 & Cloud watch services.

● Involved in creating AWS Route S3 to route traf ic between different regions.

● Utilized S3 bucket and Glacier for storage and backup on AWS.

● Monitoring the server alerts through Nagios, CloudTrail, CloudWatch and troubleshooting the alerts.

● Implemented cluster services using Docker and Kubernetes to manage local deployments in Kubernetes by building a self-hosted Kubernetes cluster using Terraform and Ansible and deploying application containers.

● Utilized Amazon Route53 to manage DNS zones and also assign public DNS names to elastic load balancers IP's .

● Save cloud cost by stopping / terminating GCE instances and GKE pods on GCP cloud during non business hours.

● Automated the shell scripts for noti ications, certs renewal in application servers, deployments, restarts, irewall testing, Database event alerts.

● Send metrics to GCP Bigquery(BQ) and then Datastudio will show the report of application events

● GKE workload pods autoscale automatically based on volume by using HPA.

● Work with the Google GCP team and follow the best practices, conduct SRE guild meetings throughout the organization, share the knowledge with each other . Keep improving current design for best results. Project #2

Project Name: Email Gateway system

Client: CVS

Role: Site Reliability and DevOps Engineer

Duration: Dec 2019 to Nov 2022

Email Gateway System (EGS) application is a middleware application used for sending Order Con irmation email to customers. There are many features in EGS that can be con igured as per application requirements. It gives clients the lexibility to choose and pick from desired functionality. Error handling, Bounce back, Auto response, Retry processing, and Reporting are some of the features of EGS. EGS has strong back-end features which capture the bounce emails and process them.

Responsibilities:

● Create and maintain branching strategies for development teams.

● Created Build Jobs and Deployments in Jenkins and Implemented a CD pipeline with Docker, Jenkins and GitHub.

● Coordinate with development and testing teams and plan for test phases.

● Perform biweekly builds and share results with teams involved in testing.

● Setup Mavenscripts for building the code.

● Create Jenkins jobs for continuous integration setup.

● Build, manage, and continuously improved the build infrastructure for global software

● development engineering teams including implementation of build scripts, continuous

● integration infrastructure, and deployment tools.

● Integrating unit tests, and static code quality checks in CI pipelines.

● Organizing stand-up calls with various teams to review the issues and processes. Assigning

● the issues to owners and checking on the backlogs.

● Creating and tracking issues in Jira.

● Helping teams with deployments, troubleshooting, and rollbacks.

● Troubleshooting issues with servers and working with network teams for any issues.

● Documenting the issues, processes and reports for teams.

● Involved in production support activities and strictly followed SLA’s on resolving the

● tickets/issues.

● Responsible for generating the breached/problems/unresolved tickets reports monthly basis

● and communicating to senior management.

Project #2

Project Name: FIMO (Financial Information And Monitoring Organizer) Client: AmeriTrade, Jersey City, New Jersey, USA

Role: DevOps Engineer

Duration: Sep 2017 – Dec 2019

Ameritrade is one of the leading companies in the inancial investments industry in North America. Worked on “new feature” projects, migration projects, and maintenance projects on Conditional Orders, Portfolio Summary, Trade Stocks, Trade Mutual Funds, Portfolio Positions, and Check Images applications.

Responsibilities:

● Worked extensively on version control systems like SVN & GIT. Involved in conversion of the source code from the SVN to GIT.

● Worked closely with other CM teams, which were migrating from SVN to GIT. Helped them in the CM Planning and Build/Deploy processes. Git platform allows all team members to work together on the same ile and helps to track the changes effectively.

● Used Jenkins/Hudson to automate builds and deployments. Documented release builds and source control processes and plans.

● Supporting Maven multi-module builds and storing artifacts in the remote repository in artifacts. Worked on building and deploying Java

● code through Jenkins and Code coverage analysis using SonarQube. Run quality checks using SonarQube & upload it to Nexus

● repository.

● Experience using Maven and Ant as build tools for the building of deployable artifacts (jar, war & ear) from source code. Automated

● Weekly releases with ANT/Maven scripting for Compiling Java Code, Debugging and Placing Builds into Maven Repository.

● Supporting con iguration of Jenkins Build pipeline jobs, plug-ins installation with the team for con iguration of maven multi-module jobs. Automated the build and release management process including monitoring changes between releases.

● Coordinated with Project Managers to develop the SCM Plans and Tool Usage Models.

● Provided 24/7 on call support on Production Servers. Project #3

Project name: I VRS (Interactive Voice Response System) Client: MTNL

Role: DevOps Engineer

Duration: Mar 2011 – FEB 2012

Interactive voice Response System Allows a person to ask questions and provide answers by pressing keys on their touch-tone phone. Interactive Voice Response System (IVRS) is the generic term given to an application that integrates a Company’s/Organization’s telephone and computer system to become a voice computer that transforms the caller’s telephone into a terminal capable of directly accessing information and services.

Responsibilities:

● Provided 24/7 on call support on Production Servers.

● Involved in production support activities and strictly followed SLA’s on resolving the tickets/issues.

● Setup Mavenscripts for building the code.

● Created Jenkins Jobs for Build and deployment.



Contact this candidate