VENKATESH NOMULA
Email: ***************@*****.*** Phone: +1-717-***-****
DevOps Engineer/ Senior Cloud Engineer
PROFESSIONAL SUMMARY
Over 11+ years’ IT experience with specialization as Linux/DevOps Engineer working on Server based OS, Cloud IaaS with AWS and DevOps Culture through Continuous Integration (CI) & Continuous Deployment /Delivery (CD) and Automation of Infrastructure as Code and skilled on Windows Server 2008-R2/2012- R2/2016 on various hardware platforms and AWS/Azure cloud.
Managing Amazon Web Services (AWS) infrastructure with automation and configuration management tools such as Ansible, Puppet, or custom-built. Designing cloud-hosted solutions, specific AWS product suite experience.
Broad experience in creating the company's DevOps strategy in a mixed environment of Linux (RHEL, Ubuntu, CentOS) servers and windows along with creating and implementing a cloud strategy based on Amazon Web Services.
Knowledge of Docker components like Docker Engine, Docker-Hub, Docker-Compose, Docker Registry and Docker Swarm. Used Docker for local Application Deployment and test. Enhance the SDLC cycle by using Docker containers delivered with Kubernetes for web application deployment.
Designing and implementing fully automated server build management, monitoring and deployment By Using DevOps Technologies like Chef, Puppet.
Worked with Chef Enterprise Hosted as well as On-Premises, Managed On-site OS/Applications/Services/Packages using Chef as well as AWS for EC2/S3/Route53 & ELB with Chef Cookbooks, Installing Workstation, Bootstrapped Nodes, Written Recipes and Cookbooks and uploading them to Chef-Server.
Experience writing Chef Cookbooks and Recipes to automate the deployment process and integrating Chef Cookbooks into Jenkins’ jobs for a continuous delivery framework.
Experience in Kubernetes to deploy scales, load balance and managing Docker containers with multiple names spaced versions and good understanding of Open Shift Platform in managing Docker Containers and Kubernetes Clusters.
Providing foundational infra as code (IaC). Managing underlying infra (VMs, K8s Cluster, Rancher, Alerting, Monitoring and Logging). Code to support automation of infrastructure provisioning using ansible. Building base Docker images and standardizing container services.
Provided consistent environment using Kubernetes/Rancher for deployment scaling and load balancing to the application from development through production, easing the code development and deployment pipeline by implementing Docker Containerization.
Evaluate and recommend public, private and hybrid Cloud solutions, including Infrastructure-as-a-Service (IaaS), Software-as-a-Service (SaaS), and Platform-as-a-Service (PaaS).
Worked on JIRA, Agile/Scrum, Rally with short sprint cycle to release product, Participated in Agile/Scrum Community of Practice across the organization. Delivering monthly training sessions and facilitating discussions on team performance improvements.
Configuring and Administering NFS, DNS, NIS, DHCP, NIS+ and LDAP on Sun Solaris and on Linux server environment. Also, Expert in automation (writing Shell, Perl & Python scripts) to monitor UNIX production applications.
Installed, upgraded and configured RedHat Linux 5.x and 6.x using kickstart installation and SUSE Linux Servers manually.
Day-to-day jobs included but are not limited to handling Tickets, Monitoring, Troubleshooting and maintenance. And architected and designed Rapid deployment method using Ansible to auto deploy servers as needed.
TECHNICAL SKILLS
Cloud
AWS, Azure, GCP
Virtualization
VMware vSphere, VMware ESX, ESXi and VCenter
OS
RedHat, CentOS, Ubuntu, Windows (NT)
Version Control
Git, SVN, GitHub, GitLab, Bitbucket
CI/CD
Atlassian Bamboo, Jenkins, GitLab - CI/CD, Artifactory, Nexus, ECR
Containerization
Docker, Kubernetes
Configuration Management/Scripting
Chef, Ansible, Shell, bash, Python
Monitoring
Nagios, Splunk
Databases
MySQL, Aurora, MongoDB, SQL Server, Postgres, Redis
EDUCATION
Bachelor of Engineering in Information Technology from JNTU, Hyderabad, INDIA 2013.
Master’s in computer information systems from Bellevue University, Omaha, Nebraska, USA 2016.
PROFESSIONAL EXPERIENCE
FIS Global – Phoniex, AZ Sep 2023 – Till date
Senior System Administrator
Design, Install, configure, and maintain heterogeneous environment on on-prem and cloud-based infrastructure to support cross domain services/applications.
Installed, configured, and maintained the Octopus deploy Automation for releases, patches .net application by setting a transformed variable to change Config files accordingly.
Install, test, and validate the new emerging software to check the compatibility on existing and new infrastructure provisioning/ Deploying the software’s.
Design and implement building and deployment pipelines for Java and Nodejs microservices and build management tools like maven and npm.
Design and implementation of engineering DevOps and SaaS Infrastructure, including a move from private cloud to public cloud, and support the infrastructure stack (OS, storage, network, database, web, and virtualization).
Develop Solutions to improve architecture, deploy process applications securely to servers that are on-premises and cloud Infrastructure, AWS native PAAS offerings.
Develop Build solutions to problems that interrupt availability, performance, and stability in our systems, services, and products at scale. Perform a wide variety of technical and administrative duties in overall systems design, development, and delivery.
Responsible for hosting the application on their best suitable application servers making sure that it has ZERO DOWNTIME DEPLOYMENT when it is live in the production environment. Large extent we host on Tomcat, IIS, WebSphere, Jboss and Tcservers.
Responsible for configuring and maintaining Prod/QA environments consisting of complex middleware environments with several variations of Tomcat installations consisting of multiple instances in each installation.
Managing technical cases with hardware & software vendors to resolve problems and Analytical solve problems for complex multi-tiered applications.
Troubleshooting through the use of wire shark and browser development and diagnostic work to get to the root cause of any issues.
Managing the Disaster recovery testing/planning, building secure Red Hat Linux machines and Windows Servers 2008, 2016, 2019.
Responsible for IIS, Apache (reverse proxy) and with Windows security and load-balanced applications using NetScaler.
Develop Build solutions to problems that interrupt availability, performance, and stability in our systems, services, and products at scale. Perform a wide variety of technical and administrative duties in overall systems design, development, and delivery.
Managing technical cases with hardware & software vendors to resolve problems and Analytical solve problems for complex multi-tiered applications.
Experience in designing and implementing build and deployment pipelines for Java and Nodejs microservices and build management tools like maven and npm.
Building a production quality dashboard with standardized views across all the splunk infrastructure within the organization.
Standardize Splunk forwarder deployment, configuration and maintenance in Linux and windows platforms.
Administrating VM clusters in production, enforce security compliance and data governance rules. Application deployment on the production and non-production environments by following proper release and change management.
Coordinate Releases from Dev to Production. Communicating with Developers, Testers, Managers, Unix, Network teams and DBA teams for successful Release.
Created IAM policies for delegated administration within AWS and Configure IAM Users / Roles / Policies to grant fine - grained access to AWS resources to users.
Improved infrastructure design and approaches of different projects in the cloud platform Confidential Web Services (AWS) by configuring the Security Groups, Elastic IP's and storage on S3 Buckets.
Developed, deployed, and managed event-driven and scheduled AWS Lambda functions to be triggered in response to events on various AWS sources including logging, monitoring, and security related events and to be invoked on scheduled basis to take backups.
Mastercard – O'Fallon, MO Feb 2021 – Sep 2023
Site Reliability Engineer
Install, test, and validate the new emerging software’s/tools to check the compatibility on existing and new infrastructure provisioning/ Deploying the software’s.
Design and implementation of engineering DevOps and SaaS Infrastructure, including a move from private cloud to public cloud, and support the infrastructure stack (OS, storage, network, database, web, and virtualization).
Configured Azure Security using RBAC, Azure Security Center and Azure Monitor services and Configured Azure Infrastructure Automation using Runbooks and Terraform scripts.
Designed Network Security Groups (NSGs) & Load Balancer to control inbound and outbound access to network interfaces (NICs), VMs and subnets.
Involved in firewall deployment and management in Azure such as Palo Alto, Azure Firewall and Implemented automate build & operational tasks using Python and Power Shell scripts.
Implemented Serverless Cloud Services using Azure Functions with application insights and Implemented a CI/CD pipeline using Azure DevOps (VSTS/TFS) in both cloud and on-premises with GIT, MS Build, Docker, Maven along with Jenkins’s pipeline builds & YAML/JSON.
Troubleshoot Splunk infrastructure components in highly available, distributed splunk deployment.
Implementing CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practice. Deployed JAR, WAR, EAR and J2EE applications on Apache tomcat server using Jenkins’s auto deployment.
Develop Build solutions to problems that interrupt availability, performance, and stability in our systems, services, and products at scale. Perform a wide variety of technical and administrative duties in overall systems design, development, and delivery.
Managing technical cases with hardware & software vendors to resolve problems and Analytical solve problems for complex multi-tiered applications.
Configured EC2 instance for running applications on AWS infrastructure. Configured Route53 to register domain names, check the health of resources, send notifications when a resource is unavailable.
Used AWS S3 API to create, manage buckets & upload objects. Followed various bucket policies to provide server-side encryption to all the objects stored in the bucket.
Customized Ansible modules for finding facts about AWS CloudWatch alarms and taking actions to pause/unpause those alarms during deployments. Used Ansible Tower Dashboard to view the complete status of my plays & tasks, each success & failure of Playbooks.
Used Kubernetes to deploy scale, load balance and manage Docker containers by creating Pods & Clusters with multiple namespace ids.
Managed Kubernetes charts using Helm and created reproducible builds of the Kubernetes applications, managed Kubernetes manifest files & releases of Helm packages.
Configured email and messaging notifications, managed users and permissions, system settings and maintained two Jenkins servers and one Bamboo server to deploy into production and non-production environment.
Created repositories, branches, tags in GIT, assisted the developers to fix merging issues and Creating Local, Virtual Repositories in Artifactory for the project and release builds.
Citizens Bank – Johnston, RI July 2019 – Feb 2021
Senior Cloud Engineer
Design, Install, configure, and maintain heterogeneous environment on on-prem and cloud-based infrastructure to support cross domain services/applications and develop Terraform Templates to provision AWS infrastructure and AWS native PAAS offerings.
Delivered Solutions to improve Cloud architecture, deployment for AWS, managed multiple AWS accounts with multiple VPC's for both prod and non-prod for automation, build out, integration.
Infrastructure Development and Operations, involved in designing and deploying utilizing AWS stack like EC2, S3, VPC, RDS, SES, ELB, SQS, Auto scaling, Cloud Front, Cloud Formation, Elastic Cache, Cloud Watch, SNS and Route 53.
Writing Templates for AWS infrastructure as a code using Terraform to build staging and production environments. Integrated Amazon Cloud Watch with Amazon EC2 instances for monitoring the log files, store them and track metrics.
Implemented and maintained the monitoring and alerting of production and corporate servers, storage using AWS Cloud Watch.
Designed and implemented production ready container orchestration platform which is highly available, load balanced and fault tolerant – Kubernetes.
Implemented a production ready, load balanced, highly available, fault tolerant, auto scaling Kubernetes AWS infrastructure and microservice container orchestration.
Managed Kubernetes charts using Helm, and Created reproducible builds of the Kubernetes applications, managed Kubernetes deployment and service files and managed releases of Helm packages.
Provided consistent environment using Kubernetes for deployment scaling and load balancing to the application from dev through production, easing the code development and deployment pipeline by implementing Docker containerization with multiple namespaces.
Working with Kubernetes deployments, Kubernetes volumes, Kubernetes network policy, command line utilities of Kubernetes Kubectl.
Experience in migrating the Legacy application into GCP platform and managing the GCP services such as Compute Engine, cloud storage, Big Query, VPC, Stack Driver, Load Balancing and IAM.
Experience in reducing the cost for computing engines in GCP as per utilization of services and working on TERRAFORM for provisioning the Environments in GCP.
Responsible for managing the GCP services such as Compute Engine, App Engine, Cloud Storage, VPC, Load Balancing, Big Query, Firewalls, and Stack Driver.
Setting up and configuring New Relic, improved monitoring and performance of application in new relic.
Experience in using monitoring tools like Splunk, New Relic, DataDog.
Responsible for Installing, configured and administered Splunk Enterprise on Linux and Windows servers.
Expertise in writing Splunk searches, Splunk Infrastructure and Development expert well - versed with Splunk architecture and design.
Designing and maintaining production-quality Splunk dashboards. Splunk Enterprise Deployments and enabled continuous integration as part of configuration management.
Developed and maintain New Relic scripts for API and scripted browser monitors.
Monitoring server behavior in terms of utilization of resources, drilling down to method level and query level to identify bottlenecks using New Relic.
Deploying and maintaining production environment using AWS EC2 instances and ECS with Docker. Container management using Docker by writing Docker files and set up the automated build on Docker HUB and installing and configured Kubernetes.
Managing Kubernetes and other infrastructure components using Ansible playbooks, designing Ansible playbooks for automation and integration of infrastructure.
Good knowledge of Docker and Kubernetes. Created a docker-registry and router pods in the OpenShift environment to serve user requests.
I wrote Ansible playbooks and created Roles for Scalability and Reusability in Ansible, playbooks for provisioning, orchestration, packages, services, configuration, and deployment using Ansible.
Developing helm charts and deploying into Kubernetes platform, managing build and deployment pipelines for Java and Node.js microservices.
Worked on applications like smart log and load cast application, A-MQ and fuse Jboss application.
Red hat Jboss EAP/Fuse/AMQ Installation and configuration.
Responsible for setting up logging using Splunk and integrating with microservices pipeline along with dashboards and log metrics for the microservices.
Automated AMI build using packer and worked in a dynamic routing/load balancing capability enabling large application scaling using Nginx.
Implementing monitoring solutions for Kubernetes and AWS PAAS – Datadog, CloudWatch in a hybrid public/private cloud environment.
Created datadog dashboards for various applications and infrastructure to increase observability and monitoring real-time.
Walt Disney Attractions Technology – Orlando, FL April 2018 – July 2019
Systems Engineer/ AWS Cloud Engineer
Support heterogeneous environment for maximum up time and troubleshoot the environment related fixes to make the applications up and running.
Delivered Solutions to improve Cloud architecture, deployment for AWS Private and Public Cloud. Managed multiple AWS accounts with multiple VPC's for both prod and non-prod for automation, build out, integration and cost control.
Created Terraform templates to write Cloud Formation scripts automate the provisioning of AWS resources like IAM, EC2, S3, Route 53, SNS, RDS, ELB and Auto Scaling and make calls to Puppet scripts to provision and configure servers on the instances.
Install, test and validate the new emerging software’s/tools to check the compatibility on existing and new infrastructure provisioning/ Deploying the software’s.
Developing Chef Recipes to automate build/deployment process and do an overall process improvement to any manual processes. Writing Unit test cases for chef recipe testing using Test kitchen, food critic.
Perform tasks like bootstrapping nodes to executing run lists to mirror the new servers to application servers and to run deployments against newly added servers to the clusters.
Responsible for hosting the application on their best suitable application servers making sure that it has ZERO DOWNTIME DEPLOYMENT when it is live in the production environment. Large extent we host on Tomcat, IIS, WebSphere, Jboss and Tcservers.
Working on newer F5 products and modules for user authentication, remote access, and security enforcement (app firewall, data center firewall, network firewall the F5 way) and their integration with LTM/GTM. Supporting multiple F5 LTM and GTM platforms. This includes installing, upgrading, and troubleshooting issues on these platforms.
Responsible for configuring and maintaining Prod/QA environments consisting of complex middleware environments with several variations of Tomcat installations consisting of multiple instances in each installation.
Creating Application Security Certificates using AWS SSL Certificate Manager and AWS Route 53 Modules for Domain Name Server (DNS) Hosting.
Creation Terraform Scripts for Provisioning / Creation of EC2 and Deploying Load Balancers and configuring Health Check End Points for AWS applications.
Update recipes to implement Zero Downtime Deployment, to make sure application is running during deployment and for Jenkins Push Button Deploy.
Responsible for generating the TMSH script and executed on the LTM to create a VIP for application. Adding pool-members, data bags and HTTP profiles along with I-rules. Configuring TLSv1.2 upgrade on the Load balancer for security protocol. Troubleshooting the VIP if any incident occurs from application team. Sometimes getting downtime for all VIP for F5 LTM upgrade.
Provisioned infrastructure in AWS cloud using terraform and Atlantis an open-source advanced version terraform tool which is integrated with GitHub.
Worked with many application teams of Disney globally for building and Dockerizing their application source code using docker and Jenkins. Integrated Jenkins with repositories like GitHub to build Automation tool (Maven) and artifact repository for pushing successful build code. Managed Continuous Delivery and Integration Pipelines using Jenkins as CI and Run deck as CD.
Installed, configured and maintained the Docker environments in Linux & Unix systems to house all microservices and created several scripts in python to monitor and admin all docker containers.
Used Service now for incident management and also Splunk and AppDynamics for monitoring to detect potential outages and reacted quickly to mitigate downtimes.
Maintenance of various applications infrastructure Deployment and Load Balancing involving 24/7 on-call rotations for uninterrupted service of the applications.
Pitney Bowes – Danbury, CT May 2017 – March 2018
DevOps Engineer/ AWS Cloud Engineer
Implemented AWS high availability using AWS Elastic Load Balancing (ELB), which performed balance across instances in multiple availability zones. Architect Development, Test, Integration, and Production AWS environments.
•Performed AWS CLI Auto Scaling and Cloud Watch Monitoring Creation and Designed AWS Cloud Formation templates to create customized VPC, subnets, NAT to ensure successful deployment of Web applications and database templates.
•I worked at optimizing volumes and EC2 instances and created multiple VPC instances and experience in configuring and networking of Virtual Private Cloud (VPC) and worked AWS EC2, EBS, Trusted Advisor, S3, Cloud Watch, Cloud Front, IAM, Security Groups, Auto-Scaling.
•Experience in installation and management of network related services like TCP/IP, FTP, SSH, DNS, TELENT, HTTP, HTTPS, LOAD BALANCING, VPN, FIREWALL, SUBNETS, SMTP, CDN.
•Broad experience in creating the company's DevOps strategy in a mixed environment of Linux (RHEL, Ubuntu, CentOS) servers and windows along with creating and implementing a cloud strategy based on Amazon Web Services.
•Good Knowledge on Docker components like Docker Engine, Docker-Hub, Docker-Compose, Docker Registry and Docker Swarm. Used Docker for local Application Deployments and testing.
•Extensive experience in working with Docker container snapshots, attaching them to a running container, removing images, managing directories structures and managing containers.
•Setup and build AWS infrastructure resources – VPC, EC2, S3, IAM, EBS, Security Group, Auto Scaling, and RDS in Cloud Formation JSON/YAML templates. Provided DevOps teams with clear direction - including appropriate governance and decision making.
•Configured AWS IAM and Security Group in Public and Private Subnets in VPC. Implemented AWS solutions using EC2, S3, RDS, EBS, Elastic Load Balancer, Auto scaling groups, AWS CLI, Cloud Formation, Cloud Front.
•Application Monitoring Tools, Dynatrace, Splunk, AppDynamics. Used Application Monitoring Tool AppDynamics troubleshooting AppD configuration issues, metric data, backend issues etc.
•Develop custom app configurations (deployment-apps) within Splunk to parse, index multiple types of log format across all application environments. Supporting and monitoring Splunk cluster infrastructure in AWS cloud environment.
•Maintain AppDynamics for application performance-related monitoring. Work with employees to discuss application diagnosis and remedy using the tool. Integrate AppDynamics alerts into Splunk.
•Installed, configured, and administered IBM HTTP Web Server, WebSphere Application Server, and Apache Tomcat on Red Hat Linux & AIX platforms for the on-going application deployments.
•Managed Jboss Application server on DEV, QA, and PROD systems and configured Apache for Jboss domains with virtual servers.
•Installed and configured monitoring tools such as Nagios, Zabbix, and Splunk for resource monitoring/network monitoring/log monitoring.
Kiewit Constructions – Omaha, NE Nov 2016 – March 2017
DevOps Engineer/Cloud Engineer
Installed chef server and Workstation bootstrapped Nodes. Wrote Cookbook recipes and uploaded them on to the Chef-server, Managed On-site Applications/ OS/ Packages/ Services using Chef as well as AWS for EC2/S3&ELB with Chef Cookbooks.
Develop continuous delivery product release pipelines, using tools such as Rundeck, Jenkins, and enable release and operations automation using Chef.
Supported and implemented recipes for Chef Environment with over 100 servers and implemented automated deployment configuration using Chef Cookbooks and recipes. Created alarms and trigger points in CloudWatch based on thresholds and monitored logs via metric filters.
Integrated Jenkins with repositories like GitHub to build Automation tools (Maven and Ant), and Artifact repository for pushing successful build code.
Automated various infrastructure activities like Continuous Deployment, Application Server setup, Stack Monitoring using Ansible playbooks and has integrated Ansible with Rundeck and Jenkins.
Extensively worked on Maven in setting up SCM polling with Maven Repositories and Professional experience in installing Jenkins/Plugins for GIT Repository.
Executed the Maven builds locally to troubleshoot Java code issue and merging related issues and created and maintained Jenkins jobs that execute Shell scripts for automation.
Integrated GIT into Jenkins to automate the code check-out process and responsible for design and maintenance of the GIT Repositories, views, and the access control strategies and performed all necessary day-to-day GIT support for different projects.
Configuring Splunk enterprise instances so that each instance performs a specialized task over large scale distributed Splunk environment.
Experience working on several Docker components like Docker Engine, Hub, Machine, Compose and Docker Registry.
Experience working on Docker hub, creating Docker images and handling multiple images primarily for middleware installations and domain configurations.
Developed an opensource project with Kubernetes to manage a cluster of Linux containers, running Docker containers across multiple hosts.
Experience in troubleshooting, performance tuning, storage & security management and user administration in LINUX servers.
Logic4G Solutions Pvt. Ltd – Hyderabad India July 2013 – Nov 2014
Linux/VMware System Admin
Installation of ESX 4 Nodes, vCenter 4, Update manager, and Administering VM's across 4 Datacenters Running 2000 VM's.
Installed SQL Server 2012 with latest Service pack and established connectivity with the database created, configured and maintained database maintenance plans.
As a member of a group of Linux/Unix administrators, responsible for providing for 24x7 support for production and responding to user’s tickets created from Peregrine Ticketing Tool. Involved in Deploying the Redhat Linux enterprise server 5.x, 6.x versions.
Setup Kickstart server for quicker installation and ease of adding additional servers. Installed Linux (RHEL 5.5) on HP Blade and VMware ESX servers using RedHat tool Cobbler.
For monitoring the network bandwidth and the hard drives status, Installed and configured monitoring tool Nagios.
Created Microsoft Windows Installer packages using Wise Package Studio and Admin Studio. Advertised and managed packages and collections in SCCM environment.
Installed, integrated and tuned LDAP, Apache Tomcat, JBOSS, WebSphere and Web logic application server.
Performed user account administration, created groups and add users to the appropriate group.