Sai Suresh *****************@*****.***
DevOps/Cloud Engineer
Cloud-enthusiastic team player with 8 years of experience in the IT industry as a DevOps Engineer, with proven expertise in automation, build/release engineering, and software development on cloud computing platforms such as Azure and Amazon Web Services (AWS). Experience with Azure Cloud, Azure Data Factory, Azure Data Lake Storage, Azure Synapse Analytics, Azure analytical services, Azure Cosmos DB (NoSQL), and Databricks.
Professional Summary:
Experience with several AWS services, including EC2, VPC, IAM, S3, RDS, ELB, Route 53, CloudWatch, CloudFormation templates, CloudFront, CloudTrail, AWS CDK, ALB/NLB load balancers, and AWS Inspector, and in administering them through the AWS Console and the AWS Command Line Interface.
Hands-on experience with AWS cloud services (VPC, EC2, S3, RDS, Redshift, Data Pipeline, EMR, DynamoDB, WorkSpaces, Lambda, Kinesis, SNS, SQS).
Experience in tuning JBoss parameters for optimal performance and scalability.
Strong experience with Docker container technology and Kubernetes container orchestration.
Expert in various Azure services like Compute (Web Roles, Worker Roles), Caching, Azure SQL, NoSQL, Storage, and Network services, Azure Active Directory (AD), API Management, Scheduling, Azure Auto Scaling, and PowerShell Automation.
Experience in managing and automating infrastructure using Ansible Tower or AWX.
Work collaboratively with software engineering teams to deploy and operate the company's system in a seamless fashion and help automate builds and deployments by utilizing different DevOps tools.
Worked as Cloud Administrator on Microsoft Azure, involved in configuring virtual machines, storage accounts, resource groups.
Experience in designing, deploying, and managing OpenStack-based cloud infrastructure.
Hands-on experience in Azure development, including Azure web applications, App Services, Azure Storage, Azure SQL Database, virtual machines, Fabric Controller, Azure AD, Microsoft SQL Server, Microsoft Visual Studio, Windows PowerShell, cloud infrastructure, Azure Search, and Notification Hubs.
Experienced in setting up Kubernetes clusters in on-premises data centers using Kubespray.
Designed, configured, and deployed Microsoft Azure for a multitude of applications utilizing the Azure stack, focusing on high availability, fault tolerance, and auto-scaling.
Experience in Microsoft Azure Cloud Services (PaaS & IaaS), Application Insights, DocumentDB, Internet of Things (IoT), Azure Monitoring, Key Vault, Visual Studio Online (VSO), and SQL Azure.
Experience in designing Azure Resource Manager (ARM) templates and custom build steps using PowerShell.
Web application development using Groovy, Grails, jQuery, and AJAX.
Experience in creating and managing Jira filters and gadgets for custom data visualization and tracking.
Experienced in working with deployment strategies such as blue-green and canary deployments.
Jira development using Java and Groovy scripting.
Experienced in configuring cloud platform components such as Virtual Networks (multi-site, cross-site), VMs, Azure AD, load balancers, Azure SQL, and Cloud Services.
Knowledge of Tomcat's manager and host-manager applications for deployment and monitoring tasks.
Experience in managing Amazon Web Services such as EC2, S3, Route 53, RDS, DynamoDB, SNS, SQS, Lambda, EBS, ELB, Auto Scaling, AMI, and IAM through the AWS Console and API integration, with a focus on high availability, fault tolerance, and auto-scaling, deployment services (OpsWorks and CloudFormation), and security practices (IAM, CloudWatch, CloudTrail).
Experience in cloud automation using AWS CloudFormation templates, Chef, and Puppet.
Implemented multiple CI/CD pipelines as part of DevOps role for on-premises and cloud-based software using Jenkins, Maven, Nexus Repository, GitHub, Chef, Puppet, Terraform, Docker.
Configured and managed Elastic Load Balancing (ELB) to implement fault tolerance and avoid a single point of failure of applications, thus providing high reliability and efficient usage.
Installed, configured, and administered the Jenkins CI tool on Linux machines and built a continuous integration and continuous delivery environment. Set up Docker on Linux and configured Jenkins to run under Docker.
Experienced with the Docker container service and with Dockerizing applications by creating Docker images from Dockerfiles.
Used Jenkins pipelines to drive all microservice builds out to the Docker registry and then deploy them to Kubernetes; created and managed Pods using Kubernetes.
Developed microservice onboarding tools leveraging Python and Jenkins, allowing for easy creation and maintenance of build jobs and Kubernetes deployments and services.
TECHNICAL SKILLS:
AWS
AWS EC2, ELB, S3, EBS, VPC, Route 53, RDS, Auto Scaling, IAM, SNS, SES, SQS, CloudFront, CloudFormation, CloudWatch, Elastic Beanstalk
Azure
Virtual Machines, VNet, Traffic Manager, Storage accounts, Azure Web Apps, AKS, ACR
Programming/Scripting Languages:
Python, Groovy, Shell, Perl, YAML, Node.js, Ruby, SQL, PL/SQL
Technologies & frameworks
Spring, Spring Boot, Spring MVC, Spring Data JPA, Spring Cloud, Hibernate, Apache Kafka, CI/CD, Docker, AWS, EC2, S3, Lambda, Groovy, Jenkins, Kubernetes.
Operating Systems:
Windows, Unix, Linux, Solaris
Web Servers:
Apache Tomcat, Nginx, WebSphere, JBoss, WebLogic
SCM Tools:
Git, GitHub, Subversion
Version Control Tools
SVN, Git, GitHub, GitLab, Bitbucket
Tracking Tools
Jira, Rally
Build/Release (CI/CD)
Chef, Puppet, Ansible, Jenkins, Kubernetes, Azure, Cloud Foundry, CircleCI, TeamCity, Maven, ANT, Git, SVN, TFS, Atlassian Jira, Selenium, Nexus, JFrog, Docker
Monitoring Tools
Nagios, AppDynamics, Dynatrace, Splunk, Logstash, Apache JMeter
Infrastructure as code
Terraform, CloudFormation, Ansible, Puppet, Chef
Containerization Tools
Docker, Kubernetes, AWS ECS/EKS, Apache Mesos, OpenShift, Rancher, Marathon
Application Servers
Apache Tomcat, Nginx, Httpd, WebSphere Application Server, Kafka, JBoss, WebLogic
Networking Protocols
DNS, DHCP, TCP/IP, Cisco Routers/Switches, WAN, LAN, FTP/TFTP, SMTP
Monitoring Tools
Nagios, AWS CloudWatch, Splunk, ELK, Grafana
Bug Tracking Tools
Jira, Confluence, ServiceNow, Bugzilla, Redmine
PROFESSIONAL EXPERIENCE:
Client: JP Morgan Chase Jan 2022 – Present
Role: Sr. DevOps Cloud Engineer
Key Contributions:
Designed and deployed a multitude of applications using the main services of the AWS stack (EC2, S3, RDS, VPC, IAM, ELB, CloudWatch, Route 53, Lambda, and CloudFormation), focusing on high availability and fault tolerance.
Implemented a serverless architecture using API Gateway, Lambda, and DynamoDB, and deployed AWS Lambda code from Amazon S3 buckets. Created a Lambda deployment function and configured it to receive events from an S3 bucket.
Experience in integrating Tomcat with Apache HTTP Server or Nginx for load balancing and reverse proxy setups.
Ability to create and manage inventories, credentials, and job templates in Ansible Tower/AWX.
Experience in designing, deploying, and managing OpenStack-based cloud infrastructure.
Proficiency in OpenStack components such as Nova, Neutron, Cinder, Glance, and Swift.
Experience in writing CloudFormation scripts to deploy different AWS Infrastructure components related to the respective services, managed various resources and data elements defined in the scripts.
Wrote Jenkins pipelines using Groovy scripts.
Skilled in integrating Jira with other DevOps and collaboration tools, such as Bitbucket, Jenkins, Confluence, and Slack.
Managed and administered PostgreSQL databases, including installation, configuration, and ongoing maintenance tasks.
Knowledge of Ansible Tower/AWX best practices for scaling, security, and monitoring.
Hands-on knowledge of deploying and configuring OpenStack services for networking, storage, and compute.
Installed and configured CI/CD tools in the AWS environment and deployed applications to Dev, QA, and Production environments using CloudFormation scripts.
Skilled in troubleshooting JBoss performance issues, memory leaks, and application-specific errors.
Worked on setting up LDAP, SSL certificates on WebLogic using the key stores.
Created Groovy DSL scripts that integrate with Jenkins to automate the creation of seed jobs.
Used AWS Lambda to deploy microservices and to run code triggered by S3 and SNS events. Provided highly durable and available data by creating and managing S3 data store policies, versioning, and lifecycle policies.
Developed DevOps scripts in Groovy to automate data collection and analysis for Cassandra.
Automated AWS infrastructure provisioning using Terraform.
Deployment of cloud services including Jenkins and Nexus using Terraform.
Used AWS Elastic Beanstalk to deploy and scale web applications and services developed with Java, PHP, Node.js, Python, and Docker on familiar servers such as Apache.
Integrated Datadog with various tools and services, such as AWS, Kubernetes, Docker, and more, to collect comprehensive data for monitoring and analysis.
Worked with several Docker components, including Docker Engine, Hub, Machine, Compose, and Docker Registry. Dockerized applications by building Docker images from Dockerfiles and running them as containers, and used Docker consoles to manage the application life cycle.
Collaborated with development support teams to set up a continuous delivery environment using Docker.
Involved in installing and managing automation and monitoring tools such as Nagios, Splunk, and Puppet on Red Hat Linux.
Created dashboards, reports and alerts on Splunk for multiple applications.
Used Kubernetes for automating deployment, scaling and operations of application containers across clusters of hosts providing container centric infrastructure.
Created Kubernetes clusters with objects such as Pods to deploy applications quickly and predictably. Used the Kubernetes dashboard for container management and pod creation.
Migrated SVN repositories to Git and managed Git repositories.
Enabled customers to better manage software development, deployments, and infrastructure with tools such as Ansible, Jenkins, Maven, SVN, and Git.
Automated weekly releases with ANT/Maven scripting for compiling Java code, debugging, and placing builds into the Maven repository.
Implemented high availability solutions for PostgreSQL databases using techniques like replication, failover, and load balancing, ensuring maximum uptime and data reliability.
Ability to set up and manage Jira Software Data Center for high availability and performance scalability.
Familiarity with JBoss management tools, such as the JBoss CLI and the web-based administration console.
Upgraded WebLogic Server 9.2 to WebLogic 10.3.6, updated JDKs, and installed service packs and patches.
Used ServiceNow for release management, incident management, and change management processes.
Deployed applications to on-premises platforms such as OpenShift and Marathon.
Deployed Java, JavaScript, and Go applications to OpenShift pods and maintained those pods.
Deployed Container applications into the Kubernetes cluster using Helm charts.
Maintained the OpenShift clusters for all environments, performing manual deployments and automating the deployment process.
Updated the clusters and installed operators such as Redis and Kafka in OpenShift.
Installed and maintained Conjur in OpenShift to store and retrieve all secrets from the Conjur pod.
Installed Grafana and integrated it with OpenShift to track pod CPU usage, memory usage, and restarts.
Collected all pod metrics (restarts, memory usage, and CPU usage) through Prometheus and visualized them in Grafana.
Environment: AWS, EC2, S3, Groovy, RDS, EBS, Elastic Load Balancer, Docker, Ansible, Python, Jira, WebLogic, Terraform, Hudson, Chef, Vagrant, Linux, SVN, Datadog, Git, Gradle, Jenkins, Ant, ServiceNow.
Client: T-Mobile Feb 2021 – Dec 2021
Role: Sr. DevOps Engineer
Key Contributions:
Worked as Cloud Administrator on Microsoft Azure, configuring virtual machines, storage accounts, and resource groups.
Monitored WebLogic/JBoss server health and security.
Utilized Datadog's monitoring data to identify performance bottlenecks and optimize system resources, leading to enhanced application performance and reduced costs.
Ability to troubleshoot and resolve issues related to OpenStack infrastructure.
Involved in data warehouse implementations using Azure SQL Data Warehouse, SQL Database, Azure Data Lake Storage (ADLS), and Azure Data Factory v2.
Created specifications for ETL processes, finalized requirements, and prepared specification documents. Migrated data from on-premises SQL databases to Azure Synapse Analytics using Azure Data Factory and designed an optimized database architecture.
Responsibilities:
Created Azure Data Factory pipelines for copying data from Azure Blob Storage to SQL Server.
Managed and administered PostgreSQL databases, including installation, configuration, and ongoing maintenance tasks.
Implemented ad-hoc analysis solutions using Azure Data Lake Analytics/Store and HDInsight/Databricks.
Familiarity with OpenStack APIs and integration with other cloud platforms.
Worked with related Microsoft on-premises data platforms, specifically SQL Server, SSIS, SSRS, and SSAS.
Created reusable ADF pipelines to call REST APIs and consume Kafka events.
Managed Windows 2012 servers, troubleshooting IP issues and working with different support teams.
Used PowerShell to write and debug scripts that automate processes and migrate VMs, including copying and creating VHDs.
Ability to configure and manage Jira Service Management (formerly Jira Service Desk) for IT service management and support ticket tracking.
Ability to create and manage inventories, credentials, and job templates in Ansible Tower/AWX.
Configured Microsoft DevTest Labs to migrate virtual machines from one subscription to another.
Developed robust backup and recovery strategies for PostgreSQL databases to safeguard critical data and minimize downtime in case of disasters.
Created ARM templates for the Azure platform.
Managed the day-to-day activity of the cloud environment, supporting development teams with their requirements.
Created labs and virtual machines, set up policies, and used formulas and custom images to deploy the network.
Converted existing virtual machines from Standard to Premium storage accounts. Patched and validated virtual machines in Azure.
Monitored Azure infrastructure through System Center Operations Manager (SCOM).
Worked in Azure development on Azure web applications, App Services, Azure Storage, Azure SQL Database, virtual machines, Fabric Controller, Azure AD, Azure Search, and Notification Hubs.
Expertise in Microsoft Azure Cloud Services (PaaS & IaaS), Application Insights, Document DB, Internet of Things (IoT), Azure Monitoring, Key Vault, Visual Studio Online (VSO) and SQL Azure.
Developed solutions and automated Azure operational tasks and worked with ARM templates and scripts using Azure PowerShell during Automation and Build Process.
Worked on Azure Fabric, Microservices, IoT & Docker containers and involved in setting up Terraform continuous build integration system.
Environment: Azure DevOps, Kubernetes, Docker, ACS & AKS, Prometheus, Splunk, Terraform, WebLogic, Ansible, Jenkins, Git, Azure Boards, Maven, SonarQube, Python, PowerShell, YAML, VSTS, Visual Studio Code, Tomcat, Nginx, Linux, Windows Server 2012/2016, ANT, Subversion (SVN), Chef, Datadog, Vagrant, EC2, Jira, RHEL, SNS, SQS, Shell/Perl scripts, Bitbucket, TFS, SCM, API, Java, Git Bash.
Client: Prudential Financial Oct 2017 – Feb 2021
Role: DevOps Engineer
Key Contributions:
Configured and managed various AWS services including EC2, RDS, VPC, S3, Glacier, CloudWatch, CloudFront, ELB, SNS, SQS, SWF, and Route 53.
Created S3 buckets, defined lifecycle policies, and configured object versioning and event notifications for S3 buckets.
Designed AWS CloudFormation templates to create custom-sized VPCs, subnets, and NAT to ensure successful deployment of web applications and database templates.
Implemented AWS cloud services and automated tasks with Bash and Python scripting.
Responsibilities:
Designed and developed AWS cloud infrastructure using Ansible with AWS CloudFormation, and Jenkins for continuous integration, with highly available and secure multi-zone deployments; automated end-to-end transit hub connectivity resource deployment in AWS via Terraform.
Leveraged Datadog's anomaly detection capabilities to identify unusual behavior in metrics, enabling early detection of issues and potential outages.
Worked with AWS CodePipeline and created CloudFormation JSON templates, which were converted to Terraform for infrastructure as code.
Implemented Terraform modules for deployment of applications across multiple cloud providers.
Implemented reprocessing of failed Kafka messages using their offset IDs.
Familiarity with Jira's advanced features, such as automation rules, post functions, and custom scripts using ScriptRunner for Jira.
Deployed and configured Elasticsearch, Logstash, and Kibana (ELK) for log analytics and application monitoring, integrated with AWS Lambda and CloudWatch.
Created and maintained various DevOps related tools for the team such as provisioning scripts, deployment tools and staged virtual environments using Docker.
Used Docker Registry and deployed to a Docker Swarm cluster using Ansible.
Extensively worked with Chef attributes, templates, recipes, and files to manage configurations across various nodes.
Knowledge of Tomcat security configurations, including SSL setup, user authentication, and access control.
Managed Chef Server in pushing the infrastructure change to client nodes, monitoring the logs and security.
Designed, implemented, and maintained Continuous Integration (Jenkins) and delivery environments.
Managed the Maven repository using Nexus tool and used the same to share the snapshots and releases of internal projects.
Installed Hadoop, MapReduce, and HDFS and developed multiple MapReduce jobs in Pig and Hive for data cleaning and pre-processing.
Implemented Datadog to monitor infrastructure, applications, and services. Set up custom dashboards and alerts to track performance metrics and respond proactively to incidents.
Worked with Git (GitHub) repositories as a distributed version control system.
Extensively worked on integrating Git into the continuous integration (CI) environment with Jenkins and configured the services using modern DevOps tools.
Performed WebLogic Server administration tasks such as installing, configuring, monitoring and performance tuning on Linux environment.
Experienced in shell scripting; used shell scripts to automate deployments and to create merges for the production environment.
Used shell scripts to clear Kafka topics for one of the applications.
Merged release branches back to trunk after production releases and resolved merge conflicts in both Git and Subversion (SVN).
Installed and configured Apache Tomcat and WebLogic application servers for Dev and Integration test environments. Automated deployment to Apache Tomcat application servers using Bash and Python scripts.
Configured and administered JMS Servers, Connection Factories, Topics/Queues, File Stores, Database Stores, Distributed Destinations, JDBC Connection Pools, Multi Pools, and Data Sources on WebLogic Server.
Environment: AWS EC2, S3, RDS, Glacier, Atlassian, ELB, Kubernetes, Windows, ElastiCache, Redis, Aurora, CloudWatch, SNS, Ansible, Jenkins, Nagios, Jira, GitLab, RHEL 7.x, Shell, EMR, Bash, PowerShell, Python, Grafana, Tomcat, Git, WebLogic, WebSphere, Groovy.
Client: Acuity Software Technologies, India Mar 2015 – Sep 2017
Role: Linux Administrator
Responsibilities:
Provided L1 and L2 remote administration support for problems raised on Linux servers.
Performed remote administration through PuTTY and SSH.
Created and maintained user accounts in RedHat Enterprise Linux (RHEL) and other operating systems.
Installed and configured Debian and Ubuntu for Testing the applications.
Experience with file system and disk management.
Created and managed user accounts, groups, and access levels using sudo.
Configured LVM and added disk space as and when required.
Scheduled and controlled jobs using cron.
Monitored performance using top, sar, vmstat, iostat, and netstat, and analyzed the reports to identify performance issues.
Good command of ITIL processes for incident and change management.
Experience with managing disk quotas.
Performed mount and unmount operations on file systems.
Environment: Jenkins, Docker, Nginx, Nagios, Maven, Nexus, Chef, Git, Shell Scripts, Windows PowerShell, Unix/Linux environment, Agile.