Sujan
Phone: +1-469-***-****
Email: *****************@*****.***
DEVOPS/SRE Engineer
Professional Summary:
DevOps / SRE / Cloud Engineer with 11 years of experience in designing, implementing, and managing scalable infrastructure and CI/CD pipelines across cloud environments.
Strong hands-on experience with AWS cloud services, including EC2, S3, CloudWatch, Lambda, CloudFormation, EBS, RDS, EKS, DynamoDB, VPC, SNS, OpenSearch, Route53, KMS, Security Groups, Auto Scaling, and AMIs.
Experience working with Microsoft Azure services such as Azure DevOps, Azure Active Directory, and Azure Data Factory for cloud infrastructure and CI/CD processes.
Skilled in Linux administration and server virtualization, including creating and managing Linux VMs for server consolidation and infrastructure optimization.
Extensive experience installing, upgrading, and configuring Red Hat Enterprise Linux (RHEL) 5.x, 6.x, and 7.x using Kickstart servers.
Hands-on experience implementing CI/CD pipelines using Jenkins, with strong expertise in build automation using Maven and Gradle.
Automated CI/CD workflows by developing Jenkins pipelines using Groovy scripts to streamline build, test, and deployment processes.
Experience working with multiple Application Performance Monitoring (APM) tools including Dynatrace, AppDynamics, CA Nimsoft, Foglight, Compuware, BMC, IBM, and Microsoft monitoring solutions.
Implemented serverless architectures on AWS using Lambda, API Gateway, Route53, S3, and Kinesis to improve scalability and reduce infrastructure management overhead.
Implemented AWS security best practices using services such as Amazon Inspector, AWS WAF to prevent SQL injection and XSS attacks, and AWS Certificate Manager for SSL certificate management.
Designed and deployed infrastructure using Terraform for Infrastructure as Code (IaC), provisioning cloud resources such as virtual networks, compute instances, and auto-scaling groups across cloud environments including GCP.
Experience designing and deploying scalable, highly available cloud-native applications on AWS and Google Cloud Platform (GCP).
Strong expertise in cloud infrastructure automation using tools such as AWS, Ansible, Terraform, Docker, Jenkins, Harness, GitHub Actions, Maven, Tomcat, and Linux.
Hands-on experience implementing and managing the ELK stack (Elasticsearch, Logstash, Kibana) for centralized logging and monitoring within AWS environments.
Configured Logstash pipelines to process and export logs to AWS S3 for long-term storage and analysis.
Experienced in containerization and orchestration technologies, including Docker, Docker Compose, Kubernetes, and Helm charts for deploying and managing microservices-based applications.
Built and managed Kubernetes clusters using AWS CloudFormation, monitoring pod health and application performance using Kubernetes tools and Helm charts.
Strong experience deploying and managing Azure Kubernetes Service (AKS) clusters for container orchestration and cloud-native application deployment.
Implemented auto-scaling strategies for AKS clusters to dynamically adjust resources based on workload demands and ensure optimal performance.
Extensive experience with Docker containerization, including creating Docker images, managing Docker Hub repositories, and deploying containerized applications.
Experience with Docker Swarm and Kubernetes orchestration for managing containerized environments and microservices deployments.
Proficient in configuration management using Chef and Puppet, developing automation scripts and modules for infrastructure provisioning and application deployment.
Developed Chef automation scripts using Ruby for provisioning services and applications on cloud infrastructure.
Implemented Puppet-based configuration management, including Puppet Dashboard and PuppetDB, and created custom modules for infrastructure automation.
Built CI/CD automation pipelines using Jenkins, Hudson, and Bamboo, including job configuration, pipeline setup, and automated build and deployment processes.
Hands-on experience configuring Bamboo for continuous integration and deployment, including build plans, automated testing, and environment deployments.
Strong experience with build automation tools such as Maven, Ant, and Gradle to generate deployable artifacts (JAR, WAR, EAR).
Experience using artifact repositories such as Nexus and JFrog Artifactory for managing build artifacts and dependencies.
Integrated applications with modern monitoring and observability platforms including Prometheus, Datadog, and ELK for performance monitoring and alerting.
Technical Skills:
Cloud Platforms: Amazon Web Services (AWS), Google Cloud Platform (GCP), Microsoft Azure
AWS Services: VPC, IAM, S3, Elastic Beanstalk, CloudFront, Redshift, AWS Lambda, Kinesis, DynamoDB, Direct Connect, Storage Gateway, Database Migration Service (DMS), Server Migration Service (SMS), Simple Notification Service (SNS), Simple Workflow Service (SWF)
Operating Systems: Linux (Red Hat Enterprise Linux, CentOS, SUSE, Ubuntu, Debian), Solaris, HP-UX, Microsoft Windows
Scripting & Programming Languages: Shell Scripting (Bash), Groovy, Python, Ruby, Perl, PowerShell, YAML
Version Control Systems: Git, GitHub, GitLab, Subversion (SVN), Bitbucket
Build & Code Quality Tools: Maven, Gradle, Ant, SonarQube, Nexus Repository Manager
Containerization Technologies: Docker, Kubernetes, Amazon ECS, Apache Mesos
Application Servers & Web Servers: WebSphere Application Server, Apache Tomcat, JBoss, Oracle WebLogic, Nginx
Automation & Configuration Management: Ansible, Chef, Puppet, Jenkins
Container Orchestration & Cluster Management: Kubernetes, Docker Swarm, Apache Mesos, Marathon, Google Compute Engine
Networking & Protocols: TCP/IP, DNS, DHCP, LAN/WAN, FTP/TFTP, SMTP, Cisco Routers and Switches
Monitoring & Logging Tools: Nagios, AWS CloudWatch, Splunk, ELK Stack (Elasticsearch, Logstash, Kibana)
Issue & Bug Tracking Tools: JIRA, Bugzilla, Redmine
Professional Experience
Wells Fargo – Charlotte, NC Jan 2024 – Present
DEVOPS/SRE Engineer
Responsibilities:
Developed automation scripts to streamline CDN configuration and deployment, significantly reducing deployment time and improving operational efficiency.
Implemented and managed Content Delivery Network (CDN) solutions to enhance content delivery performance and minimize latency for end users.
Monitored CDN performance metrics and implemented optimization strategies to improve cache hit ratios and reduce load on origin servers.
Designed and implemented monitoring and alerting systems using Prometheus and Grafana, reducing Mean Time to Resolution (MTTR) for production incidents.
Built reusable infrastructure components using AWS CDK to enable faster and standardized cloud deployment.
Performed application performance analysis using Dynatrace PurePath, conducting transaction analysis and code-level diagnostics to identify and resolve performance bottlenecks.
Provisioned and managed infrastructure environments on Google Cloud Platform (GCP) using Terraform as part of Infrastructure as Code (IaC) practices.
Developed Python-based AWS Lambda functions to automate tagging of cloud resources by integrating AWS services with JIRA ticket data.
Created and maintained API documentation to ensure clear integration guidelines and facilitate collaboration across development teams.
Automated cross-platform tasks across Linux, Windows, and GCP environments to improve operational efficiency and consistency.
Generated automated reports to track cloud resource utilization and implemented monitoring using AWS Limit Monitor to maintain resource governance.
Implemented Infrastructure as Code solutions using Terraform, Ansible, and Helm for automated provisioning and configuration of cloud resources.
Designed and deployed AWS Lambda functions to automate backend workflows and event-driven processes.
Developed and implemented Python-based microservices architectures, integrating cloud infrastructure, deployment pipelines, and application components.
Automated and maintained CI/CD pipelines using Jenkins, GitLab CI/CD, CircleCI, and GitHub Actions, enabling continuous integration, automated testing, and streamlined application deployments.
Implemented monitoring and observability solutions using Prometheus, Grafana, ELK Stack, and Datadog for real-time monitoring, logging, and performance analysis of cloud-native systems.
Deployed and managed Kubernetes environments, implementing monitoring and alerting mechanisms to proactively detect and resolve pod-level issues.
Managed Kubernetes deployments using Helm charts, ensuring reproducible builds and streamlined release management for containerized applications.
Regularly reviewed and updated AWS CloudFormation templates to align with evolving infrastructure requirements and maintain configuration consistency.
Ensured AWS Lambda security best practices, including secure data handling, encryption, and compliance with cloud security standards.
Implemented security configurations within the ELK Stack, including authentication, authorization, and encryption mechanisms.
Supported large-scale cloud migration initiatives, contributing to infrastructure setup in AWS for migrating 250+ on-premise applications to cloud environments.
Implemented configuration management using Ansible and Puppet, ensuring consistent application configuration across development, staging, and production environments.
Designed and implemented automated build, test, and deployment pipelines using Jenkins and GitLab CI/CD.
Provisioned and managed cloud infrastructure across AWS and Azure using Terraform and Ansible, following Infrastructure as Code best practices.
Optimized cloud infrastructure for performance, cost efficiency, and security, implementing auto-scaling and monitoring strategies.
Developed and maintained event-driven AWS Lambda functions for real-time data processing and automated workflows.
Implemented centralized logging and monitoring solutions using ELK Stack and Splunk, providing real-time visibility into system health and performance.
Collaborated closely with development, QA, and operations teams to improve deployment processes, increase system reliability, and enhance overall DevOps practices.
Environment: Ansible, Jenkins, XL Release, PowerShell, Linux, VMware, REST APIs, JFrog Artifactory, AWS (EC2, RDS, ELB, S3, CloudWatch, Route53, Lambda, VPC, Auto Scaling), CloudFormation, Terraform, Harness, Maven, Nexus, Chef, Docker, Kubernetes, Apache, JBoss, Nginx, Tomcat, Git, Python, Shell Scripting, Datadog, Nagios, Jira.
USAA – San Antonio, TX Jan 2022 – Dec 2023
DEVOPS/SRE Engineer
Responsibilities:
Worked extensively with Google Cloud Platform (GCP) services including Compute Engine, Cloud Load Balancing, Cloud Storage, Cloud SQL, Stackdriver Monitoring, and Deployment Manager to manage scalable cloud infrastructure.
Implemented Infrastructure as Code (IaC) using Terraform, managing execution plans, resource graphs, and automated infrastructure provisioning across GCP and AWS environments.
Integrated Jenkins, Helm, Kubernetes, and HashiCorp Vault with GCP to enable automated and semi-automated application deployments across development, staging, and production environments.
Built a fully automated CI/CD platform using Jenkins, Harness, and GitHub to coordinate code builds, promotion pipelines, and orchestrated deployments.
Managed and maintained cloud infrastructure operations, including middleware platforms and application deployments across clustered and hybrid cloud environments.
Developed Infrastructure as Code scripts using Terraform and AWS CloudFormation to automate provisioning and configuration of cloud resources for modern applications.
Implemented Terraform state management and remote backend configurations to ensure secure and consistent infrastructure deployments.
Designed and implemented container orchestration solutions using Kubernetes, improving multi-tenant container platform capabilities and deployment automation.
Implemented configuration management using Puppet, automating deployment and configuration of infrastructure components across multiple environments.
Managed and maintained OpenShift clusters, including scaling AWS application nodes and optimizing cluster performance.
Utilized Ansible automation to manage and replace OpenShift components, improving deployment consistency and infrastructure reliability.
Designed and implemented end-to-end CI/CD pipelines using Git, Jenkins, Maven, ECR, and EKS, automating the process from code check-in to deployment of Docker containers into Kubernetes clusters.
Developed Puppet manifests and modules to standardize system configurations across distributed environments and ensure infrastructure consistency.
Containerized applications using Docker, enabling consistent development, testing, and production environments.
Automated provisioning and lifecycle management of AWS resources using CloudFormation templates.
Leveraged Microsoft Azure migration services, including Azure Migrate, Azure Site Recovery, and Azure Database Migration Service, to support cloud migration initiatives.
Implemented Azure services such as Azure App Service, Azure SQL Database, and Azure Blob Storage to improve application scalability and performance.
Performed Terraform code reviews to ensure infrastructure code adhered to organizational standards and best practices.
Developed custom Slack bots using Slack APIs to automate operational tasks, generate real-time notifications, and enhance team collaboration.
Built reusable infrastructure constructs to standardize and accelerate cloud infrastructure deployments.
Established and managed Kubernetes clusters using Helm charts, improving reproducibility of builds and managing Kubernetes application releases.
Developed and maintained Node.js applications using TypeScript, improving code maintainability and development efficiency.
Collaborated with development and operations teams to implement Infrastructure as Code practices and integrate Puppet within CI/CD pipelines for seamless application deployment.
Implemented Slack automation workflows and real-time alert integrations with external systems for proactive incident notifications.
Developed and maintained RESTful APIs using Python and API Gateway for integration with cloud services and internal applications.
Designed Jenkins pipelines using Groovy scripts to automate builds using tools like Gradle and CocoaPods and deploy mobile applications for testing and validation.
Developed automation scripts using Groovy DSL for Jenkins to create seed jobs and automate CI/CD tasks.
Automated operational workflows and data collection tasks using Groovy-based DevOps scripts.
Designed and deployed microservices architectures using Node.js, enabling independent service scaling and modular development.
Implemented LDAP-based security configurations for Jenkins using Groovy scripts and security matrix controls.
Automated configuration management tasks using Ansible and Ansible Tower, streamlining application deployments and infrastructure updates.
Implemented and monitored AWS Lambda functions, optimizing performance and cost using AWS CloudWatch monitoring.
Built CI/CD pipelines across multiple platforms including AWS CodePipeline, Azure DevOps, Bitrise, and Jenkins, supporting deployments for microservices and mobile applications.
Configured and managed AWS services including EC2, S3, RDS, Elastic Load Balancing, ECS, and Lambda for application hosting and infrastructure scalability.
Developed and deployed Kubernetes manifests and Helm charts for microservices deployment within Kubernetes clusters.
Monitored infrastructure performance using CloudWatch, Datadog, and other observability tools.
Created Shell scripts to automate canary and blue-green deployments using Harness.
Developed PowerShell automation scripts with robust logging and error handling to support operational troubleshooting.
Implemented monitoring and observability solutions using Splunk, improving visibility into application performance and infrastructure health.
Contributed to improving engineering quality and operational excellence by enhancing Splunk Observability for web applications, services, and APIs.
Implemented containerization solutions using Docker and Kubernetes to support auto-scaling, CI/CD integration, and container image deployment through registries.
Environment: AWS, EC2, RDS, ELB, S3, CloudWatch, Datadog, CloudFormation, Route53, Lambda, VPC, Auto Scaling, Maven, Nexus, Chef, Terraform, Harness, Jenkins CI/CD, Nagios, Jira, Shell Scripting, Python, Apache, JBoss, Nginx, Tomcat, Git, Docker, Kubernetes, Google Cloud Platform (GCP), ServiceNow, Cassandra, Kafka, Blue-Green Deployment.
Salesforce – San Francisco, CA Aug 2019 – Dec 2021
DevOps Engineer
Responsibilities:
Worked extensively with AWS services including EC2, S3, RDS, EBS, Elastic Load Balancing, and Auto Scaling Groups to build and manage scalable cloud infrastructure.
Automated cloud infrastructure provisioning and deployments using AWS CloudFormation, Chef, and Python, improving efficiency and reducing manual intervention.
Maintained and optimized AWS cloud environments, ensuring adherence to best practices and keeping up to date with new AWS features and service updates.
Managed and configured iOS/macOS build environments, including Xcode upgrades and Android build tools for mobile application development pipelines.
Installed, upgraded, and configured Red Hat Enterprise Linux (RHEL) versions 5.x, 6.x, and 7.x using standardized Gold Disk Images.
Configured Linux environments for Oracle Real Application Clusters (RAC) and implemented SAN-based mount points for high-performance storage integration.
Managed and maintained cloud computing infrastructure and enterprise systems across multiple environments and locations.
Configured and administered AWS cloud resources, including EC2 instances, S3 storage, RDS databases, and load balancing services.
Built and managed Kubernetes clusters using Helm charts, creating and managing pods, replication controllers, replica sets, services, deployments, labels, health checks, and ingress/egress configurations through YAML manifests.
Implemented Infrastructure as Code (IaC) to automate provisioning, configuration, and scaling of Apache Kafka clusters.
Implemented monitoring and logging solutions for Kafka clusters, ensuring proactive detection and resolution of performance and operational issues.
Integrated Kafka with multiple data sources and downstream systems, enabling real-time data streaming and event-driven architecture.
Created and maintained Helm charts to package, version, and deploy applications consistently on Azure Kubernetes Service (AKS).
Managed and upgraded OpenShift clusters, including node scaling, node evacuation, and cluster maintenance activities.
Implemented Canary deployment strategies using Harness by designing CD workflows and pipelines to ensure safer application releases.
Managed AWS infrastructure using Terraform, implementing Infrastructure as Code for automated resource provisioning and lifecycle management.
Configured Azure Blob Storage services, including block blobs, page blobs, and append blobs, for efficient storage of large volumes of unstructured data accessible through HTTP/HTTPS endpoints.
Built and managed Docker containers and Dockerfiles for multiple application environments, enabling consistent containerized deployments.
Worked extensively with Docker images, Docker Hub, and container registries, deploying containerized workloads into Kubernetes environments.
Developed Python automation scripts using JSON and XML for HTTP requests, API integrations, and web data parsing.
Configured and managed Ansible automation frameworks, including inventory management, host configurations, YAML playbooks, and custom modules.
Implemented continuous deployment automation using Ansible playbooks, enabling streamlined application deployment processes.
Utilized Chef configuration management for automated infrastructure provisioning and unattended node bootstrapping within AWS environments.
Managed CI/CD pipelines and cloud resources to support continuous integration and continuous delivery for high-velocity development environments.
Configured Chef Workstations and bootstrapped enterprise nodes, managing keys and infrastructure configuration policies.
Implemented Git-based version control strategies, including feature branches, development branches, staging branches, and master branch workflows, with pull requests and code reviews.
Environment: AWS, Azure, Apache Kafka, Chef, Docker, Ansible, Jenkins, Terraform, Kubernetes, ANT, Maven, Ruby, Shell Scripting, Python, WebLogic Server 11g, Apache Tomcat 7.x, Load Balancers, WLST, Apache HTTP Server 2.4, Nginx, LDAP, JDK 1.7, XML, GitHub, Nagios, Splunk, Virtualization.
Vanguard – Malvern, PA Feb 2018 – Aug 2019
DevOps Engineer
Responsibilities:
Developed build and deployment processes for Pre-production environments.
Designed and deployed AWS solutions using EC2, S3, RDS, EBS, Elastic Load Balancing, and Auto Scaling groups.
Designed AWS CloudFormation templates to create custom-sized VPCs and subnets with NAT (Network Address Translation) to ensure successful deployment of web applications and database templates.
Migrated an existing legacy infrastructure and recreated the entire environment within AWS.
Wrote Python scripts to deploy applications across Linux servers using Puppet.
Developed scripts to push patches and files and to control configuration drift using Puppet.
Used Ant build scripts and Jenkins jobs to promote deployments from one environment to another.
Configured Jenkins to build Java code generated with MetaCase software and ran the CI process on the generated code.
Maintained branches and forks in GitHub for cookbook changes made for each release.
Used Splunk APM for log aggregation and analysis across application servers, and integrated Splunk with Single Sign-On authentication and the ServiceNow ticketing tool.
Wrote Shell and Perl scripts for automation.
Configured Nexus Repository Manager to share build artifacts.
Used Jenkins for continuous integration and end-to-end automation of all builds and deployments.
Used Puppet and UrbanCode Deploy for application delivery automation.
Developed client-side web interfaces using HTML, CSS, and JavaScript and deployed them to S3 buckets.
Used Puppet with Ruby to manage deployment automation.
Automated testing, build, and deployment by developing and maintaining the processes and associated scripts and tools.
Configured and maintained common Linux applications such as Apache, active, NFS, DHCP, BIND, SSH, and SNMP.
Implemented and configured Nagios in the production environment for continuous monitoring of applications, with notifications via email and text message.
Built and configured Red Hat Linux servers using a Kickstart server as required for the project. Maintained maximum uptime and performance capacity for enterprise production, QA, and UAT/staging environments.
Environment: Linux, Perl, AWS Services, VPC, IAM, Security, Chef, Ansible, Puppet, Jenkins, Nagios, PHP, Ruby, JIRA, Git, Docker, Vagrant, SVN, Ant, Bash, JavaScript, HTML, CSS, PowerShell, NT Command Shell.
State of Oregon, Salem, Portland June 2016 – Jan 2018
DevOps Engineer
Responsibilities:
Deployed and managed servers on both traditional and cloud platforms, such as VMware and Amazon EC2, using Chef and Puppet configuration management from initial technology development through production and maintenance.
Designed and deployed AWS solutions using EC2, S3, RDS, EBS, Elastic Load Balancing, and Auto Scaling groups.
Built and deployed artifacts into DEV, SIT, and QA environments.
Configured and activated Docker containers, handled live web hosting, and performed troubleshooting.
Installed Chef Server Enterprise on a workstation and bootstrapped nodes using Knife.
Tested cookbooks on different platforms using Chef Test Kitchen.
Hands-on experience with the Chef DK for managing cookbooks and documenting usage patterns, and with the Delivery CLI for automating Chef workflows.
Maintained and assessed infrastructure through Chef Compliance, using InSpec to test nodes against specified policies.
Wrote and integrated Chef cookbooks and recipes to automate the deployment process.
Implemented user-raised change requests for server configuration, kernel tuning, software installation, and OS hardening.
Managed Linux VMs using Puppet per project requirements, working with Puppet modules, manifests, and roles.
Worked in an agile development team to deliver an end-to-end continuous integration/continuous delivery product in an open-source environment using tools such as Chef and Jenkins. Integrated the CI/CD process using Jenkins, Nexus, Yum, and Chef.
Set up Git-based configuration management from scratch and prepared the accompanying documentation.
Pushed code changes to GitHub.
Automated the release process by pushing all code to GitHub.
Created Python scripts to fully automate AWS services, including web servers, ELB, CloudFront distributions, databases, EC2 and database security groups, S3 buckets, and application configuration. The scripts create stacks and single servers, or join web servers to stacks.
Troubleshot the automated installation and configuration of applications in test environments.
Environment: AWS, SVN, Jenkins, Git, GitLab, Artifactory, WebSphere, Ansible, Chef, JIRA, Confluence, ServiceNow.
Indus Software Solutions – Hyderabad, India June 2012 – Dec 2014
Linux Admin
Responsibilities:
Installed and configured Linux services; monitored system performance, logs, and activity; and handled storage management, user management, and system and network security.
Worked with virtualization technologies such as VMware vSphere and Xen.
Made extensive use of Logical Volume Manager (LVM) and RAID administration.
Installed, configured, and supported Apache web servers on Linux servers.
Wrote shell scripts and scheduled cron jobs to automate disk usage checks, file system cleanup, daily backups, and the rollout of new changes to maintain consistency.
Created packages and performed patch management.
Installation, configuration, administration and troubleshooting.
Experience with Internet technologies including TCP/IP and HTTP.
Deployed and configured LAMP stacks (Linux, Apache, MySQL, and PHP).
Determined the root cause of network outages and issues using Linux command-line tools such as ifconfig, tcpdump, netstat, nslookup, host, whois, and traceroute.
Provided 24x7 on-call support on a shift rotation basis.
Environment: Red Hat Linux, VMware vSphere, Xen, LVM, RAID, RPM, Yum, DHCP, DNS, NFS, FTP, Apache, MySQL, and PHP
Certifications:
AWS Certified DevOps Engineer – Professional
Microsoft Certified: DevOps Engineer Expert (AZ-400)
GitLab Certified DevOps Professional
Education
M.S. in Computer Science, Texas A&M University – Commerce – 2016
Bachelor's in ECE – JNTU Hyderabad – 2012