Susan Mathangi
CLOUD/DEVOPS ENGINEER
Email Id: ***********@*****.***
Phone no: 484-***-****
PROFESSIONAL SUMMARY
Over 10 years of experience in IT with extensive expertise in DevOps automation, Build and Release management, AWS and EKS, worked in technical roles both in Linux and Windows environment for build/Release automation process in Web & cloud/server Environment using .NET and Java Technology, AWS & open-source technologies and Strong knowledge in EKS services
As part of continuously delivering Agile team, develop, test, and deploy Data platform features Develop ongoing test automation using Python based framework Using Ansible to Setup/teardown of ELK stack (Elastic search, Log stash, Kibana)
Building/Maintaining Docker container clusters managed by Kubernetes Linux, Bash, GIT, Docker, on AWS.
Utilized Kubernetes and docker for the runtime environment of the CI/CD Pipelines to build, test deploy.
Focus on continuous integration and deployment, promoting Enterprise Solutions to target environments.
Created and deployed many Virtual Machines (VM's) on Azure and have worked on many other services like Azure Active Directory, Virtual Network, Traffic Manager, Load Balancing for Azure, Azure Resource Manager, Application Insights, Notification Hubs, Azure Key Vault, Functions, Auto scale, Express Route etc.
Utilized Azure Service Bus and Web services to handle messaging from thousands of devices, enabling smart phones to interact with vehicle telemetry.
Configured and maintained Jenkins to implement the CI process and integrated the tool with Maven to schedule the builds. Took the sole responsibility to maintain the CI Jenkins server.
Proficient in Build & Release automation framework designing, Continuous Integration and Continuous Delivery, Build & release planning, procedures, scripting & automation. Good at documenting and implementing procedures related to build, deployment, and release.
Involved in creating User Interface using JavaScript (backbone and handlebars).
Experience in Branching, Merging, Tagging, and maintaining the version across the environments using SCM tools like Subversion (SVN), GIT (GitHub, GitLab), Clear case, Harvest and VSS
Experience in Web service client generation and process of XML using JAXB.
Involved in build and maintaining of highly available secure multi-zone AWS cloud Infrastructure utilizing Chef with AWS Cloud Formation and Jenkins for continuous Integration.
Developed modular Terraform configurations to improve scalability and reusability.
Implemented Terraform state management using S3 backend with DynamoDB locking.
Automated CI/CD pipelines with Terraform and GitHub Actions for seamless deployment
Design develops and debug tests and test Framework in a complex using tools like Selenium, Jenkins, and GitHub.
Creating end to end pipelines using GIT, Jenkins for CI/CD, Ansible for Configuration management and monitoring using Nagios/CloudWatch
Migrating on-premises databases into AWS cloud using various AWS resources including EC2, Route53, S3, RDS and IAM policies and migration between Cloud Services (AWS and Azure).
Architected and developed CI/CD systems with Jenkins on AWS Kubernetes, utilizing Kubernetes and Docker for runtime environments to build, test, and deploy applications, establishing a scalable foundation for future ML model integration.
Managed and fixed failed production containers using Docker UCP across multiple environments, ensuring high availability and contributing to 99.9% uptime for critical applications.
Developed modular Terraform configurations to improve scalability and reusability of infrastructure, implementing S3 backend with DynamoDB locking for state management, crucial for reproducible AI/ML environments.
TECHNICAL SKILLS:
CI/CD & Automation: Jenkins, GitLab, GitHub Actions, Terraform, Ansible, Chef, Puppet, Python, Bash, PowerShell
Cloud Platforms: AWS (EC2, S3, VPC, CloudFormation, RDS, Lambda, EKS), Azure (Virtual Networks, SQL DB, ARM), GCP
Containerization: Docker, OpenShift, Kubernetes, Helm, Docker Swarm
Scripting languages: Bash/Shell, Perl, Python, Ruby.
Monitoring & Logging: ELK Stack (Elasticsearch, Logstash, Kibana), Prometheus, Grafana, Splunk, New Relic
MLOps & AI Tools: MLOps, Generative AI (LLMs, Prompt Engineering), AIOps, Model Deployment, Model Monitoring, Data Drift, Pipeline Orchestration
Education qualifications:
Bachelor of Technology Bachelor of Technology JNTU Kakinada, 2012
Masters in information systems and engineering management, Harrisburg University
Certification: AWS solutions architect- Associate.
Microsoft Certified: Devops Engineer expert- well trained, MLOPS- Trained
EMPLOYMENT EXPERIENCE
Comcast, Pennsylvania, Philadelphia.
Role: DevOps/SRE Lead Feb 2019- Till date
Responsibilities:
Expertise in Production support tools Service Now, Remedy, Catch point, Adobe Analytics, ELK, AppDynamics, Splunk, and SCOM etc.
Production support for e-commerce website utilizing .net /site core/ AWS/ SQL technologies.
Involved in requirement gathering, analysis, user story estimation, design, and development of the new features.
Driving the application changes through continuous follow up with business stakeholders, collaborating with solution architects & scrum master and bringing to closure.
Developed proofs of concept for undergoing problems and successfully implemented the solutions.
Identified various errors and issues from the Kibana tool and provide solution through code fix.
Involved in the preparation of technical documentation and functional documentation to agreed quality.
Creating Dashboards in VSTS for CI/CD pipelines, work items and bugs.
Working with the ELK (Elastic Search, Logstash, and Kibana) stack to analyze log data obtained from Microsoft Business Intelligence tools.
Set up and maintained Logging and Monitoring subsystems using tools like; Elasticsearch, Kibana, Prometheus, Grafana, and Alert manager.
Coach teams on Agile principles and Scrum practices, fostering a culture of self-organization, accountability, and iterative delivery.
Remove impediments that block team progress by coordinating with stakeholders and resolving cross-functional dependencies.
Track and report team metrics such as velocity, burndown charts, and sprint health to support data-driven decision-making.
Promote collaboration between Product Owners, developers, and business stakeholders to ensure shared understanding of priorities and deliverables.
Authored detailed user stories and functional requirements for API-driven features, aligning development and QA efforts across Agile sprints.
Identified and resolved API bottlenecks through performance tuning and load testing, improving response times by 25% under peak traffic.
Developed Python scripts to automate API test execution and integrate results into observability dashboards using Grafana and Prometheus.
Designed and implemented scalable Java-based RESTful APIs using Spring Boot, deployed on AWS with integrated CI/CD pipelines using Jenkins and GitHub Actions.
Built microservices with Spring Cloud and Xumo OSS components, supporting service discovery, load balancing, and centralized config management.
Developed serverless Java applications on AWS Lambda, integrating with services like S3, DynamoDB, and Event Bridge for Realtime processing.
Maintained message-driven architectures with Apache Kafka, ensuring high throughput and reliability
Provided end-to-end support for distributed Java-based applications, ensuring reliability across payment platforms
Administered cloud-native deployments on AWS and PCF, optimizing resource usage and scalability
Orchestrated containerized microservices on Kubernetes with secure CI/CD pipelines
Maintained Kafka clusters to handle high-throughput streaming for financial transactions
Tuned PostgreSQL databases for performance, availability, and complex query execution
Leveraged Splunk for log correlation, real-time alerting, and dashboard creation tailored to transaction flows
Established infrastructure and service monitoring using Prometheus and Grafana.
Fixed failed production containers for our environment using Docker UCP and responsible for over 6 different environments, including production.
Providing daily support of service Management Platform (SNOW), Including scripting, Configuration and Customization.
Administration ServiceNow processes (User management/Group management), Functions, Service Catalog, and Workflow.
Generating different kinds of scheduled Reports and creating new Dashboards, Homepages
Experience authoring Helm charts
Experience deploying and maintaining applications in cloud systems like Amazon AWS and EKS
Experience with Windows and Linux system administration, including Bash and PowerShell
Experience with all phases of the software development lifecycle
Experience with CI/CD pipeline tooling including build, test, code scanning, deployment
Understanding of continuous monitoring, security scanning, container image scanning, identity management, PKI, IAC,
Experience with tools such as SonarQube, Git, JFrog Artifactory, Nexus, Jenkins, Gitlab, Helm, and Jira
To provide file management, user management, scheduled maintenance, installation and configuration, documentation, and troubleshooting. Made structured, and Ad-Hoc deployment and git-releases using GitLab for multiple environments including production.
Worked on Docker Maintenance such as searching and killing zombie containers to help
increase the all-around productivity of the swarm nodes that are hosting important applications.
Used Logstash monitoring to identify potential failures and errors for our internal apps, including docker containers.
Monitoring of EC2 instances in AWS and servers in Azure using New Relic and ELK by creating New Relic and ELK Dashboards. Involved in Monitoring, Network Monitoring and log Trace Monitoring, Automatic notifications send to the required team on the status of servers.
Designed and maintained CI/CD pipelines on OpenShift using Jenkins and GitHub Actions, streamlining application delivery and rollback processes.
Implemented role-based access control (RBAC) and integrated OpenShift with enterprise identity providers for secure multi-tenant access.
Environment: NET, Node.js, Express, Server-Side JavaScript, Jenkins, Docker, Python, AWS CSS3, HTML5, Client Site JavaScript, jQuery, Dust.js, Elastic search, AWS (EC2, S3, EBS, ELB, IAM, SQS, RDS, Autoscaling), GIT, Bitbucket, Cloud Formation Templates, Jenkins, Groovy, Docker, JIRA, Red Hat Linux, WebLogic Servers, Nginx, Frog, Shell scripts, Kubernetes, Networking.
Dimension Data Americas, Virginia, Reston.
Role: Cloud & DevOps Engineer. Jan 2017- Dec2018
Responsibilities:
Created AWS cloud formation templates to create custom-sized VPC, subnets, EC2 instances, ELB, security groups. Worked on tagging standard for proper identification and ownership of EC2 instances and other AWS Services like Cloud Front, cloud watch, RDS, S3, Route53, SNS, SQS, Cloud Trail.
Implemented and maintained the monitoring and alerting of production and corporate servers/storage using AWS cloud watch.
Creating Python scripts to totally automate AWS services which includes web servers, ELB, Cloud Front distribution, database, EC2 and database security groups and application configuration, this script creates stacks, single servers, or joins web servers to stacks.
Design EC2 instance architecture to meet high availability application architecture and security parameters.
Specific project experience using AWS and Google Cloud Platform for hosting virtual instances and handling scalability.
Designed Azure-native solutions using ARM templates and Azure SQL Database, improving deployment efficiency by 25% post-migration by retiring legacy Compute Engine instances.
Orchestrated migration of monolithic apps to Azure using Terraform-provisioned infrastructure and Azure Pipelines for blue-green application deployments, achieving 99.9% uptime by shifting traffic via AZURE Traffic Manager.
Standardized Terraform configurations with Terr grunt to maintain IAC parity between GCP (legacy) and
Azure (target) environments, reducing migration-related configuration errors.
Automated Azure IaaS VM provisioning using Terraform modules and deployed VM Scale Sets in production, reducing manual configuration time.
Automated Azure and AWS infrastructure provisioning using Ansible and Ansible Automation Platform
Implemented a Continuous Delivery pipeline with Docker, Jenkins and GitHub and AWS AMI's, whenever a new GitHub branch gets started, Jenkins, our Continuous Integration server, automatically attempts to build a new Docker container from it, The Docker container leverages Linux containers and has the AMI baked in.
Developed microservice onboarding tools leveraging Python and Jenkins allowing for easy creation and maintenance of build jobs and Kubernetes deploy and services.
Environment: AWS, Azure, Load Balancers, Chef, Ansible, Shell, Python, Linux, Jenkins, Docker, Virtualization, Kubernetes, Configured plug-ins for Apache HTTP server, LDAP, JDK1.7, XML, SVN, Git
Idea Cellular Pvt Ltd, India
Role: Linux Administrator Sep 2012 to Oct 2015
Responsibilities:
Custom build of Windows servers which includes adding users, SAN, network configuration, installing application related packages
, managing services.
Responsible for maintenance of development tools and utilities and to maintain shell, Perl automation Scripts.
Installation, maintenance and administration of Oracle and Db2 Server on Sun Servers.
Log file was managed for troubleshooting and probable errors.
Adding servers to domain and managing the groups and user in Active Directory, installing and configuring send mail.
Remote system administration using tools like SSH, Telnet, and Rlogin.
Setup Clustering for Linux servers.
Environment: SNrMP, SMTP, NFS, NIS, NIS+, Microsoft Windows, Linux.