Name: Kumar G
Email: *****.*.***@*****.*** Phone: 657-***-****
Cloud & DevOps Engineer / Linux System Administrator
PROFESSIONAL SUMMARY
●7+ years of overall professional experience in DevOps tools, cloud computing (AWS, Azure, GCP), and Linux systems (CentOS, RHEL, Ubuntu), covering installation, configuration, troubleshooting, administration, and build and release management.
●3+ years of experience in Python development and infrastructure management. Skilled in scripting, CI/CD pipeline design, cloud infrastructure, and container orchestration. Able to automate workflows, improve system performance, and ensure the reliability and scalability of systems.
●Experience in various Azure services like Compute (Web Roles, Worker Roles), Azure Websites, Caching, SQL Azure, NoSQL, Storage, Network services, Azure Active Directory, API Management, Scheduling, Auto Scaling, PowerShell Automation, Virtual machines, virtual network, SQL database, traffic manager, and ExpressRoute.
●Experience in Database Administration, development, design, maintenance, and production support of relational databases, business applications, new server setup, MySQL Server installation, upgrade, and migration.
●Experience in maintaining versions across environments using SCM tools such as GitHub, Bitbucket, and GitLab, with Git as the version control system.
●Collaborated with cross-functional teams to deploy, monitor, and maintain applications on the Wind River Cloud Platform, reducing operational costs and improving system uptime.
●Experience using Maven and Ant as build tools to produce deployable artifacts such as jar, war, and ear files from source code through Jenkins. Experience in configuring the Nexus repository manager to maintain release code packages and to publish snapshots and versioned release artifacts within the company.
●Experience in continuous integration technologies like Harness, Bamboo, and Jenkins. Designed and created multiple deployment strategies using Continuous Integration (CI) and Continuous Delivery (CD) pipelines and configuration management tools.
●Hands-on experience in Chef, Puppet, Ansible, and Ansible Tower as Configuration management tools to automate repetitive tasks, quickly deploy critical applications, and proactively manage change.
●Automated routine activities with Python and Bash scripts, and analyzed log files with Python and Ruby scripts to troubleshoot and resolve issues.
●Experience in creating Docker containers from existing Linux containers and AMIs, as well as from scratch. Worked on Kubernetes and Docker images to provide platform as a service (PaaS) on private and public clouds and VMware, and improved manageability of containers using OpenShift.
●Experience with NIC bonding/NIC Teaming configuration in Linux/UNIX/Windows/VMware systems to increase the bandwidth or redundancy based on the requirement of the application.
●Expertise in Unix/Linux shell scripting, Bash scripts, Perl scripts (hashes and arrays), and Python programming for deployment of Java applications on bare servers or middleware tools.
●Experience working with network monitoring tools like Nagios and Splunk. Maintained a Splunk environment with multiple indexes and configured the index settings.
●Performed capacity planning and resource allocation using OEM to ensure optimal performance during high-traffic periods, preventing downtime and maintaining system reliability.
●Used Terraform to write infrastructure as code and created Terraform scripts for EC2 instances, Elastic Load Balancers, and S3 buckets.
●Created custom Helm Charts for multi-environment deployment pipelines, allowing seamless transitions from development to staging and production environments.
●Configured Kong to integrate with AWS Elastic Load Balancers, optimizing API traffic routing and ensuring load balancing for high-traffic applications.
●Experience in server infrastructure deployment on AWS Cloud, with extensive usage of Virtual Private Cloud (VPC), CloudFormation, CloudFront, EC2, RDS, S3, Route53, SNS, SQS, Redshift, EBS, and CloudTrail.
EDUCATION
Bachelor’s degree in Computer Science – SZABIST Karachi, Pakistan
Master’s in Computer Science – Westcliff – Irvine, California
TECHNICAL SKILLS
Operating Systems
Windows servers 2000/2003/2008, UNIX, LINUX, Ubuntu, VMware ESX 4.0/5.1/5.5, CentOS 5/6/7
Databases
MySQL, MongoDB, Cassandra, SQL Server
Automation/Build Tools
ANT, MAVEN, Jenkins, Nagios, JIRA
Bug Tracking Tools
Remedy, JIRA, and ServiceNow
Version Control
Subversion (SVN), CVS, Git
Monitoring tools
Nagios, Zabbix, Sensu, Graphite, Splunk
Web Technologies
HTML, CSS, JavaScript, .Net, Bootstrap
Cloud Technologies
AWS, Microsoft Azure, Google Cloud, OpenStack and Rackspace
PROFESSIONAL EXPERIENCE
META (Redmond, WA) Aug 2022 – Present
DevOps/Infrastructure Engineer
●Building and deploying packages for deployment and integration testing per customer requirements. Working closely with customers to provide recommendations for OS and infrastructure hardening.
●Extensive knowledge of server administration, kernel upgrade, and deployment of patches and applying all firewall and security policies with an emphasis on maintaining best practices.
●Performing daily system monitoring, verifying the integrity and availability of all hardware, and embedded devices such as AR Glasses, server resources, and systems.
●Experienced with board and system upgrades and troubleshooting to the component level, including PC board repair, soldering surface-mount components and connectors, touch-up, PC board wiring, and removing excess solder from components.
●Prepare daily and weekly status reports on Test case execution, defects, RTM, etc.
●Developed Python and shell scripts to automate the build and release process. Improved the speed, scalability, and efficiency of the continuous integration environment, automating wherever possible with shell, Python, and PowerShell scripts.
●Administered and managed both CentOS and Windows Server environments, ensuring high availability and performance across multiple production systems.
●Installed the racks and configured the DELL server infrastructure, performing firmware updates as per compatibility.
●Set up PXE to deploy customized system images, allowing automatic provisioning of hardware with minimal manual intervention.
●Implemented security configurations for Tomcat servers, including SSL/TLS setup, access control, and user authentication, to ensure secure deployment of web applications.
●Configured and maintained automation tools such as Chef for scheduling and deployments through cookbooks and amended as per the requirements.
●Managing lab equipment, installing test prototypes, maintaining setups, troubleshooting functionality, and ensuring calibration is up to date.
●Collaborate with Electrical Engineers or other development teams to identify, define, and solve the prototype/Board issues.
●Managed software updates and patches in lab environments by integrating platforms like Wind River Cloud with CI/CD pipelines, ensuring smooth deployment of software and firmware updates for embedded devices and systems.
●Wrote tests, validated, and debugged the decoder component during different stages of pre-silicon validation.
●Set up Cisco routers and DLI power switches, and configured IP addresses for the devices within the lab.
●Contributed to the deployment of 5G infrastructure, including integrating vRAN, DU, and CU (Centralized Units), supporting the transition from 4G to 5G in alignment with industry standards.
●Configured Helm Chart-based deployments for microservices architecture to ensure high availability and fault tolerance.
●Reviewed and applied security patches to Helm Charts, ensuring the protection of sensitive data during deployments.
●Set up Grafana alerting to monitor critical metrics such as server uptime, database performance, and error rates, ensuring immediate notifications through email, Slack, and PagerDuty whenever a threshold was exceeded.
●Diagnosed hardware and software problems by capturing UART logs, and replaced defective components.
●Testing the functionality of hardware and FW features, Smoke testing, and Regression testing.
●Communicate and coordinate with senior management on major design changes in the lab, such as changes to rack elevations and increases in server capacity.
●Integrated network segmentation into the SDN architecture using Nuage VSP to ensure network isolation, enhance security, and optimize resource utilization.
●Designed and implemented the CI system: configured Jenkins servers and nodes, wrote the required scripts in Python, and stabilized the VMs (Windows/Linux).
●Maintained multiple CI/CD pipelines that fully integrate with agile development practices. These pipelines utilize both traditional and cloud-native tools like Jenkins, Harness, and GitLab CI, allowing for seamless deployments while ensuring security and compliance at every step of the pipeline.
●Designed and implemented automated health checks for the prototypes, calling the required libraries from Python and shell scripts.
●Onboarding and troubleshooting the AR Devices in CI lab infrastructure.
●Monitoring the automated health checks and performing manual remediation on board-level prototypes.
McKinsey & Company (Los Angeles, CA) Feb 2021 – July 2022
DevOps/Cloud Engineer
●Involved in designing and deploying large applications utilizing almost all of the AWS stack (including EC2, Route53, S3, RDS, DynamoDB, SNS, SQS, and IAM), focusing on high availability, fault tolerance, and auto-scaling with AWS CloudFormation.
●Used Terraform templates for AWS infrastructure as code to build staging and production environments, and defined Terraform modules such as Network, Operations, Compute, and User for reuse across environments.
●Worked on various OpenStack components, including Heat Orchestration templates, to automate infrastructure provisioning and management across different projects, ensuring scalability and efficiency.
●Developed cloud-based applications using Wind River Cloud Platform tools, optimizing the integration of edge devices, sensors, and cloud resources to enable real-time data analytics and processing.
●Designed, developed, and maintained Helm Charts to automate the deployment of applications in Kubernetes environments, improving consistency and reducing manual configuration errors.
●Utilized Terragrunt to simplify and streamline the management of Terraform configurations across multiple environments, reducing code duplication and improving maintainability.
●Worked with Jenkins as a continuous integration/deployment tool, with strong experience in the Ant and Maven build frameworks. Implemented Continuous Delivery (CD) using tools such as Docker and GitHub.
●Integrated Kong with AWS RDS (Relational Database Service) to store configuration data, API keys, and logs, ensuring a highly available and managed database solution.
●Managed and administered Apache Tomcat servers for deploying and maintaining Java-based web applications, ensuring high availability and reliability.
●Used Ansible playbooks to automate AWS tasks such as launching EC2 instances, provisioning IAM, and configuring VPC and EBS, and monitored them using CloudWatch and CloudTrail.
●Implemented cluster services using Azure Kubernetes Services (AKS) and Docker to manage local deployments in Kubernetes by building a self-hosted Kubernetes cluster using a Harness CI/CD pipeline.
●Planned and implemented database migrations from DB2 to Oracle and from SQL Server to Oracle.
●Developed and deployed serverless applications using AWS Lambda for efficient event-driven processing and reduced operational overhead.
●Designed and implemented secure cloud architectures focusing on Kubernetes, cloud-native computing, and microservices, ensuring that security is baked into the system from the start.
●Developed high-performance, concurrent applications using Go's goroutines and channels to efficiently handle multiple tasks in parallel, improving system performance and maintaining high availability.
●Installed and configured Nginx as a reverse proxy and web server for multiple sites, and worked with the server, location, and SSL configuration sections in Nginx.
●Worked on GitHub branching, merging, and tagging. Responsible for building out and improving the reliability and performance of cloud applications and cloud infrastructure deployed on Amazon Web Services.
●Worked closely with stakeholders to define and implement security best practices based on industry standards, applying a "security mindset" to every aspect of the system, including network security, identity management, and secure data handling.
Afiniti (Karachi, Pakistan) Sep 2018 – July 2020
Build & Release/VMware Engineer
●Deployed and configured Git repositories with branching, forks, tagging, and notifications, and was proficient in deploying and administering GitHub.
●Installation, configuration, and administration of VMware. Performed physical-to-virtual (VMware) migrations for over 50 servers using VMware Converter Standalone. Also installed recovery solutions for VMs with Veeam Backup and VMware vSphere Data Protection.
●Hands-on experience with web and application servers such as Apache HTTPD, lighttpd, Nginx, Tomcat, WebLogic, and WebSphere. Well versed in Python and Jython scripting to automate WebSphere administration tasks in the development and testing environments.
●Enabled IPv6 for vRealize Automation Entitlements/Blueprints/IaaS Components. Maintenance, Troubleshooting, and creation of workflows in vRealize Orchestrator. Worked on vRealize Operations manager (VROPS) for monitoring the vCenter alerts.
●Managed Azure DevOps build and release pipelines, set up new repos, and managed permissions for various Git branches. Deployed microservices, including provisioning the Azure environment.
●Created storage pools and disk striping for Azure Virtual Machines. Backed up, configured, and restored Azure virtual machines using Azure Backup.
●Assembled the release kit and completed all paperwork, including implementation and backout plans. Created the release plan and defined, collected, analyzed, and presented release project metrics every week.
●Wrote pom.xml files to automate Maven application builds. Maintained Maven/Ant and streamlined the entire build system. Managed build and release infrastructure and optimized build scripts.
●Developed UNIX and Bash scripts for manual deployment of the code to the different environments and kept the team updated when the build was completed.
●Worked on Java Coding for Portal Applications and Tomcat Web server for hosting web apps. Used the SVN version control system to manage different builds for the system.
●Experienced in operations support for monitoring tools including Nagios, Dynatrace, Graphite, Splunk, SolarWinds, and Lancope.
Touch Point (Pvt) Limited (Karachi, Pakistan) June 2017 – Aug 2018
Linux/Middleware Administrator
●Responsible for Tomcat and Apache servers, including installation, upgrades, and maintenance across all environments. Configured SSL for mutual authentication between client and server.
●Installation, configuration, and troubleshooting of Red Hat Enterprise Linux. Handled issues involving booting, RPM packages, user management, and backup, as well as IP bonding for load balancing and failover, firewall, and SELinux.
●Administered Linux servers for several functions including managing Apache/Tomcat server, mail server, and MySQL databases in both development and production.
●Created and deployed LVM, resizing logical volumes and volume groups. Configured and administered LDAP, NFS, FTP, Samba, and Postfix servers on Red Hat Enterprise Linux. Monitored performance, CPU, and memory on Red Hat Enterprise Linux.
●Performed regular security audits and code reviews to identify and resolve potential vulnerabilities within web applications, reducing the risk of data breaches or unauthorized access.
●Experience with the Puppet configuration management tool. Worked with Puppet for system monitoring and automation.
●Managed TCP/IP networking and DHCP servers, resolved TCP/IP network access problems for clients, and worked with network services such as NFSv4, NIS, DNS, and DHCP.
●Applied the Clustering Topology that meets High Availability and Failover requirements for performance and functionality.
●Installation, configuration, and administration of DNS, LDAP, NFS, NIS, and Sendmail on Red Hat Linux/Debian servers.
●Experience in migration of consumer data from one production server to another production server over the network with the help of Bash and Perl scripting.
●Created and maintained network users, user environment, directories, and security.
●Deploying Veritas Clusters and Oracle test databases to implement disaster recovery strategies, ensuring uninterrupted availability of the global systems.