MUHAMMAD IRTAZA KHALID
**** ***** **** *****, ************, MD 21030 • 443-***-**** • ****************@*****.***
PROFILE SUMMARY
I have extensive experience in Configuring, Administering, Maintaining, and troubleshooting a diverse range of Linux, Windows and Sun Solaris Systems. My advanced knowledge spans System security, and I excel in troubleshooting and configuration both Linux and Solaris environment also Windows server 2012R, 2016, 2019 & 2022. Additionally, I am adept at integrating, designing, configuration, development and security management of Linux Solaris and Windows. Proven ability to thrive in result-oriented, and highly dynamic work setting, adapting quickly to new hardware and software. Possess strong troubleshooting, problem solving and analytic skills. Creative, attention to detail, well organized and ability to meet deadlines. Skilled in interacting, developing, and communicating high end technical ideas to technical staff. Some notable highlights are
Over 10 years of experience with strong expertise in the fields of DevOps using various automation tools to oversee the end-to-end SDLC process.
Experienced in Administration of Production, Development and Test environments carrying different operating system platforms like Windows, Linux, and Unix.
Demonstrated success in maintaining system integrity, managing configuration changes, performing upgrades and monitoring cluster performance in a multi-tenancy environment.
Strong initiative and drive with ability to absorb technical information/knowledge quickly.
Organized and dedicated team player with good communication, management, analytical and organizational skills.
SKILLS
Infrastructure & Cloud Management:
VMWare vSphere, ESXi 6.5/6.7/7/8, Proxmox VE, OpenStack, AWS, Azure, GCP, RedHat Satellite, Terraform, Ansible, Puppet, Varitas VCS, Veeam, Apache Tomcat, Nginx.
Operating Systems:
Windows server (2008, 2012, 2016, 2019, 2022), Linux/UNIX RedHat (6,7,8,9), CentOS, Fedora, Ubuntu, Debian, Solaris 9/10, AIX 7.1/7.2
Monitoring & Logging:
Prometheus & Grafana, StackDriver, Zabbix, Nagios XI, Pub/Sub, ElasticSearch
CI/CD & Version Control:
Git, GitHub, Piper, Jenkins, Rapid
Project Management & ITSM:
Jira, ServiceNow
Databases:
Oracle (12c,18c,19c), SQL Server, ProtgreSQL
PROFESSIONAL SKILLS AND ACCOMPLISHED TASKS
Extensive experience as System Administration/Engineering with various Linux flavors i.e. Centos, Rocky Linux, RedHat, Ubuntu, Solaris and Software configuration management and DevOps methodology.
Specialized in Application Support on RedHat Linux, Fedora, CentOS, Ubuntu, and Virtualization.
Proficient in installation, configuration, maintenance and troubleshooting of RedHat Enterprise Linux 6.x/7.x/8.x/9.x, CentOS, Fedora, Ubuntu, Rocky Linux etc.
Expertise in configuring JumpStart and Kickstart servers for mass installation of Red Hat Linux.
Expertise in Installation, Configuration, Integration, Fine Tuning, Backup, Crash recovery, Upgrades, Patching, Monitoring System Performance, Network Security and Troubleshooting of 2000+ Red hat Linux Servers.
Provide support for physical, virtual, and cloud-based Linux like & Windows Machines.
Experience with Virtualization tools including VMware vSphere, PROXMOX VE for managing the infrastructure.
Skilled in setting up RAID0, RAID1 and RAID5 for data redundancy using Veritas Volume Manager and Veritas File System.
Experience in installing the VERITAS, installing, and configuring the Veritas cluster server (VCS) using the Veritas storage administration (Veritas volume manager, Veritas file systems) for the SAN configuration.
Expertise in DevOps, Release Engineering, Configuration Management, Cloud Infrastructure, Automation including AWS, Ansible. GCP, AZURE.
Strong experience in Backup, Job Scheduling, Disk Management, Logical Volume Management (LVM), Logical Partitioning, troubleshooting network problems, recovery system performance monitoring, kernel tuning and debugging OS failure.
Configured YUM to install and upgrade RPM package and local repositories.
Performed UNIX System Administration Fine tuning, Kernel debugging, process scheduling, disk and file system I/O, kernel internals, TCP/IP communications.
Designing, implementation and administration of networks using Linux/Unix/Windows server, configuration of switch, router & firewall & Web min, MySQL, Bash Scripting, VMware, LAMP and Apache web servers.
Installation, Configuration and administration of DNS, LDAP, NFS, NIS and Send mail on Red Hat Linux/Debian Servers.
Proficiency in writing automation scripts which analyze and monitor the system performance through various systems using Shell, PowerShell, Python, Java to support infrastructure as code, continuous deployment.
Administration and Maintenance of software services like FTP, SFTP, TCP/IP, HTTP, NFS, SCP, SAMBA, VMWare, DNS, DHCP, LDAP, Firewall, Kickstart, and SMTP(simple mail transfer protocol) on servers.
Expertise in user administration setup and monitoring account performance using Zabbix, Nagios, Splunk and Cloud watch.
Created Bash, Shell & python scripts for various system Administration tasks to automate repeated processes. Created shell scripts for automating the CRON job for daily maintenance and updates.
Provide the support of building the server, patching, user administration tasks, deployment, software installation, performance tuning and troubleshooting and KVM.
Tomcat/Apache installation, configuration and troubleshooting.
Providing administration of Linux – create users and groups, permissions, folders, mail, web, installation
Maintaining patch updates, security hardening, and software and hardware updates to provide for a stable infrastructure.
Monitoring server resource utilization and administrating servers to maintain system efficiency.
Interacting with other system engineers, developers, database administrators, client teams, vendors, and other teams for infrastructure compliance and system certification.
Planning and coordinating outages with application teams, database team, and other stakeholders.
Making technical recommendations on architecture, capacity planning, security, and system performance optimization.
Analyzing server/system logs to identify health concerns.
Developing and maintain scripts to automate administrative activities.
Implementing, documenting, and coordinating server and storage policies, procedures, and standards.
Perform and test Linux system configuration backups and restores to ensure system recovery from anomalies or catastrophic failures.
Perform daily system monitoring, verifying the integrity and availability of all hardware, server resources, systems and key processes, reviewing system and application logs, and verifying completion of scheduled jobs such as backups.
Audit and harden systems that communicate with external clients and servers.
Strong experience working with various Development & Test teams in an Enterprise level environment.
Good experience in reviewing system logs files for errors. Settings up CRON jobs for backups and monitoring process.
Add, remove and resizing Logical volumes using LVM in Linux and implementing software RAID at installation time.
Define/setup network protocols, Network File Services (NFS), and Network Information Services (NIS) in Linux Environments.
Knowledge on Amazon Web Services (AWS) administration.
Experience in writing scripts to automate jobs and debugging scripts.
Experience with process engineering, and change management in a complex environment.
Experience working with ticketing systems.
Strong problem-solving approach, monitoring and 24/7 production support in mission critical environments.
Maintenance level upgrades and software administration RPM installation on Linux using YUM, DNF.
Provide drill down reporting for application teams to use in monitoring their application resource usage/performance using Nagios. Zabbix.
Experienced in DNS, NIS, NFS, FTP, NIS+, Samba Server, LDAP, remote access, security management, and system troubleshooting skills.
Familiar with SAN migration.
Experience in VMware Installed and monitored Virtual environments with ESXI 6.7 servers and VSphere.
Installation, Configuration and Management of RDBMS and NoSql tools such as SQL, MySQL, ORACEL DB, MongoDB.
Experience in all phases of the software development life cycle (SDLC) methodologies like Waterfall and Agile/Scrum, with specific focus on the build and release of quality software.
Excellent communication and strong interpersonal skills with ability to interact with end-users, technical personnel, and teammates.
Deliver comprehensive hand-on system management and administration support to maintain 24x7 opertions.
EDUCATION
Master of Scienc Computer Sciences University of the Punjab (2005)
Bachelors of Science Computer Sciences University of the Punjab (2001)
PROFESSIONAL EXPERIENCE
Office of Inspector General DOT(Contractor) 02/24 - Present
The Office of Inspector General (OIG) at the DOT is an independent office within the DOT. Its responsibility is to conducting audits, investigations and evaluations to promote efficiency, effectiveness and integrity in DOT. I joined OIG as a contractor and having designation of Cloud Engineer to support their infrastructure and migrate the on prem infrastructure to Azure could. My responsibilities include here
Administered and maintained mixed environment infrastructure, encompassing both Windows Server and Linux system ensuring seamless interoperability and integration.
Configured and managed Active Directory, Group Policy and DNS on Windows Server, while also overseeing LDAP on Linux machines.
Designed to implemented a scalable Azure Architecture for high availability applications, leveraging Azure Resource Manager, Virtual Networks and Load Balancers.
Conducted detailed assessment and developed migration strategies for transitioning legacy system to Azure, ensuring minimal downtime and data integrity.
Implemented Azure AD and plan to implement Azure Security Center to enhance security posture and compliance with industry standards and regulations.
Automated CI/CD pipelines using Azure DevOps, improving release frequency by 40% and reducing manual intervention.
Collaborated with stakeholders to define migration goals and deliverables, ensuring alignment with business objectives and minimizing impact on ongoing operations.
Developed and executed disaster recovery plans in Azure, ensuring business continuity and data protection in case of system failures.
Successfully plan the migration of Microsoft Teams from on-premises infrastructure to Azure, enhancing scalability and performance for all department’s users.
Implement Azure-based solutions to streamline Microsoft Teams deployment, integrating Azure AD for seamless authentication and user management.
Managed Microsoft 365 tenant, including user provisioning, licensing and security settings for a diverse organization.
Configured and maintained Exchange Online, SharePoint Online, and Teams to ensure seamless communication and collaboration.
Managed and maintained SharePoint sites, including custom workflows and automated processes, enhancing team productivity and document management.
Optimized Microsoft Teams performance by leveraging Azure services, resulting in an improvement in application responsiveness and user experience.
Managed Azure resource allocation and budgeting, reducing operational costs while maintaining high availability and performance for Microsoft Teams.
Enhanced security protocols by configuring Azure Security Center and Azure Policy to ensure compliance and protect Microsoft Teams data during and after migration.
Created and updated the technical documentation and migration guides to Azure, facilitating knowledge transfer and future maintenance.
Conducted regular performance tuning and optimization of VEEAM backup jobs, resulting in a reduction in backup windows and improved system performance.
Developed and tested comprehensive disaster recovery plans using VEEAM tools, including failover testing and recovery simulations, to ensure minimal downtime in case of system failures.
Configured VEEAM monitoring and alerting systems to proactively address backup issues and generated detailed reports for compliance and performance review purposes.
Conducted capacity planning and management for backup storage including scaling solutions to accommodate growing data volumes and optimizing storage utilization.
Integrated VEEAM backup & replication with cloud services to enhance overall data protection strategies.
Maintained, managed and updated SCCM environment to streaming software deployment, patch management and operating system deployment on endpoints across the board.
Configured and maintained SCCM to automate the patch management process, resulting in 35% reduction in vulnerabilities and improved system compliance.
Monitored and troubleshot patch deployment issues, ensuring timely updates and system stability.
Developed and maintained custom task sequences and deployment images to reduce OS deployment time and managed and updated Windows image builds to ensure compatibility with current hardware and software requirements.
Utilized SCCM reporting tools to identify and resolve issues, improving operational efficiency and system performance.
Oversaw the deployment and configuration of Azure update management solutions for Azure environments, ensuring timely and efficient application of updates across multiple virtual machines and services.
Configured and managed update deployments using Azure Update Manager, including patch management for Windows and Linux VMs, ensuring compatibility and stability of applications.
Conducted performance tuning and resource optimization for vSphere environments, improving VM efficiency and reducing operational costs.
Utilized VMware Tools and vCenter Operations manager to monitor and troubleshoot performance issues, enhancing overall system reliability.
Implemented and tested disaster recovery plans using VMware Site Recovery Manager (SRM) and vSphere Replication, ensuring business continuity and data protection.
Managed backup solutions with VMware Data Protection (VDP) or third-party tools, ensuring regular backups and swift recovery processes.
Led the successful upgrade of VMware Horizon 7 to Horizon 8, overseeing all phases of the project including planning, execution and post upgrade validation.
Coordinated with cross-functional teams to ensure seamless integration and minimal disruption during the Horizon upgrade process.
Executed a comprehensive migration strategy from Horizon 7 to Horizon 8, including the installation and configuration of Horizon 8 components.
Identified and resolved issues during and after the upgrade process, ensuring a smooth transition and minimizing downtime.
Integrated Horizon 8 with existing IT infrastructure and applications, ensuring compatibility and seamless operation within the broader IT environment.
Globant IT Corp San Francisco
DevOps Engineer (Client Google) 02/21 – 12/23
The Google’s supply chain infrastructure department plays a crucial role in managing and supporting the internal supply chain for it data centers and other infrastructure needs. The ASCII Infra team ensures that all the components required for Google’s vast network of data centers are procured, delivered and managed efficiently. The infrastructure main consist to manage BlueYonder’s WMS, TMS and WCS applications for supply chain managment. My role here includes
Experienced with deploying and Maintenance applications on GCP, including configuration and monitoring Google Cloud Platform.
Upgraded Windows Server 2012 to 2019 for Active Directory Domain Control for Warehouse Control System (WCS).
Analyzed vendor applications, providing operational support, administering and implementing new systems, ensuring transition of plans to production, documenting production applications, training new employees, monitoring performance metrics, preparing project requirements, developed associated projects for diverse applications, participating in weekly meetings to discuss appropriate strategies and maintaining all production applications.
Configured and automated Google Cloud Services including GCP Compute Engine. IAM, Cloud DNS, VPC, Cloud Pub/Sub and Google Storage Buckets.
Utilized StackDriver and ElasticSearch for Infrastructure monitoring and logs.
Implemented GCP Firewall rules to control traffic to/from VM instances
Used GCP Cloud CDN to enhance user experience by reducing latency through content caching.
Designed and deployed a production ready, load balanced, highly available, fault tolerant, auto scaling Google cloud platform infrastructure and micro-services container orchestration.
Developed and managed Terraform scripts for infrastructure provisioning and configuration.
Employed infrastructure as code (IaaC), execution plans, resources management and change automation using Terraform as a code.
Used Ansible for Configuration management and to automate repetitive tasks, quickly deploy critical updates and proactively manage changes.
Created Ansible Playbooks for setting up a continuous Delivery Pipeline and deployed micro services, and provisioning environments.
Created VMs clusters to run Kubernetes and pushing them into GCP using Ansible and deploying them into hosting environments using GCP container as service.
Deployed and managed Kubernetes pods, auto-scaling and monitoring via the Kubernetes dashboard.
Configured Puppet for managing scalable infrastructure, including module creation, updates and configuration management.
Deployed Puppet and Puppet dashboard for configuration management to existing infrastructure and monitor scalable infrastructure on GCP & configuration management using Puppet and Ansible.
Wrote, Managed, reviewed and documented modules, manifest, piper repositories for Puppet on RHEL and Windows platform.
Managed Docker orchestration and Docker containerization using Kubernetes, deploy, maintain and improve performance over containerized applications in GKE to support application development.
Sync Puppet code to DEV, UAT, Connect and PROD environments through Rapid CI/CD pipeline tool (google’s tool).
Developed and implemented custom metrics in StackDriver to monitor and alert on database backup statuses, enhancing system reliability and data integrity.
Configured automated alerts and notifications based on backup completion and failure metrics, ensuring prompt response to potential issues.
Coordinate and executed comprehensive Oracle DB 12c refreshes for WMS (Warehouse Management System), ensuring minimal downtime and adherence to project timelines.
Executed data migration tasks and synchronized databases across development, test and production environments, utilizing Oracle Data Pump and SQL Loader.
Monitored and optimized database performance during and after refreshes, identified and resolved issues to maintain high efficiency and reliability.
Add & remove ports to different spoke and HUBS (sites) in JDA (application) through puppet and deployed the code through Rapid CI/CD pipeline to DEV, UAT and PROD on GCP.
Deployed various application like WCS, TMS using Puppet in Test and Production environments and provisioning the environment through Terraform.
Add sysPassword and systemPassword to Fraggle (Secret Manager) to automate the db refresh procedures.
Used Google’s tool like Piper(repo), Cider(editor), Code Search for day to day tasks and projects.
Experienced with configuration management automation tool Ansible and worked on integration Ansible YAML Scripts; created playbooks, roles & tasks, and push rollouts through Ansible.
Validated Puppet using PDK and edit the existing scripts for updated Puppet code.
Worked on Puppet to organize and execute configuration plans on servers & worked on modules of Puppet for its manifests.
Deploy IRM (Incident management tool) for end to end incident lifecycle management.
RedFort Technologies (Client Intelsat) 03/18 – 12/20
SYSTEM ADMIN/DEVOPS ENGINEER
Experienced all aspects of AWS: EC2, ELB, S3, SNS, SQS, RDS, VPC, Elastic IP, Route 53, Glacier, IAM, CloudFormation and Cloud Watch.
Designed and implemented scale-able, secure cloud architecture based on Amazon Web Services. Leveraged AWS cloud services such as EC2; auto-scaling; and VPC (Virtual Private Cloud) to build secure, highly scale-able and flexible environment.
Created AWS Route53 to route traffic between different regions.
Utilize CloudWatch to monitor resource such as EC2, CPU, memory, DB services, EBS volumes etc.
Experienced with configuration management automation tool Ansible and worked on integrated Ansible YAML Scripts; created playbooks, roles & tasks.
Created multiple Terraform modules to manage configurations, applications, services, and automated installation processes of web and app servers.
Installation configuration of Puppet master and Puppet agent to ensure central administration, user creation, package update and maintenance of the infrastructure.
Worked on Puppet to organize and execute configuration plans on servers & worked on modules of Puppet for its manifests on servers.
Validated Puppet using PDK and rewrote existing scripts for updated Puppet code.
Adept in installation, configuration, and administration of AIX, Red Hat Linux RHEL Red Hat Satellite 5.6
Integrated machines and systems onto satellite 6 using bootstrap scripts.
Created channels for Red Hat Satellite for patching process.
Created scripts to sync channels with older versions
Server provisioning and decommissioning
Installation and setting up kubernetes cluster on AWS from scratch.
Worked along with team LDAP Migration to AD; perform user management
Installed Sophos Antivirus for Linux Agents onto Linux to connect with Sophos console
Held daily Kanban meeting with team, to offer support and guidance with daily workload
Created spreadsheet that is now being implemented for future budget reports along with helping to keep up with compliance.
Upgraded Oracle DB from 11g to 12c and 18c. Supported Oracle people soft financial application on a VCS clustered environment. Worked with RMAN to backup and restore Oracle DB. Configured Data Guard to replicate data DR site.
Experienced aspects of Azure: Azure VMs, Blob Storage, Service Bus, Virtual Network, VPN Gateway, DNS, Loadbalancer.
Successfully planning, implementing and managing the migration of on-premises Active Directory to Azure Active Direcotry. Adept at leveraging cloud technologies to enhance organizational efficiency and security.
Executed end to end migration project including directory synchronization, user account provisioning, and application integration, minimizing downtime and ensuring business continuity.
RedFort Technologies (Client Hewlett Packard) 03/15 – 02/18
SYSTEMS ADMIN/DEVOPS ENGINEER
Experienced in working on DevOps/ Agile operations process and tools area (code review, unit test automation, build and release automation, environment, service, incident, and change management).
Installation, configuration and OS upgrades and patches on RHEL 5/6/7 as well as CentOS.
Implemented and administered VMware ESX for running the Windows, Centos and Red Hat Linux Servers on developmental and test servers.
Creating volume groups, logical volumes and extending them using LVM.
Responsible for configuring networking concepts like NIS, NFS, SAMBA, LDAP, SSH, SFTP, SNMP, DNS, DHCP and troubleshooting network problems related to LAN and TCP/IP issues.
Managed shared NFS mounting and unmounting NFS server, NFS client on remote machine, sharing of remote file folder.
Configuring, managing, and scheduling CRONTABs for app accounts and backup management on a regular basis.
Installing and configuring Apache Web Server.
Responsible for setting up new instances, migrating existing services from physical servers to AWS cloud.
Configure, monitor, and maintain AWS VPC environment.
Create, maintain, and manage automated build processes for AWS environment using Puppet.
Experience configuring and managing Puppet master server, updating, and creating modules and pushing them to Puppet clients.
Worked on Puppet to organize and execute configuration plans on servers & worked on modules of Puppet for its manifests on servers.
Automating administration tasks with Puppet Enterprise Edison using predetermined manifests to configure, update, and install servers as well as complete support for applications, systems, and user support. Complete package management and monitoring inventory of the nodes.
Trained and supported Linux engineers in the use of the company’s Puppet Infrastructure.
Working with SAN team for exporting and mounting shared volumes.
Extensive exposure to Configuration Management policies and practices with regards to SDLC; along with automation of scripting using BASH, Python scripting.
Developed scripts for automating administration tasks like customizing user environment and performance monitoring and tuning with nfsstat, netstat, iostat, and vmstat.
Created Bash shell scripts to monitor system resources and system maintenance and performed administrative tasks such as system startup/shutdown
Promptly resolve issues ranging from hardware and software issues to desktop start up and server performance issues.
Installing Netbackup server, clients, and ensuring clients are being backed up regularly.
Ensuring backup of all the servers and clients using Symantec Netbackup Software, restoring data in the event of emergency.
Administered security, users, group administration and daily backup and restore operations, networking service, performance and resource monitoring and performed disaster recovery management procedures.
Administration responsibilities include user, group, disk, and security management.
Managed to user and application account creations, deletions, and setting up sudo access for DBA and application account access.
Worked with team for database performance issues, network related issues and with vendors for hardware related issues.
Installed and configured monitoring tools Nagios, Zabbix, Splunk for monitoring the network bandwidth.
BNY Mellon 01/14 – 03/15
SYSTEM ADMIN/DEVOPS ENGINEER
Working on CentOS and Red Hat, installing and upgrading operating system.
Duties spanned from user accounts creation, modification, disk management adding and removing packages and patches.
Monitoring servers process using tools like Nagios.
Monitoring AWS resources using cloud watch.
Managing and maintaining AWS EC2, VPC, IAM, S3 Bucket, AMI, Route 53.
Setting up Idrac and ILO IP addresses to Dell and HP.
Setting RAID levels on Dell, IBM, and HP systems.
Resolving Level 1 and Level 2 issues on Linux systems.
Working with vendors to order new hardware
Setting up network printers, performing backups and restoration using NetBackup as needed.
Responsible for documenting client’s day to day issues and resolutions using internal Wiki site.
Cloning VM machines.
Installed and deployed Red Hat Enterprise, CentOS and installation of packages and patches for Red Hat servers.
Strong knowledge on file System Ext3, Ext4, and NFS.
Configuration management and administration on standard UNIX services like SSH, LDAP, SSL, NFS, Sudo, and FTP.
Managed sharing, mounting, and unmounting NFS server, NFS client on remote machine, sharing remote file folder, starting and stopping the NFS services.
Responsible for maintenance, enhancements, and production support.
Plan, organize, and direct sustainment activities. Establish work standards, methods, and controls for preventative, scheduled and unscheduled maintenance actions.
Worked with the business and database team creating design documents, screen designs, process flows and data definitions and working prototype.
Responsible for Continuous Integration (CI) and Continuous Delivery (CD) process implementation using Jenkins along with Shell scripts to automate routines jobs, configure enterprise scale infrastructure and application deployments.
PSI Pikesville MD 03/10 – 11/13
LINUX ADMINISTRATOR
Worked on CentOS and Red Hat, installing and upgrading operating systems.
Support their infrastructure, troubleshooting, system integration, patching, updating, setting WAN and LAN infrastructure.
Duties spanned from user accounts creation, modification, disk management adding and removing packages and patches.
Setting up Idrac and ILO IP addresses to Dell and HP.
Setting RAID levels on Dell, IBM, and HP systems.
Resolving Level 1 and Level 2 issues on Linux systems.
Working with vendors to order new hardware
Up and running Database servers, health checks of the servers
Monitoring the servers.
CERTIFICATIONS
Certified AWS Cloud Practitioner