Engineer Aws

Location:

Brooklyn, NY

Posted:

March 08, 2021

Contact this candidate

Resume:

Mohammad Waqas

Linux/Cloud Engineer

*******.*********@*****.***

347-***-****

Career Objective:

Over all 7 years of experience in the field of IT and presently occupying the position of Linux System-DevOps Engineer. Experience entailing deploying, configuring, installing, maintaining, monitoring of Linux based servers. Excellent in trouble shooting OS and networking related issues. Strongly familiar virtual environment and smoothly handle VMware and adept with automation tools such as Ansible, bash scripting and other DevOps tools like Docker and GIT. Worked with cloud-based technology (AWS) and now looking to leverage my experience and get a diverse environment where I can grow and acquaint myself to cutting-edge technology.

Core Competence:

Operating Systems

RHEL/ CentOS 5, 6, 7,8

Virtualization

Oracle Virtual Box, ESXi, VMWare, vSphere, vCenter,

Storage

SAN, NAS, NFS, S3, Glacier

Software Management

YUM, RPM, Repos

DevOps Tools

Ansible, Ansible Tower, Git, GitHub, Docker, Kubernetes

Applications/Servers

LVM, RAID, NIC Bonding, File System (EXT3, EXT4, XFS), Nagios, PXE, KickStart, DHCP, DNS, NFS, FTP, TFTP, SSH, HTTPD, Apache.

Experience:

Linux Systems Engineer 12/2018 – Present

FedEx, Buffalo, NY (fedex onsite 3488 main street)

Managing assigned projects and program components to deliver services in accordance with established guidelines.

Monitoring, analyzing, and optimizing the virtualization performance of ESXi hosts and virtual machines.

I have done worked on esxi on 5.5 and 6.7. I worked with vcenter make templates of vm, snapshot, clone and configure drs and enable HA.

Have extensive experience in monitoring, automating, deploying as well as troubleshooting in a Build/Release Environment

(monitoring == Nagios, automation use ansible)

we have a monitoring team who is responsible for monitoring the servers. Im responsible to resolve the incident tickets generate through the monitoring tool.

Installing, configuring, administrating RHEL 6, 7, and 8 PxeBoot with Kickstart, custom configuration like package management, volume management of servers.

Primarly worked on red hat 7. But I have experienced on redhat 6 and 8. For bare metal provision server pxeboot with kickstart. Install multiple packages and deploy applications on O.S

Skillful in Amazon Web Services AWS provisioning and acumen in AWS services like EC2, ELB, S3, Glacier, IAM, RDS, EBS, EFS, Cloud Watch and autoscaling.

I have experience on aws platform and their services including launching ec2 instances.

Creating and managing AWS users and groups and use permissions to allow and deny their access to AWS resources by using IAM.

Experienced on creating IAM. (. I worked with aws iam to create and manage aws users and groups. I also used policies to allow and deny in aws resources. i also created roles for users and apply policies to users)

Set up AWS CloudWatch for continuous monitoring of EC2 instances and AWS resources to manage infrastructure integrity.

Create billing alert cloudwatch to monitor services. (billing alerts, server health) We can CloudWatch the health check, system performance, billing alert, overall, the health of all the server. We need to configure alarm for threshold.

Expertise’s in creating Amazon EC2 instances and work closely with infrastructure teams to troubleshoot complex issues and setting up security groups.

I have experience with provision and spinning ec2 with AMI. or golden templates.

Implement AWS High-Availability using AWS Elastic Load Balancing (ELB), which perform a balance across instances in multiple Availability Zones.

For high availability I configure ELB. So, If the load one of the servers go up and reached it to the threshold than spin up the ec2 instances to balance the load that is called auto scaling

Build S3 buckets and managed policies for S3 buckets and Glacier for storage and backup on AWS.

I build s3 buckets to store object files in s3. And manage policy in s3. For long term we store object on glacier.

Hands on Experience in AWS Elastic Block Storage (EBS), various volume types and utilization of different types of EBS volumes based on requirement.

(I have done spin ec2 instance. I worked with ebs volume I know about ephemeral storage, I worked with s3 bucks and assig permission to s3 buckets, I have access the s3 buckets through ec2. I attach ebs volume attach them and detach them. I worked with ss to open ports. I I create new filesystem based on ebs create logical volume extend them through the ebs volume. I know different types of ec2. I monitor the service with cloud watch. There are different type of instance it depend on your need.)

Implementing patching, upgrades, vulnerability/ security management, and file system maintenance.

I did patch with ansible playbooks and run on ansible tower. Patching quarterly. Did non-prod servers first and after that prod. And after patch i validate.

Installing, Configuring and mange in Ansible Centralized Server and creating the playbooks to support various middleware application servers, and involve in managing Ansible Tower to automate repetitive tasks.

I make inventory on ansible tower and create host and inventory. And run playbooks on ansible tower. Creating job templates in ansible tower.

Proficient in handling Ansible including creating host inventories, ad-hoc commands, and creating Ansible Playbooks with Ansible modules, using roles, adding tags and handlers.

Create adhoc command and run multiple servers and wrote several playbooks using different modules and create roles to organize the playbook and reuse the playbooks. Add tags to skip tasks and do one tasks from one playbook. And handlers to do the condition if the task done so do other task.

Managing Ansible Tower to create templates from the GIT master server and deploy the configuration across the Linux environment.

We use git for control versioning all the playbooks are saved in git and I pull those playbooks from git and run them on the ansible tower

Experience in branching, tagging and maintain the versions across the environments using version control tools like GIT and GITHUB.

For control versioning I used git push pull code from git and manage code through cli by making clone.

Excellent in creating Docker images by using Docker File, worked on Docker container snapshots, managing Docker volume.

Working experience on docker. Maintaining the docker by stop, run, start and also build docker images.

Dockers is basically Containerization and it runs as a process, where you can isolate apps from each other and run them on multiple containers using the same resources same operation system but different libraries.

Commands: Docker pull hello-world

docker images -a - shows all images

docker run hello-worlddocker run -it

docker start <container id> -->

do ctrl p and ctrl q to quit without killing container

docker rmi <image id removes image.

Working experience of Kubernetes including setting up a Kubernetes cluster, managing clusters, creating & managing pods, deployments, and application management.

Kubernetes is an open source orchestration tool developed by google for managing microservices or containerized application across a distributed cluster of nodes Kubernetes provides highly resilient infrastructure with zero downtime deployment capabilities automatic rollback scaling and self healing of containers which consists of auto placement auto restart auto replication and scaling of containers of the basis of cpu usage.

1master node has different components which include api server controllers schedulers and etcd so there are 2 type of controller cloud and kube ctl and cloud is responsible for the cloud environment dependencies so the kube ctl responsible for all the master node process and scheduler is responsible for the scheduling the jobs …the etcd is the internal volume that stores the most recent information in form of key value when ever you want some most recent information regarding the cluster you go to the etcd .

2why? So the main reason of Kubernetes is to hide the complexity managing fleet of containers by providing rest api s .

3its high availability .

Troubleshooting and correcting server, client, and network configuration issues on operating system level.

By doing basic network issue using different network tools by ifcfg, ping, tcpdump and make sure the server are up and running on network.

Using Nagios XI as IT infrastructure monitoring tool for monitoring host resources such as processor load, disk usage, system logs, monitoring applications, services, and network protocols.

Server sluggishness it will generate the ticket and im responsible to solve those tickets accordingly

Worked with Jira Service Desk ticketing system to work on day to day issues.

Incident tickes ssh cpu needs attention

Configuring NFS server sharing of files and using NFS for storage management purpose.

Configured nfs server and make sure the shared file export properly.

Implementing various RAID levels. Including RAID (0, 1, 5, 6, 10) based on the customers priorities related to performance, and redundancy.

Improve speed, efficiency, and scalability of the continuous integration environment, automating wherever possible using Shell and Bash Scripts.

Write scripts and run via for a loop script to gather and perform certain tasks.

Linux Systems Engineer 10/2015 – 11/2018

HP, Austin, TX (3800 quickhill road)

Primarily responsible for keeping the servers up and running as well as providing direct user support for any technical issues related to Linux systems.

So I make sure that network is up and running. User ssh working properly

Managed full lifecycles of servers i.e. deployed, provisioned, troubleshoot, maintained, and decommissioned.

to maintain So since im part of the operations team Im responsible deploy configure and validate the application on the os level. I provision the servers if its baremetal I do it with pixeboot with kick start and if it’s a vm I do it with a golden template.check the servers health and decommissioned with my team by doing change request.

Configured Linux with Kickstart on multiple servers for network installation and Perform custom configuration like package management, volume management.

I manage the packages with yum and rpm (updating,,, removing,,,installing,,, erasing),,,using lvm the mange the disk space by taking free space from the vg and allocating to the lv.

Created mount points for Server directories and mounted these directories in FSTAB as well as with Autofs.

To make it mount persistent we make entries in etc fstab

Used LVM and created physical volume, volume groups and Logical volumes. Created file systems (Ext3, EXT4, XFS) on demand considering the nature of data.

To manage the file space I used lvm. First create the pv the vg create and give space to lv by taking the space from vg. And make file system by mkfs.

Created and pushing SSH keys to configure password-less SSH for security and automation purposes.

For security and automate task on remote node I created ssh key and copy it to client nodes.

For fault-tolerance, load-balance, and redundancy configure NIC-bonding/ Channel-bonding on new builds.

Create NIC bonding in which I create multiple interfaces and slaves to load the balance on interfaces.

Monitored the performance of the system using top, sar, vmstat, netstat and iostat to check the CPU utilization, bottlenecks of I/O devices, memory usage and network traffic.

Monitored the health of the server by top command and look on load average and narrow down te issue of sluggishness. And after that go into further commands vmstat, iostat and sar to narrow down the issue.

Worked on Linux kernel parameter tuning and load unload kernel modules.

Change different kernel parameters value to increase the server performance ( like kernel memory pid) and load and unload the modules

Resolving network issues using network tools like ping, traceroute, tcpdump.

I resolved the basic network issues by using different tools like ping, ifconfig, traceroute, tcpdump and swnd the overall network activity to network team by tcpdump -w

Managed TCP and UDP. Also experience in transferring files across networks using protocols like SCP, SFTP and FTP.

To send files securely over the network by scp. Sftp. Because it is more secure and work on ssh.

Examined system Log files of all system events and controlled the System Logging services.

Monitor and examined var/log/message and secure and also look into live messafe to narrow down the issue ( tail -f /var/log/message)

Resolving the alerts generated by Nagios such as network connectivity issues, high CPU spikes, systems resources being maxed out, critical system services and application availability.

In my day to day I resolve incident generated by nagios and troubleshhot os related and network related issues.

Utilized cron to schedule jobs. Created and modified Linux shell scripting to automate tasks & integrate solutions.

Create cron jobs to schedule the tasks and shell scripting to run tasks on multiple servers.

Managed the virtual memory and managing the processes i.e. nice and renice process, sending kill signals,

Creating user accounts and adding user into groups on need basis, manually adding users, granting sudo permissions as required.

Managed Access Administration, restricting and securing unauthorized access to the hosts and granting privileged access to the user accounts.

Used Bash Shell scripts to automate the deployment process.

Linux Admin 11/2013 – 09/2015

Yahoo, Lockport, NY 5319 Enterprise Dr, Lockport, NY 14094

Provided support in the installation and implementation of hardware and software components.

Check computer hardware (RAM, HDD, mouse, keyboards etc.) to ensure functionality.

Check Fc and Ethernet cable connection between server, patch panel and core switch.

Proactively Handled tickets that involved troubleshooting equipment hardware issues.

Configured iLO/iDRAC console. and performed onsite data center deployments for racking/stacking new physical servers.

Experience installing, upgrading, and configuring Red Hat Linux using Kickstart Servers and interactive Installation.

Used SSH keys to connect local and remote servers on daily basis.

Performed tasks following the ticketing system Service Now.

Monitored System performance such as CPU, Memory, Disk usage.

Performance monitoring done with conventional tools like Sar, top, iostat & netstat.

Performed NetBackup and Recovery, maintained regular backups using tar, gzip and snapshot.

Scanning hosts for vulnerabilities and remediating them through timely pat.

Maintained the health of the systems and troubleshoot network issues and make sure the networks are up and running.

Manage Remote Access using third party software MobaXterm, PuTTY.

Implementing solutions to optimize the infrastructure solutions.

EDUCATION

Brooklyn College, New York June 2022

Contact this candidate