Asad Talha Mohsan
Linux Operations Engineer
• New York • 929-***-**** • *************@*****.***
A seasoned Linux Operations Engineer with over 6 years of expertise in building, configuring, and maintaining scalable, secure, and highly available hybrid and complex environments. Experienced in managing AWS cloud services, automating Linux-based environments using Ansible, and managing source code using GitLab. Proven experience in incident response and proactive remediation, with expertise in root cause analysis and performance tuning. Exposed to Docker and Kubernetes for handling containerized applications, enabling scalable and resilient platforms while continuously driving operational excellence and system reliability. Additionally, expert in user management and in storage and filesystem management.
Experience
NOV 2023 – CURRENT
Linux Operations Engineer Accenture, New York, NY
Architected and scaled resilient Linux-based infrastructure to support mission-critical workloads across hybrid cloud environments, aligning with SLOs and compliance requirements.
Defined and enforced infrastructure standards across dev, staging, and production environments.
Automated Linux system provisioning and configuration management across dev, staging, and production environments using Ansible, ensuring consistency and reducing manual effort.
Led organization-wide adoption of Ansible and Git, establishing version-controlled infrastructure workflows that improved change predictability, auditability, and deployment reliability across Linux server fleets.
Led the performance tuning and optimization of large-scale Linux environments, applying advanced kernel tuning and system diagnostics to ensure consistent low-latency performance.
Directed the design and rollout of observability stacks (Prometheus, Grafana) to provide actionable insights across compute, storage, and application tiers.
Executed SRE best practices around incident management, SLIs/SLOs, and reliability metrics, creating dashboards and runbooks for proactive issue detection and response.
Provided technical leadership during incident response, owning root cause analysis, postmortems, and driving long-term remediation plans across cross-functional teams.
Designed and implemented secure authentication and access strategies for Linux systems, integrating LDAP/AD, SSH key management, and sudo policy enforcement.
Oversaw the migration of legacy workloads into containerized environments using Docker and Kubernetes, improving deployment reliability and system elasticity.
Mentored junior engineers on advanced Linux internals, shell scripting, Ansible automation, and debugging practices, fostering a culture of knowledge sharing and operational excellence.
Collaborated with security teams to implement OS-level hardening, vulnerability remediation, and proactive patching workflows for Linux servers.
Defined backup and disaster recovery strategies across heterogeneous Linux platforms, incorporating snapshot automation and routine restore testing to validate RPO/RTO targets.
Drove AWS cloud-native platform adoption by designing scalable VPC architectures, implementing IAM best practices, enabling EC2 Auto Scaling, and leveraging native monitoring and tagging for governance and cost visibility.
Integrated performance and log data with alerting pipelines, reducing false positives and aligning escalations with business-critical service thresholds.
Evaluated and deployed open-source tooling for Linux system administration, driving platform improvements and reducing third-party tool dependencies.
Implemented and maintained LVM, RAID, and filesystem tuning on Linux systems to optimize disk I/O and storage redundancy.
Drafted and maintained high-quality documentation for architecture decisions, standard operating procedures, and incident workflows, supporting team scalability and audit readiness.
OCT 2020 – SEP 2023
Linux Admin Santander Bank, New York, NY
Provisioned Linux servers using cloud-init and Kickstart in both on-prem and AWS environments, ensuring standardized configurations across all instances.
Managed software installations and updates using YUM, RPM, DNF and custom repositories, keeping systems up to date in offline and restricted networks.
Configured and maintained PXE boot and SFTP servers to streamline automated OS deployments in production environments.
Set up and monitored NFS and Autofs mounts for shared storage, troubleshooting connectivity and performance issues with multipath and mount options.
Created and modified systemd service files to control custom applications and enforce proper startup behavior on RHEL and CentOS servers.
Deployed OpenVPN and managed SSH configurations for secure remote access, including hardened SSHD settings and key-based authentication.
Implemented centralized log forwarding and rotation using rsyslog and logrotate, sending logs to Splunk and local retention directories.
Built and maintained DNS and DHCP configurations using BIND and dhcpd, supporting dynamic and static IP address management.
Wrote shell scripts to automate common tasks such as user creation, permission changes, disk usage checks, and log cleanup.
Participated in change windows for patching and reboots, documenting pre- and post-change verification steps and confirming successful deployments.
Managed Linux filesystems, including creation, tuning, and maintenance, with expertise in mount options, persistent mounting (/etc/fstab), and ensuring performance, security, and reliability across environments.
Performed filesystem integrity checks and recovery using tools such as fsck, proactively identifying and resolving disk and filesystem issues to prevent data loss and downtime.
Implemented and managed NIC teaming / bonding for Linux systems to achieve high availability, fault tolerance, and improved network throughput, configuring modes such as active-backup and LACP (802.3ad).
Performed advanced Linux kernel tuning by optimizing sysctl parameters, CPU and memory settings, and I/O scheduling to improve system performance, stability, and scalability under high workloads.
SEP 2019 – AUG 2020
Linux Analyst Halliburton, Houston, TX
Responded to user-reported issues via JIRA and ServiceNow, diagnosing Linux system errors, application crashes, and login failures.
Used journalctl, dmesg, and application logs to investigate service disruptions and escalate unresolved issues to engineering teams when necessary.
Performed routine system checks, including disk space monitoring, zombie process cleanup, and service restarts to maintain operational health.
Maintained and updated static and dynamic hostname entries in the hosts file and DNS records, resolving name resolution issues across Linux nodes.
Assisted with sudoers file updates and permission escalations, ensuring proper group assignments and security compliance on shared systems.
Provided first-level support for issues related to email delivery, cron job failures, and file permission conflicts on shared file systems.
Managed password resets, user lockouts, and account provisioning using built-in Linux tools and Active Directory-integrated authentication.
Created and maintained internal documentation with troubleshooting steps, system access procedures, and support checklists for junior analysts.
Installed and configured utilities and developer tools such as curl, wget, net-tools, and zip/unzip to support user environments and application dependencies.
Skills
Infrastructure : RHEL 7/8/9, CentOS, Oracle Linux (OEL), Ubuntu, VMware vSphere/ESXi, vCenter, KVM, RAID (0/1/5/6/10), LVM, SAN/NAS, filesystem management (XFS, ext4), SELinux.
Cloud & Automation : AWS (EC2, S3, IAM, VPC, Security Groups, ALB/NLB, CloudWatch, CloudTrail, Route 53), Ansible, Terraform, Git, CI/CD, GitHub, GitLab.
Containers : Docker, Docker Compose, Kubernetes, container networking basics, troubleshooting pods & nodes.
Monitoring & Logging : Prometheus, Alertmanager, Grafana, Splunk, incident investigation & root cause analysis.
Networking : TCP/IP, DNS, DHCP, NTP, SSH, HTTP/HTTPS, load balancers, VLANs, VPN, firewalls, key-based authentication, bastion hosts, OS hardening best practices.
Scripting : Bash scripting, YAML, cron/at job scheduling, writing small automation tools for routine ops tasks.
Tools & Platforms : Jira, ServiceNow, Confluence, iLO, iDRAC, VS Code.
Education
Bachelor in Military Art & Science, National University of Science & Technology, 2020