Post Job Free

Resume

Sign in

Site Reliability Management Specialist

Location:
Lithonia, GA
Posted:
June 13, 2023

Contact this candidate

Resume:

PIETRO SPIVA

678-***-****

adxorq@r.postjobfree.com

Advanced Information Technology Professional

OBJECTIVE:

To obtain a senior level contract or permanent position focusing on process improvement initiatives that will utilize my experience as an IT professional and that offers continuous professional development opportunities.

CAREER SUMMARY:

An intelligent, enthusiastic, able to work under pressure, open minded, strongly self-motivated System Engineer with a broad experience in real time operating with expertise in designing local and wide area networks, server-client systems, security management and disaster solutions.

KEY SKILLS & COMPETENCIES:

Operating Systems

Hardware Supported

3rd Party Software Supported

•Centos 6 / 7

•Solaris 9 / 10 / 11

•RHEL 5.x, 6.x, 7.x

•AIX4.1, 4.3, 5L

•Ubuntu 15.04

•Windows (2003, XP, 2000, 2010, NT, 9x)

•IBM Datapower XG45

•IBM pSeries

•IBM System/390

•Sun T-Series

•Hitachi Disk Storage

•Compellent Disk Storage

•EMC Storage

•NEC HYDRAstore

•BEA Weblogic 8.1 / 10.2 / 10.3

•IBM WebSphere MQ Explorer

•Puppet / Chef Deployment

•Jenkins Deployment Server

•VERITAS NetBackup 6.0

•VERITAS Cluster Server

•Centrify DirectAuthorize

•Compuware Dynatrace

•Principia Reporting Servers

•Actuate 2.0 / 2.1

•JBoss Application Server

•Apache / iPlanet Web Servers

•Compuware ServerVantage

•Sterling Connect:Direct

•Compuware TestPartner Framework

•RSA Cleartrust envision 4.1

Database Servers Supported

Infrastructure Software Components Supported

OS Protocols / Scripting Supported

•IBM DB2

•Oracle 9i / 10g /11g / 12

•MySQL

•Postgres

•Websense Proxy

•McAfee Firewall Enterprise (Sidewinder) S4016

•VERITAS NetBackup 6.0

•BluecatNetworks Adonis DNS/DHCP Appliance

•Tivoli Storage Manager (TSM)

•Third Brigade Deep Security 6

•LDAP

•SendMail

•DHCP

•DNS / Bind

•SSH/FTP

•NFS

•Active Directory

•Python

•Perl

EDUCATION:

1991- 1996 FLORIDA A&M UNIVERSITY Tallahassee, FL

• Bachelor of Science Degree in Computer Information Systems

1996- 1998 FLORIDA A&M UNIVERSITY Tallahassee, FL

• Master of Business Administration

EMPLOYMENT CHRONOLOGY:

1/21 – present Kaiser Permanente, Atlanta, GA

Site Reliability Engineer

Kaiser Permanente is one of the largest nonprofit healthcare plans in the United States, with over 12 million members.[1] It operates 39 hospitals and more than 700 medical offices, with over 300,000 personnel, including more than 87,000 physicians and nurses.

Coordinate changes with multiple teams using ServiceNow and Jira with application owners to ensure minimal user impact.

Met with team leaders and customer to gather project requirements and Service Level Agreements and worked with DevOps team to build application to corporate standards

Created scorecard and monitored team activities and reported to upper management successes and failures of application and application hardware

Setup and managed PagerDuty notification and set escalation procedures

Set meeting w/ DBA and developers to perform Root Cause Analysis and developed procedural fixes in future release

Setup Kibana and Dynatrace Monitoring procedures for infrastructure servers and process monitoring

Act as top-tier on-call support for critical uptime business applications to maintain availability and minimize downtime during outage scenarios

Plan, scheduled, and tested all software and application updates by writing testing shell scripts (ksh, bash) and other application testing tools

Setup file integrity monitoring using Titanium, AppDynamics, and Protegity

Is responsible for multi-platform operating systems, utilities, and related software to meet organizational needs.

Maintained responsibility for the availability, integrity, and reliability of assigned systems.

Makes recommendations on system upgrades and new technologies.

Installs, maintains, and monitors one or more multi-platform operating systems, utilities, and related software to meet organizational needs.

Supports the availability, integrity, and reliability of assigned systems.

Provided daily support for the EPIMS, POS, and IHP environments with a focus on Red Hat Enterprise, Ubuntu, and other *nix operations both on-premises and within cloud computing platforms.

Developed and updated procedures and guidelines to install, patch, configure, customize, troubleshoot, upgrade, integrate, and maintain Red Hat Enterprise, Ubuntu, other *nix operating systems, and related software.

Experience with log analytic tools such as Splunk.

Researches, analyzes, and resolves problems, providing root-cause analysis for Red Hat Enterprise, Ubuntu, and other *nix operating systems.

Proactively seeks information and utilizes analytical and creative problem-solving skills along with standard processes and technologies resulting in secure use of systems, applications, and infrastructure.

Demonstrates quality service and accountability in the process of resolving requests, supporting daily operations, and ensuring system stability that results in accurate, timely, and efficient solutions and data as evidenced by meeting customer needs.

Learning and keeping current with HPC technologies, such as backups, job-scheduling and parallel file system management.

LDAP, user and group account administration.

3/19 – 12/20 Equifax, Atlanta, GA

Equifax is a global data, analytics, and technology company. We believe knowledge drives progress. We blend unique data, analytics, and technology with a passion for serving customers globally, to create insights that power decisions to move people forward. Headquartered in Atlanta, Equifax operates or has investments in 24 countries in North America, Central and South America, Europe and the Asia Pacific region.

Site Reliability Engineer

Managed application migration project from Legacy bare metal servers on Site A to Cloud based BlueBird virtual machines.

Tracked and monitored team activities using JIRA and Confluence

Met with team leaders and customer to gather project requirements and Service Level Agreements and worked with DevOps team to build application to standards

Created scorecard and monitored team activities and reported to upper management successes and failures

Managed and track Rapid7 vulnerabilities on Centos6/7 systems keeping systems in compliance with the EFX Security

Setup and managed PagerDuty notification and set escalation procedures

Set meeting w/ DBA and developers to perform Root Cause Analysis and developed procedural fixes in future release

Setup Datadog Monitoring procedures for infrastructure and process monitoring

Act as top-tier on-call support for critical uptime business applications to maintain availability and minimize downtime during outage scenarios

Coordinate changes with multiple teams using ServiceNow and Jira with application owners to ensure minimal user impact.

Plan, scheduled, and tested all software and application updates using Chef cookbooks and Rundeck deployment tool by writing shell scripts (ksh, bash) and perl scripts

Setup file integrity monitoring using Titanium, AppDynamics, and Protegrity

Deploy and maintain international server environment for 24/7 critical uptime business product offering in a mixed Windows/Linux environment.

Served as an escalation point for other Systems Administrators, Engineers, and other technology teams in the resolution of server and system problems.

Maintain Git repositories for developers and promote topic branch workflow

Leverage automation tools such as RunDeck, in order to decrease end-to-end deployment times, reduce downtime, and increase reliability.

Maintain PCI and SOX compliance with required applications and environments

Managed Equifax CI/DC pipeline using Jenkins

Coordinate changes with application owners to ensure minimal user impact

Created and maintained documentation of systems and processes for existing and new systems using Confluence

Wrote automation scripts in BASH / Go / Korn Shell to maintain Bluebird environment

1/18 – 2/19 Merchant e-Solutions, Atlanta, GA

Linux Site Reliability Engineer

Merchant e-Solutions helps merchants accept payments anywhere and easily manage all on one platform. Merchant e-Solutions provides a global network and enables merchants to securely do business in multiple channels including online, mobile, and in-person. Our industry-leading technology platform, flexible and customized reporting, and world-class service provide customers, banks, partners and developers with the most comprehensive payment services in the market.

Linux administrator

Created and implemented disaster recovery procedures and remote backup site

Deployed over 89 production Linux servers and maintained their availability at 99.8%.

Created jobs in Jenkins and set up global permission and schedule jobs

Performed branching, tagging, and release activities on version control tools like GIT

Performed system troubleshooting to isolate and diagnose common server/system/DNS/storage resource problems.

Highly involved in configuring and monitoring distributed and multi-platform servers.

Experience with Docker and Vagrant for different infrastructure setup and testing of code

1/17 – 1/18 CNN, Atlanta, GA

An American basic cable and satellite television news channel owned by the Turner Broadcasting System, a division of Time Warner. CNN was founded in 1980 by American media proprietor Ted Turner as a 24-hour cable news channel.

Patch Management Specialist/Linux administrator

•Worked with application operations team to developed vulnerability management strategies of dev and production systems

•Provided Tier I and II SCCM administrator support to Turner application suppoprt

•Provided remediation services to infrastructure teams based on vulnerability & policy compliance scans

•Negotiated, planned and manage patch activities for RHEL, Ubuntu, and Solaris platforms

•Participate in Change Advisory Board and technical review meetings to discuss patching, vulnerability impacts and considerations and utilized tools to automate patch deployment, such as IBM BigFix and Spacewalk

•Assisted in proactively developing patch and vulnerability management procedures and processes within the operations team and in conjunction with business and IT partners

•Ensured that patch management schedules are being followed and systems meet/exceed defined configuration and security standards.

•Identified causes and created action plans for any deviations from the schedule/policy and report to management weekly.

1/15 – 1/16 First Advantage, Sandy Springs, GA

First Advantage provide easy-to-understand background screening results so you can confidently make decisions about prospective employees, vendors and renters.

Senior system engineer/Linux administrator

•Led technical site relocation project involving up to 40 servers successfully completing project in a single weekend.

•Troubleshoot application, leveraging in-depth knowledge of system functionality and business logic

•Installed TSM clients and setup backup policies on Linux/Solaris/Windows based servers and verified backups

•Monitor platform and applications for errors and performance problems

•Independently identified and performed general administrator/engineer tasks (patches, application upgrades, etc.) on over 60 Red Hat and FreeBSD servers.

•Installed and updated JBoss EAP rpms on RHEL 6.x platform and update operating system modules to standard

•Installed and configured JBoss apache web servers on RHEL 6.x platforms and configured and troubleshoot virtual interfaces

•Setup and configured MySQL on Windows 2012 servers

•Setup users and groups for Active Directory administration

•Monitored system performance and prevented resource exhaustion using ssh, sar, vmstat, iostat, netstat and nmon

•Created Solaris Jumpstart and Linux Kickstart servers and processes to automate and standardize the installation process, reducing installation time by 35% and post-installation errors by 50%.

•Develop or update existing scripts (Python / bash) for automation of routine tasks and deployment of applications using Puppet for DEV, QA, and Production enviroments

•Patched and performed system upgrades of OS kernel and 3rd party software using Satellite server and rpm standalone installs

•Managed and Cross Trained Technical Support Team, teaching personnel Linux standards.

•Deployment production applications using BigIP F5 Loadbalancing configurations of Windows and Linux servers

•Provide 4th tier application support and vendor engagement

•Adhere to established change management and documentation procedures using Jira and Confluence

•Assist in the defining of requirements to ensure that future deployments are operationally sound using Jenkins deployment manager

•Assist in the documentation of current and future systems in Linux, Solaris, and Windows platforms

•Provide continued support of in-house mail system. This includes daily maintenance and all software and hardware upgrades.

•Worked with overseas clients and vendors performing a datacenter migration of multiple applications and operating systems

•Set up proof on concept systems for outside vendors which included installing software, configuration, and setup of VMware Linux systems

•Setup backup and restore software on server and verified backups on systems for data recovery and DR purposes

3/14 – 1/15 COX Communications, Sandy Springs, GA

Cox Communications is a privately owned subsidiary of Cox Enterprises providing digital cable television, telecommunications and wireless services in the United States. It is the third-largest cable television provider in the United States, serving more than 6.2 million customers, including 2.9 million digital cable subscribers, 3.5 million Internet subscribers, and almost 3.2 million digital telephone subscribers, making it the seventh-largest telephone carrier in the country.

Senior system engineer/Linux administrator

•Planning, managing and installation VMware ESX 3.5 / 4.0 server, VMware virtual center, V-Motion, Storage Motion, HA and DRS, P2V, V2V and troubleshooting

•Plan and schedule OS and firmware upgrades on RHEL5.x and RHEL6.x platforms

•Creation and deployment of virtual machines in datacenter, set of the VLAN/Port-group

•Performed VMware crash/error analysis and wrote root cause analysis

•Installed and configured JBoss apache web servers on RHEL 6.x platforms and configured and troubleshoot virtual interfaces

•Configured and managed user, group, permission, role, and resource pools

•Worked in a 24x7 environment supporting over 100 multi-OS (Solaris 11, Linux 2.6, VM Clusters) servers with rotating on-call assignments

•Created and maintained UNIX scripts (ksh, Borne, Perl) used to interface data to and from third party software

•Converted Linux systems from local password authentication to LDAP and NFS homes significantly increasing administrative efficiency.

•Successfully developed and implemented security and patch update strategies for multiple UNIX operating systems (Sun / Linux).

•Successfully installed and managed N-tier J2EE BEA WebLogic Applications on UNIX operating systems

•Installation and configuration of Solaris, Linux for new build environment.

•Installed, managed and implemented various java (JDK7) applications on multiple UNIX platforms

•Hardening of servers to ensure proper security measures are implemented

11/13 – 3/14 LexisNexis Inc, Alpharetta, GA

LexisNexis® is a leading global provider of content-enabled workflow solutions designed specifically for professionals in the legal, risk management, corporate, government, law enforcement, accounting, and academic markets

Senior Systems Engineer/ Linux administrator

•Worked in a 24x7 environment supporting multi-OS ( SunOS, Linux, VM Clusters) servers with rotating on-call assignments

•Attended project meetings at set requirements for new customer facing project being rolled out in 2014

•Provided hardware specifications and assisted with project manager in the ordering project supplies

•Worked with production sites and disaster recovery sites using F5 to divert traffic during maintenance windows

•Documented existing applications and provided necessary support and automation of current processes

PROFESSIONAL DEVELOPMENT TRAINING:

Certified Solaris 10 Administration

Certified IBM AIX5 Systems Administration

Red Hat Certified Systems Administrator (RHCSA)

Certified Checkpoint Firewall Security Administration (CCSA)

Certified Checkpoint Firewall Security Engineer (CCSE)

Solaris Containers / Zones

VERITAS Cluster Server for UNIX

VERITAS Storage Foundation 5.0 for UNIX

McAfee Firewall Enterprise (Sidewinder)

IBM Power 520 Express

IBM HACMP Systems Administration

iPlanet Netscape Application Server Administration

Sun Solaris 2.x Internals

VERITAS NetBackup for Solaris

Practical Solaris 10 Security

Compuware ServerVantage Installation and Administration

REFERENCES AVAILABLE UPON REQUEST



Contact this candidate