SUSAN SPODDIG Platform Engineer clearance: actuve Public Trus tNACI-5
650-***-**** ********@*******.*** https://linkedin.com/in/spoddiglane
Dynamic IT Infrastructure SRE with over 15yrs experience in supporting Bare-metal and AWS environments. Demonstrated expertise in improved efficiency, reliability, infrastructure automation and performance in Linux and cloud environments; lowering cost while increasing customer confidence and satisfaction. Cloud experience with AWS public and GovCloud, private clouds, AWS and on-premise clustered enterprise Linux, Kubernetes and Docker containers, IaC, S3 and on-premise storage architecture. Maintaining, deploying, monitoring, installing upgrading troubleshooting critical high volume enterprise systems running a wide variety of web, Java, and SpringBeans python applications as well as COTS and OpenSource packages. Created pilots on a variety of cloud platforms. Verified best fit and function for refactored production monolith cloud migrations. Worked closely with developers and architects launching production HPC ingest and analysis applications in the cloud.
Technical Skills
OS AWS • CentOs6 • 7 • 8 • UNIX • Linux • Solaris • HPUX • SUSE • RedHat 6-9• NETAPP DataONTap
Automation/ChMgmt Ansible • git • AWS • Puppet • Argo • Cri-o • terraform • Satellite • Foreman • RPM builds • yum
Networking Load balancers • TCP/IP • Networking • NFS • DNS
Storage S3 and Glacier • LVM • Gluster • NetApp hardware, configuration both physical and virtual NetApp heads in the cloud. Optimization, disaster recovery storage tiering; StorageGrid and updates through 9.13 DataOnTap •
Storage and Reliability Software • Veritas Volume Manager • iSCSI • RAID
Programming/SW Shell Scripting Change Management Software ERP Systems • IVR Systems • WFM tools SOA • Tuxedo BEA Oracle Fusion Middleware RDS Analytics Tableau
Classes Taught – Kubernetes • Container and orchestration · NetApp HW and SW · Wireshark trace investigation · TCP/IP · DNS · Solaris • Linux
Valuable skilled/experience: Root cause analysis, incident management, ci/cd, distributed systems, cloud infrastructure, performance tuning, security compliance, oncall rotation
Experience
Experience architecting stateless and stateful applications for containers and container manager/schedulers such as Kubernetes, AWS EKS, AWS ECS and Docker
Compute Technial Lead/ senior systems engineer
Leidos FTC Washington, DC. Nov 2023-present
Managing a team of 18 senior engineers who support Unix Storage and Windows environments at FTC, supporting the organizations compute efforts as well as developers and end users pursue their legal analytical research to benefit American consumers. Ensured uptime of all environments and applications through rigorous patching and compliance reviews and monitoring. Improved processes and relatable automated deployment. Working to make best use of physical NetApp storage and hybrid clusters of NetApp clusters with virtual systems mixes with physical. Helping engineers revisit conversations with C level government personnel to guarantee requirements are complete in their capture of both stated needs and deeper utilization needs. Developing a culture of being the trusted technical advisor and partner as well as the ‘smart hands’ executing technical and complex project plans.
Resizing and architecting projects to best fit technical needs and cost efficiency with minimal downtime or user retraining. Ensuring DC-MLS clearance at different tiers is maintained in RHEL systems and storage. Tuning monitoring tools (including Science Logic) to reduce false alerts and provide meaningful alerts. Lowering toil for engineers and speeding troubleshooting and impact identification during incidents.
Systems Engineer II Level 5
Amazon Web Services AWS Arlington, VA 10/22-08/23 Amazon Data Analytics Gov Cloud
Guided AWS team through several management reorganizations and transitioning from manual efforts building and supporting US Government secure systems. Assisted teams’ Agile adoption while delivering functioning Linux EC2 and CloudFormation systems for running Amazon Glue and LakeFormation in AWS/GovCloud. Created architectural frameworks for altering historical build methods to using container- based separation of system configuration from build. Rendering them with automated flexibility to deploy in bulk and configure for unique customers afterward in an air-gapped environment. Built tracking system auditing distributed microservice delivery across regions and teams. Monitored, patched and updated systems’ configurations, controlling misbehaving jobs or MQ starvation situations in production while ensuring clients’ cloud database work was uninterrupted and suffered minimal impact. Creating and updating roles and VPCs according to government users’ DC-MLS classifications if information and applications.
Sr. Linux Administrator IV
NOAA College Park, MD 10/18-09/22 Weather Satellite information processing citizen alerting
Deploying Infrastructure as code, maintaining Kubernetes cluster for satellite image ingest and processing on CentOS and RedHat systems in the cloud as well as on-premise. Worked as ITAR cleared Senior Unix Administrator IV working with ingesting satellite images and processing. RedHat systems with message queues (MQ) and a variety of WebLogic and Apache/Tomcat modules had critical functions such as creating US citizen alerts for developing tornados and dangerous weather conditions. Supported HPC systems running Docker containers and Kubernetes clusters’ workflows in multiple sites running various COTS and in-house software. Launched NOAA and government teams’ cloud migration after demonstrating cloud performance and security met or exceeded the on-premise HPC cluster’s scientific weather information processing capability. Updated CentOS 6 systems to 8, migrated to RedHat after CentOS support changed. Migrated Linux servers, tuned kernels for the specific work being performed. Physical troubleshooting and maintenance of on-premise equipment. Created maintained and managed local storage volumes, permissions, application versions and insured mission critical open source and NOAA developed software modules were running by using scripts, service stop and starts as well as Kubernetes pods to help the clusters self-heal during performance alerts and outages.
Infrastructure Technical Program Manager
Cadence Semiconductor Design Systems San Jose, CA. 4/18-10/18 Semiconductor Design
Using Agile Project management and lean sigma processes to drive
business process improvements delivering Paas, SaaS, and IaaS.
collaborating with architects, engineers and business owners on-time on-budget quality project deliverables
osoftware releases
oMQ control and security
ocomplete Data Center environment buildouts from raised floor to ESX chassis hosting virtual customer semiconductor chip CAD design environments on RedHat stacks
oProblem-solving with engineering and developer teams; designing and prompting creative resolutions which fit the business’ mission, budget and customer requests.
Cloud Service Delivery Lead
Ellucian (remote) Maitland, FL 07/17- 01/18 Higher level ERP SaaS
Acting as customer relationship manager to the technology teams, and as adviser from cloud technology teams to university officers I lead steady state Service Delivery team maintaining and troubleshooting customers’ on- premise and AWS WebLogic based CRM and ERP installations. Assisted universities with upgrading Banner and it’s dependent subsystems to Banner 9. Delivery involved upgradin and monitoring Oracle, RHEL, Java, J2EE, and third-party billing software and sizing AWS instances for performance. Insured 24 x 7 uptime of universities’ Cloud and on premise systems through on-call work, ticketing, and monitoring systems. Lead role varied between front-line maintenance, troubleshooting, Incident Commander during outages. Using my knowledge of Linux, Unix, Oracle databases, network connectivity, authentication, Java, Firewalls, WebLogic behavior, Apache, AWS and local systems I supported customers’ endeavors.
Infrastructure Program Manager
GOOGLE INC. Sunnyvale, California 2/2015 – 2/2017
Troubleshooting Oracle middleware, SOA and Oracle Form difficulties on Goobuntu Linux systems running core Google financial and logistics systems. Identified and resolved Oracle upgrade and performance bottlenecks. Provided quality IT service and support to external and internal customers. Insured Google Shopping Cart ordering supply chain functioned properly. Worked with service architect to expand capacity by 2000%. Troubleshot issues and Delivery problems with DBAs and business partners.
Achievements:
Upgraded financial engines to Oracle 12c. Planned the upgrade of expiring Oracle 11g-class versions and subcomponents including Java, SSH and HFM.
Established update processes and procedures maintaining uptime while ensuring SEC and SOX compliance.
Proven experience managing projects in both waterfall and agile methodologies in matrixed environment.
Sr Storage Engineer
NETAPP INC. Durham, North Carolina 3/2005 – 1/2015
Created technical bridge between complex enterprise storage administration and customers’ support staff Worked to learn each customers’ complex storage implementation in Unix and Windows usage in a few minutes and create recovery steps, best integrity posture, resolve any issues experienced by acting as their remote storage administrator advisor. Developed technical solutions for customers during mission critical production outages. Escalated issues to backline Hardware and Firmware engineers from Support organization.
Achievements:
oIntroduced automation lowered cost from over $100 to $34 per disk replacement. Worked cross organizationally with logistics, part delivery management, remote depot stock measurement to insure availability.
oTrained Outsourced technical teams:
Performed Storage system ONTAP training, trained engineers from no NetApp knowledge through hands-on command line, performance, physical network and performance troubleshooting capability.
Created outsource engineer ticket ownership behavior, maintaining customer service and support levels lowering cost.
Implemented seamless support escalation procedures for 14,000 tickets and 9000 calls per month for outsource team to senior NetApp staff engineers
Raised customer satisfaction ratios from mid 80% range to 98% by upgrading call-flow and work process.
Built call centers, outsourced work, grew project from 16 to 375+ agents.
oGuided Troubleshooting tool allowing engineers to ask detailed questions for escalation and faster ticket resolution. Helped route tickets to correct expert improving customer satisfaction, lower callbacks, reduced cost.
Senior UNIX Administrator Positions
Senior UNIX Administrator Positions:
SUNTRUST Bank Durham, NC
Transitioned CCB customers and accounts into SunTrust Bank production systems
Administered Solaris, SuSE, HPUX, and HP Blade systems running Ubuntu.
GSK Glaxo Smith Klein ) Durham, NC
Brought 135,000 IP addresses in 60 DNS+ domains under centralized change management
Administered Solaris servers controlling DNS and corporate firewall solution
USPS (United States Postal Service ) Raleigh, NC
Maintained Database providing uninterrupted global street address and zip code system availability for
Streamlined IVR system. Reduced long distance costs by $24 million/month.
Executive Office of the President of the United States (POTUS/ EOP) Washington, DC
Managed Unix email servers and web servers utilized by the public and EOP staff
Education Professional Training
• AWS Fundamentals • AWS Architect • Terraform
• Kubernetes Security Specialist (CKS) • Docker & Certified Kubernetes Administrator (CKA)
• TIA Security + • DevOps Bootcamp • ITILv3
• PMP • CSM
B.A. North Carolina State University