Post Job Free
Sign in

Senior Software Engineer

Location:
American Fork, UT, 84003
Posted:
February 02, 2025

Contact this candidate

Resume:

Shawn Becker, Ph.D.

Lehi, UT 857-***-**** *****.******@********.*** Linkedin.com/in/shawnbecker GitHub Portfolio

Senior Cloud Engineer

Technology leader with 20+ years of experience architecting enterprise cloud solutions and data systems. Demonstrated success implementing real-time monitoring, streaming analytics, and robust data pipelines across finance, healthcare, and entertainment sectors. Expert in cloud architecture best practices, focusing on security, reliability, and cost optimization through infrastructure automation and continuous improvement. Consistently delivered high-availability solutions that reduced operational costs while enhancing system performance and scalability.

Core Competencies: Enterprise Architecture, Data Modeling, Cloud Design, Security Compliance Data Science & Analytics: Python Analytics, Business Core Competencies: Cloud Architecture: AWS Services, Security Compliance, Infrastructure Automation Data Engineering: ETL/ELT Pipelines, Distributed Computing, Real-time Processing Analytics & ML: Python Analytics, Statistical Modeling, Computer Vision, NLP Visualization: Looker, QuickSight, Plotly, D3.js DevOps: CI/CD, System Monitoring, Performance Optimization

PROFESSIONAL EXPERIENCE

Spexture Lehi, UT

Senior Software Engineer, DevOps Aug 2024 – Present

Designed and implemented an end-to-end CI/CD pipeline using a target-based architecture to optimize model deployment.

Automated multi-environment infrastructure provisioning and credential management via Hashicorp Terraform and Vault.

Deployed PySpark SQL Docker containers on an EKS-managed Kubernetes cluster to enhance resource utilization and scalability for data processing workloads.

Ensured system transparency using real-time monitoring solutions and dashboards with Amazon Managed Grafana.

Utilized AWS Glue and PySpark for data ingress, cleansing, versioning, and lineage tracking, maintaining data integrity and regulatory compliance for security.

Fannie Mae – Risk Works Analysis Data Lake Washington, DC & Remote

Senior Data Engineer Feb 2024 – Jul 2024

Developed and maintained ETL pipelines across multiple environments using AWS (AWS Redshift, Glue, S3, IAM, Lambda, SNS) and tools (SQL, dbt, REST APIs, Postman).

Ensured efficient data ingestion and processing through validated SQL code for data lakes.

Automated schema change tracking and Redshift external view updates using Amazon Glue, reducing manual intervention.

Earned Agile Scrum Master certification, demonstrating knowledge of Agile methodologies and best practices.

The Cigna Group – Data Cybersecurity Division Bloomfield, CT & Remote

Senior Cybersecurity Engineer May 2023 – Dec 2023

Upgraded application deployment processes and improved build automation by leveraging Jenkins CI/CD pipelines, integrating SetupTools, Artifactory/PyPI, SonarQube, and Xray.

Improved build automation and migrated legacy ETL data pipeline components from on-prem Unity IoC applications to AWS cloud using CDC.

Designed Python-based REST API with mutual TLS/SSL authentication via AWS API Gateway for secure CyberArk’s credential retrieval.

Runtime credential extraction eliminated dependency on locally encrypted files, reduced engineering effort, and minimized cost by 95% per password rollover event.

Warner Brothers Interactive Media Waltham, MA & Remote

Senior Data Engineer Sep 2022 – Apr 2023

Performed statistical modeling on game marketing data using analytics stack (Pandas, NumPy, Sci-kit Learn, Keras) with data visualization libraries.

Leveraged existing game telemetry infrastructure to build marketing analytics solutions using Twilio Segment CDP, Kafka, Redshift, and Airflow.

Built real-time status dashboards using DataDog and Amazon QuickSight to provide visibility into marketing campaign metrics.

Angel Studios Provo, UT

Senior Data Engineer Dec 2021 – Aug 2022

Implemented and fine-tuned a CNN using AWS SageMaker, PyTorch, and Keras to classify movie frames for digital asset monetization.

Created executive dashboards and KPI reports using Snowflake and Looker to track revenue metrics and campaign performance.

Greenseed Data Laboratory Orem, UT

Senior Data Engineer Nov 2020 – Nov 2021

Developed data visualizations using Seaborn, Plotly, and Matplotlib.

Implemented CI/CD workflows using GitHub Actions, Coverage, SonarQube, and Xray to improve code quality.

Architected and built a custom star-schema data warehouse on PostgreSQL while utilizing dimensional modeling and SCD Type-2 tables with a shared streaming facts table.

Automated infrastructure provisioning across development, staging, and production environments using Terraform and Helm.

Integrated RESTful APIs to facilitate seamless data exchange with real-estate data teams and external customers.

NuSkin Provo, UT

Senior Full Stack Developer Nov 2019 – Nov 2020

Redesigned and optimized site registration and login workflows using wireframes for improved UX.

Developed reusable Vue components with Vuetify, NodeJS, and SCSS.

Internationalized web content and unified customer data using Adobe Experience Cloud, CDP, and XDM for personalized global user experiences.

SeniorLink (Vela) Boston, MA

Senior Data Engineer Mar 2017 – Nov 2019

Built AWS-based data pipeline enabling HIPAA-compliant healthcare messaging and collaboration on the Vela platform.

Orchestrated end-to-end data flow: ingesting API Gateway messages via Kinesis streams, transforming to Parquet in S3, and processing with PySpark on EMR for Redshift loading.

Defined RESTful APIs for daily caregiver questionnaire submission and data retrieval through web applications.

Followed privacy and encryption standards for sensitive data, including PII, PHI, PCI, Patient Data, FHIR, and HL7, as well as HIPAA and GDPR.

ClipFile Newton Center, MA

Technical Project Manager Solutions Architect Co-Founder Feb 2011 – Mar 2017

Designed and launched an AWS-based SaaS platform for content discovery and curation.

Implemented patented technology to deliver a consumer-facing CMS with fuzzy matching capabilities among user-curated quotes and text fragments. A fuzzy matching system uses ML techniques (PCA, dimensionality reduction, K-means clustering) to connect user-curated content.

Built scalable backend using Spring Boot, PostgreSQL, and Java 8, with RESTful APIs serving web and mobile clients.

Developed an NLP-based recommendation engine analyzing user preferences across curated content (books, articles, quotes) using collaborative filtering.

Earlier Professional Experience

Co-founder & CTO, HomePortfolio (1997-2002)

Led technical teams at Sierra Vista Group, One Call Center, and CocaCola Corp (2002-2011)

Various architect and tech lead roles at digital media and cybersecurity companies (1996-2007)

Front-end development at Cimmetrix Robotics (1989)

EDUCATION

Ph.D. in Media Arts & Sciences Massachusetts Institute of Technology Cambridge, MA 1997 MSc in Computer Science Brigham Young University Provo, UT 1990 BSc in Design Engineering Technology Brigham Young University Provo, UT 1987

Certifications: Supervised Machine Learning: Regression & Classification, Advanced Learning Algorithms – Stanford Online (Coursera) Certified ScrumMaster (CSM) – Scrum Alliance, Inc. AWS Certified Cloud Practitioner, Certified Machine Learning Practitioner – Udemy (AWS) Data Engineering with dbt, Investing in Human Skills in the Age of AI, Hands-On PyTorch ML, Program Management Foundations, Cert Prep: Scrum Master, Learning Kubernetes, iOS 15 Development Essential Training, Advanced SQL for Data Scientists, Neural Networks and Convolutional Neural Networks Essential Training, Data Engineering Foundations, Postman Essential Training, Introducing Postman, Transitioning to Product Management, Learning Bootstrap 2, Learning Vue.js, Spring: Framework in Depth, Advanced Node.js, Node.js Essential Training: Web Servers, Tests, and Deployment, Learning Node.js, JavaScript Essential Training, Node.js Essential Training – LinkedIn

TECHNICAL SKILLS

High Performance Computing: Ray-Tracing, Voxel-space Rendering, WebGPU, WebGL2

Streaming Technologies: Amazon Kinesis, Apache Kafka

AWS & Cloud Services: AWS Architecture, S3, Lambda, Step Functions, SQS, SNS, EC2, Redshift, DynamoDB, SimpleDB, ElastiCache, Aurora, CloudFormation, SAM, API Gateway, PrivateLink, VPC, Athena, ECR, ECS, EKS, Fargate, Amazon EMR, CloudFront

Data Engineering & Big Data: Apache Airflow, Apache Glue, Apache Glue Catalog, Apache Spark, Apache PySpark, Apache Spark SQL, Data Pipeline, Medallion Architecture, Data Mesh, Dimensional Modeling, Metadata Management, Data Lineage, Schema Evolution, Delta Lake, Databricks, Snowflake, OLTP, OLAP, Segment CDP, Akamai

Languages & Frameworks: Python, Java, Spring, SpringBoot, SQL, PostgreSQL, Oracle, MS SQL Server, dbt, Flyway

DevOps & CI/CD: Docker, Kubernetes, Hashicorp Terraform, Git, GitHub, GitHub Actions, Atlassian Bitbucket, Jenkins, GitOps, CI/CD Pipeline Design, JFrog Artifactory, PyPi, maven, MavenCentral Repository, npm, npm registry

ML & AI: Amazon SageMaker, Supervised/Unsupervised Learning, Reinforcement Learning, Semi-Supervised Learning, Self-Supervised Learning, Regression, Classification, Clustering, PCA, Computer Vision, NLP, LLM, LangChain, Embedding, Bayesian K-means

Analytics & Visualization: Looker, Tableau, Google Analytics, WebTrends, AQWS QuickSight, Grafana, Observable, D3.js, Three.js

Security & Compliance: CyberArk, SSA/TLA Authentication, Hashicorp Vault, MFA, JWT, PII, PHI, PCI, HIPAA, GDPR, FHIR, HL7

Monitoring & Testing: DataDog, AWS CloudWatch, New Relic, AppDynamics, Unit Testing, Integration Testing, JMeter, Postman, Faker, PyTest, Coverage, PyLint, Xray, SonarQube, Code Reviews, Sphinx, Javadoc

Data Integration & Tools: ETL/ELT, Altova MapForce/Mock, Atlassian Confluence, Atlassian Jira, MS Project, MS Visio, MS Office 365



Contact this candidate