Post Job Free

Resume

Sign in

Big Data Engineer

Location:
Austin, TX
Salary:
130k
Posted:
July 23, 2023

Contact this candidate

Resume:

Eimis Janeisi Pacheco Aviles

*** **** *** **, ******, TX 78704

737-***-****

adyg6r@r.postjobfree.com

https://www.linkedin.com/in/eimis-pacheco/

Data Engineering Big Data Data Architecture Advanced Analytics Data Science PySpark Databricks Redshift AWS EMR Glue Lambda ETL DWH Data Modeling Python BI Hadoop Hive Cloud Computing

PROFILE

A data professional with over 14 years of experience in the IT field, I combine academic credentials with a strong track record of achievements. I have experience in various data roles, including data engineering, database development, ETL, relational & dimensional modeling and data analysis. I am also equipped with knowledge in machine learning and advanced analytics. I am known for my adaptability, self-motivation, commitment to continuous learning, and high-performance standards. Committed to staying up-to-date, I have a passion for creating impactful data solutions.

Quick Note: Regarding my experience in Costa Rica, even though I was based there, my services were primarily focused on the US market through remote work. As a result, most of my managers and coworkers were from the US.

CERTIFICATIONS

● Databricks Certified Associate Developer for Apache Spark 3.0, 2023

● Databricks Certified Data Engineer Associate, 2023

● CDP Certified Generalist, 2022

● Big Data Professional Certificate, 2021

● Microsoft Certified: Azure Fundamentals, 2021

● Tableau Desktop Specialist, 2021

● AWS Certified Solutions Architect – Associate, 2020

● AWS Certified Cloud Practitioner, 2020

● Scrum Master Professional Certificate (SMPC), 2019 PROFESSIONAL EXPERIENCE

Senior Data Engineer in Transition - Country Change (11/2022 - Present)

● As a new Green Card holder, I am keen to start my professional journey in the U.S. During this transition period, I have been honing my skills in AWS, Databricks, and Snowflake, and am now poised to contribute effectively in my first U.S. role.

Note: When initiating an independent immigration process from within the USA (without corporate sponsorship), the immigrant is prohibited from working and cannot leave the country during the waiting period. This situation inevitably results in an employment gap. This is the second time I have migrated

(Originally from Venezuela). Feel free to ask any questions about my professional history. Pfizer, Costa Rica Site - (08/2021 - 11/2022)

Job Title: Manager, Analytics Solution Engineer

Role Performed: Lead Data Engineer.

Industry: Pharmaceutical/Healthcare.

Main Project: Migrating data pipelines from Hadoop MapR to Databricks as new big data platform. Technology Used: SQL, AWS, Spark, Databricks, Python, Airflow, Redshift, Github.

● Functioned as a pivotal technical leader, guiding the implementation of analytical, operational business intelligence, and big data solutions to bolster Pfizer's Research and Global Product Development organizations.

● Provided technical oversight for team members that were designing, building, supporting, and maintaining analytic/ big data platforms.

● Be involved in the solution architecture, design, development, and management of operational BI and cloud analytics solutions hosted on AWS.

● Engaged with implementation leads, technical leads, and architects to comprehend and address evolving capability requirements driven by customer needs.

● Spearheaded the development of proof-of-concept initiatives and cloud-based solution engineering for Business Intelligence (BI), data pipelining, and analytical solutions projects.

● Oversee and help architect ETL pipelines and data warehouse/data mart solutions.

● Fostered a collaborative environment within cross-functional, globally distributed teams, promoting collective progress and success.

Concentrix, Costa Rica Site – (11/2020– 08/2021)

Job Title: Senior Data Engineer - Customer: Amherst Account. Role Performed: Senior Data Engineer.

Industry: Financial Services and Real Estate.

Main Project: Transitioning legacy SQL Server ETL code to PySpark ETL on Cloudera hadoop. Technology Used: SQL, Cloudera Hadoop, Apache Spark, Stonebranch (orchestrator tool), Apache Sqoop, Apache Hive, Linux, Python, Github, Delta lake, kafka, SQL Server.

● Designed, built and migrated comprehensive data engineering ETL processes onto the Hadoop ecosystem using Apache Spark, Sqoop, Python, Hive, and HDFS, including the creation and maintainance Hive data warehouse to expedite access to information for business and data analysts.

● Executed batch loads, ingested and processed various file formats such as Parquet, ORC, JSON, and AVRO into HDFS, in addition to managing the import and export of relational/operational data between databases and Hadoop using Sqoop.

● Evaluated the technical architecture of the existing system to identify improvement opportunities, reengineered current data pipelines as required for superior performance.

● Conceptualized and developed streaming data ingestion and processing systems using Kafka. VMWare Costa Rica - (06/2019 – 09/2020)

Job Title: Senior Data Analyst

Role Performed: Senior Data Engineer & Data Analyst. Industry: Cloud computing and virtualization technology. Main Projects: VMware Workspace ONE Data Lake Creation/EUC Customer Health/EUC Customer 360. Technology Used: SQL, Spark, Amazon S3, Amazon RedShift, Amazon Glue, Amazon Lambda, Amazon Kinesis, SQS, SNS, PostgreSQL, SQL Server, Oracle PL/SQL, Python, GitHub, Salesforce, Databricks, Tableau.

● Engaged in Business Intelligence (BI) pipeline projects, contributing to the architecture definition of cloud solutions hosted on AWS, within a globally collaborative team aimed at driving revenue growth.

● Participated in the design and development of a data warehouse using Amazon Redshift, also aiding in the construction of a unified data lake (hosted on Amazon S3) to consolidate data for all reporting needs, serving as a single source of truth.

● Spearheaded data ingestion, cleaning and wrangling by processing data gathered from diverse sources such as NoSQL and Relational databases, APIs, and flat files for predictive modeling purposes (Customer Churn).

● Developed data visualization dashboards using Tableau and Python to illustrate critical customer endpoint information, device types, usage, operating systems, platforms, account types (cloud, on-premise, or hybrid), and more.

● Assisted in identifying both problems and opportunities through comprehensive root cause analyses, leading to significant business impacts.

Experian Services Costa Rica - (02/2018 – 01/2019) Job Title: Software Developer Expert

Role Performed: Data Operations Engineer.

Industry: Information Services.

Technology Used: SQL, Oracle PL/SQL, AWS Lambda, AWS Athenas, S3, Python, SQS, SNS, GitHub.

● Focused on analyzing operational data from relational databases for root cause analysis of failures and effective troubleshooting.

● Played a crucial role as part of an internationally distributed team (US, UK, Malaysia, and Costa Rica.) responsible for the support and development of the Oracle E-business Suite ERP.

● Primary duties involved enhancing business users' day-to-day activities through a dedicated ticketing system, securing business data flow and continuity, and continually delivering value by developing innovative data functionalities while refining existing ones. This role also involved meticulous management and optimization of data migration processes.

Cuestamora, Costa Rica Site - (11/2016 – 02/2018)

Job Title: Senior Systems Analyst.

Role Performed: BI and Data Warehouse Engineer

Industry: Pharmaceutical/Healthcare.

Main Project: Sales focus data warehouse creation / Dimensional Modeling Technology Used: SQL, OBIEE, SSIS, Oracle Sales Cloud (OSC), Tableu, AWS Redshift, Glue, S3, Oracle PL/SQL, SQL Server, Python, Pyspark, Databricks.

● Conducted research and development of Big Data solutions, demonstrating a proof of concept for the company.

● Participated in a proof of concept project involving Oracle databases on Amazon Cloud, performing data extraction, data modeling, and visualization to leverage AWS features.

● Developed CRM analyses using Oracle Business Intelligence Enterprise Edition (OBIEE) analytics for OSC. In the absence of a full-fledged BI implementation, I spearheaded the BI project.

● Tasked with integrating Oracle Order Management (OM) with Oracle Sales Cloud (OSC) using Oracle web services.

● Provided mentorship and guidance to junior analysts. Accenture Interactive - (12/2015 - 10/2016)

Job Title: Application Technical Support - Customer: WMware Costa Rica. Role Performed: Data Operations Engineer.

Industry: Information Services.

Technology Used: SQL, Oracle PL/SQL, Oracle SQL Developer, Toad.

● Focused on analyzing operational data from relational databases for root cause analysis of failures and effective troubleshooting.

● Entrusted with analyzing business requirements by addressing queries and issues raised via tickets by primary users.

● Respond to and resolve data pipeline failures, and implement strategies to prevent future recurrences. Technopartners Ltd., Banco Davivienda, Costa Rica Site - (11/2014 - 11/2015) Job Title: Senior Systems Consultant - Customer: Farmatodo. Role Performed: Data Integration Engineer

Industry: Financial Services.

Main Project: Core banking system migration(Cobis) Technology Used: SQL, Transact-SQL, SQL Server, SSIS, Python.

● I have been engaged in a significant migration project involving the bank's core systems. My responsibilities have included handling tasks related to credit lines, customer portfolios, and various types of insurance, such as vehicle, mortgage, property, and machinery.

● Played an integral role in gathering, elucidating, and documenting information requirements through active engagement with business users.

● Ensured data integrity and quality throughout the data pipelines. This includes setting up checks to verify the consistency and accuracy of data.

● Performed data extraction, validation and transformation as part of my routine responsibilities. Development Solutions BEG. C.A, Caracas, Venezuela – (10/2013-11/2014) Job Title: Business Intelligence Analyst - Customer: Farmatodo. Role Performed: BI and Data Warehouse Engineer

Industry: Pharmaceutical/Healthcare.

Main Project: Data warehouse creation / Dimensional Modeling Technology Used: SQL, Oracle PL/SQL, ODI (ETL), OBIEE, Tableau.

● Entrusted with responsibilities including data extraction and data modeling for Business Intelligence (BI) development, and the creation of star schemas.

● Tasked with solving intricate and undefined analytical challenges, partnering with the customer success team to facilitate critical decision-making processes.

● Developed repositories using the BI administrator tool and carried out data visualization.

● Acted as a mentor for junior analysts, providing guidance and support in their professional development. Outsourcing Services, Lima, Peru – (12/2012-10/2013) Job Title: Senior Oracle Developer - Customer: Ripley Bank/RIMAC Insurance. Role Performed: Senior Oracle Developer

Industry: Financial Services/Insurance.

Technology Used: SQL, PL / SQL, Oracle Forms 10g, Oracle Reports 10g..

● Developed and maintained PL/SQL-based systems for clients.

● I monitored and investigated system issues and performed query performance tuning. Multinacional de Seguros (Multinational Insurance Group) Caracas, Venezuela – (08/2010 - 11/2012) Job Title: Systems Analyst.

Role Performed: Oracle Developer

Industry: Insurance.

Main Project: Insurance system migration / Insurance Operational Reporting. Technology Used: SQL, PL / SQL, Oracle Forms 10g, Oracle Reports 10g, Toad, Oracle SQL Developer, SQl Navigator, ODI (ETL).

● Developed and maintained PL/SQL-based systems.

● I participated in a system standardization and data migration project for Guayana Insurance and Interbank Insurance with the aim of aligning all associated companies to use the same Temis insurance system, as is used by Multinational Insurance .

● I contributed to the creation of operational reports in the insurance sector. These reports covered various aspects such as policy details, insurance claims, loss ratio analyses, premium trend reports, and analyses of claim frequency and severity. Additionally, they provided insight into profitability assessments, underwriting performance reviews, deductible amounts, details of policyholders, dependents, and beneficiaries.

● I monitored and investigated system issues and performed query performance tuning. Contract Work, Caracas, Venezuela – (03/2008 - 08/2010) Job Title: Oracle Developer

Role Performed: Oracle Developer

Industry: Financial Services/Telecomunication/Insurance. Technology Used: PL / SQL, Oracle Forms 10g, Oracle Reports 10g, Toad, Oracle SQL Developer, SQL Navigator, SQL Server, Python.

● Developed and maintained PL/SQL-based systems for clients.

● Assisted in system maintenance and improvement, requirements gathering, for operational systems by the development of Oracle packages and procedures.

EDUCATION:

● Master in Artificial Intelligence (Emphasis in ML, Neural Networks & Deep Learning) - Not Concluded, Universidad de Valencia - Spain

● Data Analyst Program (Specialization in Data Science and Big Data), Cenfotec Universidad, 2020

● Master in Information Technology Administration (Emphasis in Project Management), Universidad Nacional de Costa Rica, 2018

● Bachelor Degree in Computer Engineering, Universidad Alejandro de Humboldt, Venezuela, 2011 Course/ Certification Institution

Google Cloud Professional Data Engineer Big Data Academy Perú Cloud Big Data Analytics Professional on AWS Big Data Academy Perú Apache Spark Specialization Program in Databricks Big Data Academy Perú Big Data on AWS (EMR, Glue, Kinesis, etc...) Netec Global Knowledge AWS Cloud Practitioner Essentials (CP-ESS) Fast Lane Data Mining modules using R Language PROMiDAT Iberoamericano, SA Cleaning Data with PySpark DataCamp

Apache Spark Specialization Program with Scala in Databricks Big Data Academy Perú Architecture Program for Big Data & Cloud environments Big Data Academy Perú Introduction to Data Visualization with Python DataCamp Manipulating DataFrames with pandas DataCamp

Cleaning Data in Python DataCamp

Course/ Certification Institution

Importing Data in Python DataCamp

Intermediate Python for Data Science DataCamp

MongoDB Platzi

Data Engineering with Python Platzi

Scrum Master Professional Certificate (SMPC) Cenfotec Oracle BI 11g R1: Create Analyses and Dashboards Oracle Oracle BI 11g R1: Build Repositories Oracle

Oracle BI Publisher 11g R1: Fundamentals Oracle

Advanced Tableau Training Tableau Software

REFERENCES:

1. Pfizer: Edna Lee, Senior Information Manager, 718-***-****, adyg6r@r.postjobfree.com - https://www.linkedin.com/in/edna-lee-86265b3/

2. Pfizer: Drew Palsgrove, Senior Director, Analytics and Data Platforms, 484-***-****, adyg6r@r.postjobfree.com - https://www.linkedin.com/in/drewpalsgrove/ 3. Amherst: Logan Boyd, Senior Manager - Data Engineering, 512-***-****, adyg6r@r.postjobfree.com - https://www.linkedin.com/in/loganboyd/

4. Amherst: Olivia Stewart Lead Data Engineer, 316-***-****, adyg6r@r.postjobfree.com - https://www.linkedin.com/in/oestewart/

5. Concentrix: Rodrigo Madriz, Senior Service Delivery Manager, +506-**-**-**-**, adyg6r@r.postjobfree.com

- https://www.linkedin.com/in/rodrigo-madriz-zuniga/ LINKS:

Credential Verification https://www.credly.com/users/eimis-pacheco/badges Databricks Credential Verification https://credentials.databricks.com/profile/eimispacheco309262/ Link of Interest

https://amazon.qwiklabs.com/public_profiles/69c81b52-c6e7-4138-98b8

-d6f9958e7142

Link of Interest https://github.com/EimisPacheco



Contact this candidate