Plano, TX
... EDUCATION: Master of Science in Computer Science @ Illinois Institute of Technology TECHNICAL SKILLS: Programming & Scripting: Python, Shell Scripting, Scala, SQL, PL/SQL, Perl, Java Data Warehousing: Oracle Exadata, Hadoop, Hive, Databricks, ...
- Apr 30
Plano, TX, 75023
... Analytics, Azure Databricks, Agile Data Engineer @ SIRVA Worldwide Chicago, Illinois Oct 2018 – May 2020 Developed and optimized batch data pipelines using Spark (Scala) for large-scale data processing within the Hadoop ecosystem effectively. ...
- Apr 30
Hyderabad, Telangana, India
... Cloud Platforms: AWS (S3, EMR, Redshift, Glue, Lambda), Azure (ADF, ADLS Gen2, Synapse Analytics) Data Warehousing & ETL: Hadoop, Spark, Hive, Snowflake, Informatica PowerCenter, Azure Data Factory, AWS Glue, ETL Design Orchestration & Automation: ...
- Apr 30
St. Louis, MO
... • Migrated legacy Hadoop workloads to Databricks and BigQuery, identifying system and architecture improvements that reduced pipeline runtime and operational cost. • Implemented Delta Lake with Unity Catalog and Apache Iceberg for schema enforcement ...
- Apr 30
Maryland Heights, MO, 63043
... Technologies Used: Linux, Shell Scripting, Oracle Exadata, Informatica, Airflow, Python, SQL, AWS (S3), Azure (ADLS, ADF), PySpark, Hadoop, Docker, Jenkins, Agile Data Engineer @ Molina Healthcare Long Beach, CA Jan 2021 – Jul 2022 Managed Linux ...
- Apr 30
Hyderabad, Telangana, India
... Migrated batch data from Oracle to Cloudera Hadoop using Sqoop, optimizing the process for large datasets on Linux. Developed Hive queries for aggregating sales and inventory data within the Hadoop ecosystem, supporting business intelligence needs. ...
- Apr 30
Plano, TX
... Worked with Hadoop and Hive for querying and managing large volumes of semi-structured data within the data lake environment. Performed data quality validation and cleansing to ensure the reliability and accuracy of data within the enterprise data ...
- Apr 30
Santo Domingo, Distrito Nacional, Dominican Republic
... Docker, Kubernetes, and CI/CD • Engineered data pipelines and performed feature engineering using big data tools (Spark, Hadoop, SQL/NoSQL) • Deployed scalable AI solutions using cloud ML platforms (AWS SageMaker, Google AI Platform, Azure ML) • ...
- Apr 29
Plano, TX
... Factory, dbt Linux & Systems: Unix/Linux, File Systems, Shell Scripting, Process Automation Big Data: Apache Spark, PySpark, Hadoop, Hive, Kafka Cloud: AWS (S3, EMR, Lambda), Azure (Data Lake, Databricks) Databases: Oracle, PostgreSQL, MySQL, SQL ...
- Apr 29
Dallas, TX
... Ingested data from 160+ legacy systems into Hadoop staging environments. Developed ETL solutions using Informatica PowerCenter, BDE (Blaze), and Power Exchange. Performed performance tuning and optimization of long-running ETL jobs. Automated ...
- Apr 29