Boston, MA
... Databricks, Snowflake, AWS Glue, Hadoop, SQL, dbt, Airflow, and BigQuery, supporting more than 250 recurring reporting and ingestion workflows across enterprise data operations while reducing batch processing interruptions and reporting delays. ...
- Jun 19
New York City, NY
... Retrieval Systems Big Data, Streaming & Data Engineering: Apache Spark (PySpark, Structured Streaming), Apache Kafka, Hadoop, AWS Glue, AWS EMR, Data Pipelines, ETL Development, Batch & Real-Time Data Processing, Data Cleaning, Data Wrangling ...
- Jun 18
Frisco, TX
... Hands-on experience with major components of the Hadoop ecosystem, such as Hadoop MapReduce, HDFS, YARN, Cassandra, Hive, Pig, HBase, Sqoop, Oozie, and Kafka. Extensively involved in ETL data warehousing using Informatica PowerCenter 7.x/8.x/9.x/10.x Designer ...
- Jun 17
Miami, FL
... Worked with the Data Engineering team to automate ETL workflows using Python scripts and Spark jobs running on Hadoop clusters. Monitored data pipelines and optimized Spark performance by tuning partitioning, caching, and memory configurations. ...
- Jun 17
Mooresville, NC, 28117
... Led enterprise migrations and platform modernization, including Netezza-to-SailFish and DBaaS/Oracle transitions, Hadoop deployments, and accelerated adoption of new analytics through proof-of-concept initiatives. Spearheaded delivery governance, ...
- Jun 17
Ho Chi Minh City, Vietnam
... *********@*****.*** EDUCATION **** - **** **** ********** ** Technology and Engineering Data Engineering GPA: 3.1 SUMMARY Data Engineering student seeking an internship opportunity to apply knowledge in Python, Java, databases, Hadoop and Spark. ...
- Jun 17
United States
... June 2015 – Dec 2015 Role: Linux/Hadoop Systems Administrator Responsibilities: Responsible for deploying, configuring, administering, and supporting RHEL 5.x and 6.x systems. Built servers using Kickstart, Red Hat Satellite Server, and vSphere Client. Installed ...
- Jun 16
St. Petersburg, FL, 33705
... (FAISS, Pinecone, ChromaDB) ● Data Engineering & Big Data: ETL/ELT Pipelines, Data Ingestion, Apache Kafka, Apache Spark, Hadoop, Databricks ● Cloud & Data Warehousing: AWS (S3, Lambda, Glue, Redshift), Google BigQuery, Snowflake, Azure Data Factory ...
- Jun 15
United States
... Big Data Technologies Hadoop, Pig, Hive, Data Warehousing, Sqoop, Apache Storm, Kafka, Spark, PySpark, Spark Streaming, Spark SQL and DataFrames, GraphX, Scala, Elasticsearch, GCP, BigQuery, GCP Dataproc, Cloud Run, AWS, Avro. AWS, Amazon EC2, S3 ...
- Jun 15
Pointe-Claire, QC, Canada
... Conversational RAG, Agentic RAG, Multimodal RAG • Big Data & Distributed Processing: Databricks, PySpark, Spark SQL, Hadoop, DataFrame API, Distributed Joins, Window Functions, Batch Feature Engineering, Data Lake Architectures • MLOps & Model ...
- Jun 13