Marietta, GA, 30062
... Skills • Languages & Scripting: Python (Pandas, NumPy), SQL (CTEs, Window Functions), Scala, Java, Bash • Big Data & Streaming: Apache Spark (PySpark – Batch & Structured Streaming), Kafka, Hive, Hadoop (basic) • ETL & Data Engineering: ETL/ELT ...
- Apr 30
Legacy Town Center North, TX, 75024
... Warehousing & ETL: Data Warehousing, ETL Development, Informatica, AWS Glue, Azure Data Factory, GCP Dataflow, Spark, Hadoop, Databricks, Snowflake • Orchestration & Workflow: Apache Airflow, AWS Step Functions, GCP Cloud Composer • Cloud Platforms: ...
- Apr 30
Hunters Glen, TX, 75023
... Processed large-scale datasets using Hadoop and Hive for historical financial data analysis, leveraging distributed computing capabilities. Implemented batch scheduling and monitoring using Control-M, ensuring timely and accurate delivery of ...
- Apr 30
Plano, TX
... Warehousing • ETL & Orchestration: Informatica PowerCenter, Apache Airflow, AWS Glue • Big Data Technologies: Apache Spark, Hadoop, Databricks, Snowflake • Cloud Platforms: AWS (S3, EMR, Lambda, Athena) • Version Control & DevOps: GitHub, Jenkins, Docker ...
- Apr 30
Hyderabad, Telangana, India
... Lambda, Redshift, Athena) • Big Data Technologies: Apache Spark (PySpark), Hadoop, EMR • Version Control & DevOps: Git, Jenkins, Docker • Data Visualization: Tableau • Methodologies: Agile, Scrum • WORK EXPERIENCE • Senior Data Engineer @ Tailored Brands, Inc. ...
- Apr 30
Texas
... Implemented batch processing pipelines leveraging Hadoop and Hive for processing vast volumes of retail transaction data. Executed comprehensive data cleansing, transformation, and validation routines to maintain data quality standards across ...
- Apr 30
Aubrey, TX
... PROFESSIONAL SUMMARY Experienced Data Engineer and Senior BizOps Engineer with over 8 years of experience in designing, building, and supporting large-scale batch processing systems and data pipelines across Hadoop and Apache NiFi ecosystems. ...
- Apr 30
Hyderabad, Telangana, India
... Azure Data Factory, Git, Agile • Data Engineer @ CISCO, San Jose, CA, Jan 2024 – Aug 2024 • Built scalable data pipelines using Spark-Scala on a Hadoop ecosystem, processing massive datasets for analytics and reporting, contributing to robust data flows. ...
- Apr 30
Hyderabad, Telangana, India
... Snowflake, DynamoDB, Hive • ETL & Data Warehousing: Informatica, Azure Data Factory, AWS Glue, Databricks, Spark (PySpark), Hadoop, Data Lake, Dimensional Modeling • Orchestration & DevOps: Airflow, Jenkins, Docker, AWS Step Functions, Git/GitHub ...
- Apr 30
Plano Park, TX, 75074
... SQL Server • ETL & Orchestration: Informatica PowerCenter, Apache Airflow, AWS Glue, Azure Data Factory • Big Data Technologies: Hadoop, Spark, PySpark, Kafka, Delta Lake • Cloud Platforms: AWS (S3, EMR, Lambda), Azure (ADLS, Databricks) • DevOps & Version ...
- Apr 30