Jersey City, NJ, 07306
... Strong background in ETL processes, data warehousing (Snowflake, Redshift, SQL Server, Oracle), and big data technologies (Hadoop, Spark, AWS, Azure Databricks) to manage and analyze large datasets efficiently. Experience in healthcare and finance ...
- May 02
Hyderabad, Telangana, India
... Assisted in minor Hadoop data ingestion using Hive, broadening experience with big data technologies. Employed Git for comprehensive version control, facilitating collaborative development and code management. Identified and implemented system ...
- May 01
Hyderabad, Telangana, India
... Worked extensively within the Hadoop ecosystem for initial batch transformations, leveraging Hive for structured data analytics and reporting. Implemented data validation and quality rules, ensuring high accuracy and reliability of insurance ...
- May 01
West Haven, CT
... MDM (Calero/MDSL), FactSet, S&P Capital IQ. Databases & Big Data: SQL Server, Oracle, Snowflake, Azure Synapse, PostgreSQL, Hadoop, Google BigQuery, Enterprise Data Lakes. ETL & Governance: Informatica, Apache Airflow, Data Mapping, Lineage, Metadata ...
- May 01
Hyderabad, Telangana, India
... Performed extensive data transformations using Hive on Hadoop (Cloudera), improving data quality and consistency for sales reporting. Built Spark batch processing jobs for sales reporting, leveraging Python for complex data manipulation and ...
- May 01
Hyderabad, Telangana, India
... (via Azure SQL DB concept), PySpark, Python, SQL, Git. Data Engineer @ UPS, Atlanta, GA, Dec 2021 – Sep 2023. Developed and optimized complex ETL pipelines using Spark (Scala and Python) on a Hadoop ecosystem, enhancing data processing capabilities. ...
- May 01
Danbury, CT
... Ltd, Apr 2018 - May 2020. Big Data Engineer, Hyderabad, India. • Built Hadoop ingestion pipelines using Sqoop to extract data from Oracle/MySQL into HDFS for downstream analytics. • Developed Spark/PySpark transformation jobs applying business rules for ...
- May 01
Plano, TX, 75074
... Technologies Used: Oracle, Informatica PowerCenter, MySQL, SQL, Unix, Shell Scripting, Python, Hadoop (HDFS, Hive), Git, Jenkins
- Apr 30
Chicago, IL
... SageMaker, S3, EC2, EKS) • Programming & Tools: Python, SQL • Data Engineering: ETL/ELT, Batch & Streaming Processing, Data Modeling, Content Migration • Big Data & Processing: PySpark, Spark SQL, Hadoop, Spark • Tools: Jenkins, Git, Jira, Tableau
- Apr 30
San Antonio, TX, 78229
... Environment: Power BI Desktop & Service, Power Query, DAX, Microsoft Fabric, SQL Server, Oracle, MySQL, Google BigQuery, Dremio, SSIS, SSAS, Tableau, R, Python, Excel, Jira, MS Project, Alteryx, Apache Hadoop, Agile. Education: Master’s in ...
- Apr 30