Hyderabad, Telangana, India
... Worked extensively within the Hadoop ecosystem for initial batch transformations, leveraging Hive for structured data analytics and reporting. Implemented data validation and quality rules, ensuring high accuracy and reliability of insurance ...
- May 01
West Haven, CT
... MDM (Calero/MDSL), FactSet, S&P Capital IQ; Databases & Big Data: SQL Server, Oracle, Snowflake, Azure Synapse, PostgreSQL, Hadoop, Google BigQuery, Enterprise Data Lakes; ETL & Governance: Informatica, Apache Airflow, Data Mapping, Lineage, Metadata ...
- May 01
Hyderabad, Telangana, India
... Performed extensive data transformations using Hive on Hadoop (Cloudera), improving data quality and consistency for sales reporting. Built Spark batch processing jobs for sales reporting, leveraging Python for complex data manipulation and ...
- May 01
Hyderabad, Telangana, India
... (via Azure SQL DB concept), PySpark, Python, SQL, Git. Data Engineer @ UPS, Atlanta, GA, Dec 2021 – Sep 2023: Developed and optimized complex ETL pipelines using Spark (Scala and Python) on a Hadoop ecosystem, enhancing data processing capabilities. ...
- May 01
Danbury, CT
... Ltd, Apr 2018 - May 2020, Big Data Engineer, Hyderabad, India • Built Hadoop ingestion pipelines using Sqoop to extract data from Oracle/MySQL into HDFS for downstream analytics. • Developed Spark/PySpark transformation jobs applying business rules for ...
- May 01
Plano, TX, 75074
... Technologies Used: Oracle, Informatica PowerCenter, MySQL, SQL, Unix, Shell Scripting, Python, Hadoop (HDFS, Hive), Git, Jenkins
- Apr 30
Chicago, IL
... SageMaker, S3, EC2, EKS) • Programming & Tools: Python, SQL • Data Engineering: ETL/ELT, Batch & Streaming Processing, Data Modeling, Content Migration • Big Data & Processing: PySpark, Spark SQL, Hadoop, Spark • Tools: Jenkins, Git, Jira, Tableau
- Apr 30
San Antonio, TX, 78229
... Environment: Power BI Desktop & Service, Power Query, DAX, Microsoft Fabric, SQL Server, Oracle, MySQL, Google BigQuery, Dremio, SSIS, SSAS, Tableau, R, Python, Excel, Jira, MS Project, Alteryx, Apache Hadoop, Agile. Education: Master’s in ...
- Apr 30
Oakwood Glen, TX, 75025
... Programming Languages: Python, Scala, SQL; Cloud & Big Data: AWS (S3, EMR, Athena), Azure (ADLS, Databricks), Apache Spark, Hadoop, Hive; Version Control & DevOps: GitHub, Jenkins, Docker, Kubernetes; Methodologies: Agile, SDLC, Data Modeling WORK ...
- Apr 30
Hyderabad, Telangana, India
... Composer, BigQuery, Dataproc, Pub/Sub Proven expertise in Big Data ecosystems, delivering scalable and fault-tolerant pipelines using Apache Hadoop, Spark, Kafka, Databricks, and Snowflake for high-volume data processing and streaming use cases. ...
- Apr 30