Plano, TX, 75024
... Technologies Used: AWS (Redshift, S3), Linux, Shell Scripting, Python, Apache Airflow, SQL, Oracle, Hadoop, Spark, ETL, Data Warehousing, Agile, Git, Unix. TECHNICAL SKILLS: Programming & Scripting: Python, Shell Scripting, SQL, Perl, Java. Data ...
- Apr 30
Plano, TX
... dbt. Cloud Platforms: AWS (S3, Glue, Lambda), Azure (Databricks, Data Lake), GCP (BigQuery, Dataflow). Big Data Technologies: Hadoop, Hive, Kafka, Flink, PySpark. DevOps & Version Control: Git, Jenkins, Docker, Kubernetes, Terraform. Reporting & ...
- Apr 30
Fort Worth, TX
... Azure (Data Lake, Data Factory, ADLS), SSIS, Alteryx, Apache Airflow, Snowflake • Big Data & Streaming: Apache Spark, Hadoop, Kafka, Flink, Hive, Databricks • ETL & Orchestration: Apache Airflow, NiFi, SSIS, Informatica, Docker, Kubernetes • ...
- Apr 30
McDermott Place, TX, 75025
... SQL Server, DynamoDB, Hive, Snowflake. Data Warehousing & ETL: Informatica PowerCenter, Azure Data Factory, AWS Glue, Hadoop, Spark (PySpark), Databricks, Delta Lake. Orchestration & Automation: Apache Airflow, Azure Data Factory Triggers, AWS ...
- Apr 30
Plano, TX
... Developed extensive shell scripts for automating data ingestion, processing, and transformation tasks within Hadoop and Spark ecosystems. Designed and implemented robust ETL pipelines using Spark (Scala) for processing complex insurance data, ...
- Apr 30
Plano, TX, 75025
... Worked with Apache Hadoop and Hive for processing and analyzing historical healthcare data within a distributed Linux environment. Contributed to initial assessments for migrating on-premises data infrastructure to cloud platforms, planning future ...
- Apr 30
Plano, TX, 75025
... Implemented robust batch processing frameworks using Hadoop and Hive on Linux clusters for large-scale financial data volumes. Designed and implemented Kafka-based ingestion pipelines for real-time market data, ensuring low-latency data availability ...
- Apr 30
India
... Implemented batch processing workflows on Hadoop, utilizing Linux commands and shell scripts for efficient large-scale data analysis and file system management. Ingested CSV and JSON files from various upstream systems into HDFS, leveraging Unix ...
- Apr 30
Santa Clara, CA
... AWS (S3, EC2, Lambda, Redshift), Azure (Data Factory, Synapse), GCP (BigQuery – Basic). Big Data & Processing: Apache Spark, Hadoop, Hive, Distributed Data Processing. Visualization & BI Tools: Power BI, Tableau, Excel, KPI Dashboards, Interactive ...
- Apr 30
Hyderabad, Telangana, India
... EDUCATION: Master of Science in Computer Science @ Texas Tech University. TECHNICAL SKILLS: Programming Languages: Python, Shell Scripting, SQL, Perl. Data Warehousing: Oracle Exadata, Azure Synapse Analytics, BigQuery, Hadoop, Hive. ETL & ...
- Apr 30