Post Job Free

Resume

Sign in

Data Engineer

Location:
VasanthaNagar, Karnataka, 560001, India
Salary:
850000
Posted:
January 09, 2024

Contact this candidate

Resume:

BASUDEV CHHOTARAY

DATA ENGINEER

CONTACT INFORMATION

Mobile: +91-824*******, 904*******

Email: ad2lsn@r.postjobfree.com

Address: Veerannapalya, Bengaluru, Karnataka, India 560045 LinkedIn: hƩps://www.linkedin.com/in/basudevchhotaraya1757290/ SUMMARY

Results oriented Data Engineer with three years of hands-on experience designing, implemenƟng, and opƟmizing end-to- end data soluƟons. Proficient in Python, Apache Spark, AWS, Snowflake, Apache Airflow, Ansible, Jenkins, Terraform, Docker, and SQL. Proven experƟse in developing and maintaining scalable data pipelines, ensuring seamless data flow, and integraƟng DevOps pracƟces for enhanced efficiency. EDUCATION

Master of Computer ApplicaƟon, Utkal University, Bhubaneswar, Odisha (August 2019)

Bachelor of Computer ApplicaƟon, Berhampur University, Odisha (August 2016) KEY SKILLS

Programming Languages:

Python (ETL, scripƟng, automaƟon)

Big Data Technologies:

Apache Spark (largescale data processing), Databricks Cloud Plaƞorms (AWS):

Amazon S3, EMR, Glue, RedshiŌ, Lambda (scalable data soluƟons) Data Warehousing:

Snowflake (opƟmized schema structures, improved query performance) Workflow OrchestraƟon:

Apache Airflow (complex data workflows, ETL automaƟon) Database Technologies:

SQL (database management, data querying)

NoSQL (handling unstructured or semi structured data) Version Control:

Git (version control, collaboraƟon)

DevOps Technologies:

Ansible (automaƟon, system reliability)

Jenkins (conƟnuous integraƟon, deployment streamlining) Infrastructure as Code (IaC):

Terraform (efficient infrastructure management)

ContainerizaƟon:

Docker (containerizaƟon for deployment)

PROFESSIONAL EXPERIENCE

Blackbuck Insights Pvt. Ltd. (18/01/2021 – Present) Projects:

1. TransUnion 01/11/2022 - Present

Project: Credit InformaƟon Processing and AnalyƟcs Plaƞorm Overview:

Spearheading the development of a comprehensive credit informaƟon processing and analyƟcs plaƞorm, emphasizing Apache Spark, Python, and DevOps pracƟces.

Achievements:

Designed and opƟmized ETL processes for large credit datasets, leveraging Apache Spark and Python to ensure data accuracy and maintain high performance.

Implemented DevOps strategies, uƟlizing Ansible and Jenkins for automaƟon, and terraform for infrastructure management, enhancing the overall agility of the data infrastructure.

Collaborated closely with data scienƟsts and analysts, uƟlizing Apache Airflow for workflow automaƟon, to derive meaningful insights from raw credit data, contribuƟng to informed decision making and risk assessment.

ConƟnuously evaluaƟng and implemenƟng new technologies, including Docker for containerizaƟon, to improve data processing efficiency and maintain a compeƟƟve edge in the credit informaƟon industry.

Leading efforts to enhance data security measures and ensure compliance with industry standards, parƟcularly in the sensiƟve field of credit informaƟon.

UƟlizing AWS services for scalable and cost-effecƟve data soluƟons. 2. CIOX Health 10/08/2021 - 16/09/2022

Project: Integrated Health Data Processing Plaƞorm Overview:

Led the development of an integrated health data processing plaƞorm, leveraging Apache Spark, Python, and Apache Airflow for efficient data orchestraƟon.

Achievements:

Engineered robust end-to-end ETL processes uƟlizing Apache Spark, ensuring seamless extracƟon, transformaƟon, and loading of diverse health data sources.

Implemented advanced data quality checks, significantly enhancing the integrity and accuracy of processed health data.

Successfully integrated Apache Airflow for complex workflow orchestraƟon, automaƟng ETL processes and ensuring Ɵmely and accurate data processing.

UƟlized DevOps pracƟces, including Ansible and Jenkins, for deployment automaƟon, resulƟng in a 25% reducƟon in deployment Ɵme.

Played a key role in the integraƟon of new health data sources, expanding the organizaƟon's data capabiliƟes and supporƟng data driven decision making in the healthcare domain.

Employed Snowflake as the primary data warehouse, opƟmizing schema structures for improved query performance.

CERTIFICATIONS

AWS CerƟfied Developer Associate:

Verified experƟse in AWS cloud technologies and hands-on experience in developing applicaƟons on AWS. Astronomer CerƟficaƟon for Apache Airflow Fundamentals:

Demonstrated proficiency in Apache Airflow for workflow automaƟon and management. PERSONAL STRENGTHS

Diligent and Honest:

Strong work ethic, dedicaƟon, and commitment to delivering high-quality results. Adaptability:

Quick adaptaƟon to new environments and challenges, ensuring seamless integraƟon into diverse teams. Friendly Team Player:

Excellent interpersonal skills, collaboraƟng effecƟvely with cross funcƟonal teams. Disciplined Approach:

Follow disciplined methodologies for systemaƟc problem solving and efficient task management.



Contact this candidate