Data Engineer Computer Science

Location:

Cleveland, OH, 44114

Posted:

February 09, 2025

Contact this candidate

Resume:

DIVYA KOLA

Data Engineer

Ohio, United States

Phone: 216-***-****

Email: ***********@*****.***

LinkedIn: www.linkedin.com/in/koladivya

EDUCATION

Campbellsville University • Kentucky, USA January 2023 – August 2024

Masters in computer science GPA: 4.0/4.0

Relevant Courses: Operating Systems, Database management System, Advanced Database Management,

Software Engineering, Cyber Security, Emerging Technologies, Cyber Law, Artificial Intelligence

Jawaharlal Nehru Technological University • Hyderabad, India June 2017– April 2020

Bachelor of Technology in Computer Science and Engineering GPA: 7.32/10

Relevant Courses: Object Oriented Programming (Java): Data Structures and Algorithms; Data Warehousing, Computer Networks, Web Technology, Software Development.

SKILL

Cloud Platforms : Amazon Web services (AWS)- EC2, S3, Lambda, Redshift, DynamoDB, AWS Glue

Programming Languages : Python (Pandas, NumPy), SQL

Data Transformation : Skilled in data cleansing, integration, aggregation, lineage, batch &real time ETL

processing

Data Warehousing and Databases: proficient in Redshift and DynamoDB for efficient data storage and analytics

ETL Tools : Expertise in AWS Glue for building scalable ETL workflows.

Data Analysis Reporting : Experienced in data profiling for insights and decision-making

Containerization and Orchestration: Hands-on experience with Docker for application deployment and orchestration

SQL : Advanced skills in writing optimized queries, stored procedures, data warehousing

solutions

WORK EXPERIENCE

Data Engineer January 2024 to December 2024

Client: Edward Jones, St.Louis

Designed and maintained scalable ETL pipelines to extract, transform, and load data from various sources, including flat files, databases, and APIs.

Developed ETL processes for staging data from multiple sources such as CSV, XML, and XLSX, ensuring efficient data integration.

Built and managed data lakes and warehouses on AWS S3 and Redshift to support advanced analytics and reporting needs.

Created ETL pipelines using AWS Glue to process S3 Parquet files in data lake environments.

Ability to perform SQL queries and light scripting to provide basic data analysis.

Secures information by completing database backups.

Monitor and maintain the performance and availability of data pipelines. Troubleshoot issues in the data pipelines and resolve problems with data ingestion, processing, and storage.

Maintains operations by following policies and procedures; reporting needed changes.

Developed custom data transformation and ETL workflows using Python-based AWS Lambda functions.

Utilized Python libraries such as Pandas and PySpark for data manipulation, cleaning, and analysis.

Optimized database systems with indexing, aggregation, and materialized views to enhance query performance.

Collaborated with data analysts and business users to gather requirements, deliver data solutions, and provide technical guidance.

Strong problem-solving skills and the ability to work independently and as part of a team.

Develop, test, and deploy ETL pipelines to move and transform data across multiple data sources.

Strong Knowledge in Python and object-oriented programming.

Experience working with AWS cloud services including S3, AWS Batch, ECR, Lambda, SNS, SQS, and other managed cloud platforms.

Updated data mapping documents to reflect actual development and ensure alignment with business goals.

Provide technical assistance to teammates as needed through collaboration and by sharing your expertise

AWS Data Engineer May 2020 – July 2022

Optus Infotech Pvt.Ltd, India

Extensive hands-on experience with AWS services, including EC2, S3, IAM, Lambda, Glue, SNS, Aurora, and DynamoDB.

Designed and implemented data pipelines using AWS Glue and Lambda to process and transform large-scale datasets.

Managed data lake architectures on S3 for efficient storage and retrieval of structured and unstructured data.

Developed database objects, including tables, views, and stored procedures, to support application-specific requirements.

Optimized query performance and managed databases using Amazon RDS and DynamoDB.

Integrated APIs and third-party data sources into AWS environments in collaboration with senior engineers.

Gathered and documented technical requirements, creating functional and technical design specifications.

Identify and pursue database enrichment and cleansing projects.

Create and present data-driven reports to executive leadership, supporting strategic decision-making

Initiative and passion to work independently.

Flexibility, strong decision-making skills, and effective time management skills.

Update documentation, reports, and maintain training materials associated with the Data Warehouse as well as Business Intelligence.

Gather requirements from clients, partners, and all internal departments via understanding their needs, challenges, and goals.

Designed and implemented ETL pipelines (SSIS, SQL Server, Snowflake) to enhance data infrastructure, supporting business insights and improving organizational strategy.

Built dimensional data models (Kimball, DVM) and optimized data workflows, improving business insights and enabling analytics for mental healthcare applications.

Led end-to-end data engineering projects, mentoring junior developers and ensuring adherence to best practices in ETL, data modeling, and organizational strategy.

Developed Power BI and Cognos dashboards, integrating data infrastructure with business intelligence solutions for mental healthcare analytics and strategic decision-making.

Collaborate with software developers and data scientists to deliver cross-departmental solutions.

Develop and support analysis of marketing campaigns.

Provide ad hoc analytical support to managers and departments as needed.

Engineering Trainee –Internship Jan 2020 – May 2020

Designed and developed an Emotion-Based Music Player leveraging Python, OpenCV, and TensorFlow for facial expression recognition.

Implemented a real-time emotion detection module to recommend mood-specific playlists using a curated music library.

Built a recommendation engine supported by a relational database (MySQL) for categorizing music by mood and genre.

Designed an intuitive user interface to enhance user experience, integrating features like play, pause, and dynamic playlist adaptation.

Optimized facial expression detection and music selection processes, reducing latency by 30%.

ACADEMIC PROJECTS & CERTIFICATIONS

Certificate on SQL - Completed Course by Demonstrating Theoretical and Practical Understanding of SQL

Certificate on Python

Contact this candidate