Data Engineer Information Technology

Location:
Worcester, MA
Posted:
February 25, 2025

Resume:

SRI RANJANI SALLA

+1-857-***-**** *****************@*****.*** LinkedIn GitHub

Education

Clark University January 2024 - May 2025

Master of Science in Information Technology Massachusetts, United States

Key Courses: Cyber Security, Networking Fundamentals, Data Management for Info Tech, Python for Data Analytics & Data Mining

Malla Reddy Engineering College For Women July 2018 - May 2022

Bachelor of Engineering in Computer Science Hyderabad, India

Professional Experience

Accenture (Insurance - ETL Project) August 2022 - December 2023

Data Engineer Analyst Hyderabad, India

• Developed scalable ETL workflows using Snowflake and Ab Initio to automate data extraction, transformation, and loading from multiple sources, including MySQL.

• Migrated an on-premises Teradata warehouse to the Snowflake cloud platform.

• Reduced manual data processing time by 60% by automating ETL processes and updating ETL logs with Linux.

• Designed ETL pipelines with Ab Initio and Databricks to enhance data governance and compliance.

• Employed AutoSys to schedule ETL workflows and handle large datasets from multiple sources.

• Created Power BI dashboards to monitor key performance metrics, cutting reporting time by 60%.

• Integrated AWS S3 with Snowflake to manage and ingest raw insurance data, including claims, customer profiles, and policies.

Cognizant February 2022 – September 2022

Big Data Engineer Hyderabad, India

• Designed and implemented large-scale data pipelines using PySpark and Scala, streamlining critical insurance operations.

• Executed workflows with PySpark and Databricks, improving system performance and ensuring accurate data transformation.

• Applied validation, cleaning, and transformation techniques to uphold data quality and integrity.

• Developed Power BI/Tableau dashboards, reducing reporting time by 40% and aiding data-driven decision-making.

• Managed cloud environments using PuTTY, executed PySpark scripts, and integrated with Snowflake for efficient data processing and secure storage.

Relevant Projects

Lightweight Privacy-preserving Medical Diagnosis in Edge Computing August 2023 - March 2024

• Built on the concept of letting mobile users submit symptoms at any time and receive diagnosis results.

• With the extensive storage space and computing capacity available in cloud computing, machine learning over outsourced medical data has been studied extensively.

• Training an accurate AI/ML diagnosis model requires sharing training data that is distributed among various medical institutions.

Analyzing Real Estate Trends December 2024 – January 2025

• Cleaned and processed a dataset of 1,459 entries with 46 features, resolving missing values and scaling numerical data to ensure seamless integration into machine learning workflows.

• Conducted exploratory analysis using correlation heatmaps, pair plots, and distribution visualizations, identifying key predictors and improving data-driven decision-making.

• Automated encoding and scaling pipelines for 100% of features, reducing manual errors and improving data preparation efficiency for predictive modeling.

Technical Skills

Languages: Python, R, SQL, T-SQL, Scala, Java, HTML/CSS

Databases: Snowflake, MySQL, Power Query

Libraries/Frameworks: ETL, PySpark, Hadoop, Spark, Databricks, Power BI, Tableau, MS Access, MS Excel

Tools/Applications: Snowflake, Azure, PuTTY, Git, AutoSys, AWS S3

Certifications

• Snowflake Data Warehouse Badge

• Snowflake Database

• Databricks


