Post Job Free
Sign in

Data Engineer Analyst

Location:
Mission Hill, MA, 02120
Salary:
80000
Posted:
September 10, 2025

Contact this candidate

Resume:

Sarvesh Santosh Sawant

857-***-**** *******.***********@*****.*** LinkedIn

Summary

Data Engineer and Analyst with over 5+ years of experience designing scalable ETL pipelines, optimizing data workflows, and building end-to-end analytics solutions across cloud platforms (AWS, Azure, GCP). Proven ability to integrate and automate data pipelines using tools like SQL, Python, Alteryx, Airflow, and Power BI. Adept at cross-functional collaboration, data governance, and transforming complex datasets into actionable insights that support strategic decision-making. Passionate about solving real-world problems through data engineering, predictive analytics, and cloud-native BI solutions. PROFESSIONAL EXPERIENCE

Data Analyst Feb 2025 – Present

KGS Technology Group Inc, Alpharetta, USA

• Advanced SQL Analysis & Time Series Forecasting: Leveraged advanced SQL functions, including WINDOW and AGGREGATE, to perform complex insurance data calculations and time series forecasting. Integrated Python libraries such as Pandas for data preprocessing and Matplotlib for visualizing actionable insights, enhancing data-driven decision-making processes.

• Data Integration & Automation in Azure Environments: Engineered solutions to integrate structured and unstructured datasets across SQL Server and Azure environments. Utilized Azure PowerShell scripts to automate data workflows and enforce governance policies, ensuring data consistency and compliance with organizational standards.

• Power BI Dashboard Development & Stakeholder Collaboration: Collaborated with cross-functional stakeholders to define data reliability metrics and business requirements. Designed and deployed over five interactive Power BI dashboards utilizing DAX for complex calculations, providing real-time insights into key business metrics. This initiative led to a 20% improvement in the efficiency and effectiveness of analytical solutions.

Data Engineer Mar 2024 – Jan 2025

Salt-Tech Inc, DELAWARE

• ETL Workflow Development & Integration: Engineered and maintained robust ETL pipelines utilizing Alteryx and SQL Server, facilitating seamless data extraction, transformation, and loading processes. Integrated Apache Airflow to orchestrate and synchronize data workflows between IoT databases and AWS Redshift, ensuring efficient data movement and processing across cloud environments.

• Pipeline Optimization & Performance Enhancement: Enhanced Directed Acyclic Graphs (DAGs) within Apache Airflow by optimizing task parallelism and leveraging dynamic task mapping. Implemented best practices such as modular DAG design and efficient resource allocation, resulting in a 30% reduction in latency for high-volume financial data workflows.

• Agile Project Management & Version Control: Led Agile sprint planning and backlog grooming sessions, collaborating cross-functionally with analysts and engineers to define project scopes and timelines. Utilized GitHub for version control, ensuring seamless collaboration and maintaining code integrity throughout the development lifecycle.

• Cross-Functional Collaboration & Data Quality Assurance: Collaborated closely with data analysts and engineers to enhance data quality, documentation, and reliability for analytics use cases. Implemented data validation frameworks and standardized data formats, ensuring consistent and accurate data for downstream analytics. Data Analyst Intern Jan 2024 – Mar 2024

Humana, Louisville, KY

• Engineered and maintained cloud-based ETL pipelines, integrating diverse healthcare datasets (claims, EHR, pharmacy) to support analytics and reporting needs across multiple business units.

• Developed and automated SQL-based reporting solutions, reducing manual reporting time by 30% and enhancing data accessibility for stakeholders.

• Collaborated with cross-functional teams to design and implement data governance policies, improving data quality and compliance with healthcare regulations.

• Created and maintained interactive dashboards using Power BI, providing real-time insights into key performance indicators and supporting data-driven decision-making.

• Conducted in-depth analyses of healthcare utilization and cost trends, identifying opportunities for cost savings and operational efficiencies.

• Provided training and support to business users on data tools and best practices, fostering a data-driven culture within the organization.

Data Engineer Mar2021 – Dec 2021

Silgate Solutions Ltd, India

• Developed scalable ETL workflows to process and transform large volumes of client data (5– 10 TB/week), leveraging Python, SQL, and batch processing frameworks.

• Collaborated with cross-functional teams—including BPO, KPO, and IT units—to integrate data pipelines from diverse sources like call-center logs and digital dashboards.

• Implemented data quality checks and cleansing routines to enhance data reliability for downstream analytics, reducing errors by ~30%.

• Optimized database schemas and indexing strategies in PostgreSQL/MySQL, improving query performance by ~40%.

• Automated data ingestion processes using Airflow (or your tool of choice), ensuring reliable pipeline execution with retry and logging mechanisms.

• Deployed monitoring and alerting solutions (e.g., with Grafana/Prometheus) to track pipeline health and proactively address failures.

Data Engineer Jan 2020 – Mar 2021

Larsen and Toubro Infotech, Airoli, India

• Improved data accuracy by 80% through advanced SQL queries using JOINS, resolving 500+ IoT discrepancies in transaction data ensuring data integrity and consistency

• Developed robust ETL pipelines using Alteryx and integrated them with SQL Server and AWS Redshift, reducing processing time by 25% and designed logical and physical data models to support decision making

• Used advanced T-SQL and PL/SQL for resolving transactional discrepancies in IoT and financial datasets, enhancing data accuracy by 80%

• Integrated data from Salesforce CRM into reporting pipelines to track sales performance trends across business units and designed star and snowflake schemas for data warehouse, enabling financial insights

• Built 10+ Tableau dashboards with MDX and complex joins to track KPIs and revenue trends across operations TECHNICAL SKILLS

Languages: SQL, Python, R, Java, C, C++

Databases: MySQL, Oracle SQL, SQL Server, Azure Data Studio, PostgreSQL, NoSQL ETL & BI: Alteryx, Talend, ER/Studio, Data Profiling, Tableau, Power BI, Airflow, Informatica, Qlik Replicate Utilities: Jupiter Notebook, Excel, MS Office, PowerPoint, Access, Word Project Management Tools: JIRA, Confluence, Teams, Smartsheet Cloud Platforms: AWS (EC2, DynamoDB, S3, Redshift, Glue), GCP, Azure Databricks, Snowflake Cloud Data Services: AWS Lambda, AWS Glue, Azure Data Factory, Google BigQuery, Cloud Storage, Serverless Computing Visualization & Reporting: DAX (Power BI), MDX (OLAP), Custom Visuals Development, Dashboard Optimization Collaboration & Agile Tools: Slack, Zoom, MS Teams, Agile/Scrum Methodologies, Jira Workflow Management Others: REST API Integration, JSON, XML, Web Services, Metadata Management, Automated Testing & Validation Frameworks

PROJECTS

Oil and Gas Pricing & Commodity Contract Prediction (JPMorgan Chase) Jul 2024 – Aug 2024

• Improved price forecast accuracy by 15% using predictive models like Linear Regression and Random Forest

• Created 10+ interactive Power BI dashboards analyzing market trends to drive commodity contract decisions

• Conducted EDA for handling missing values and dimensionality reduction, enhancing data analytics clarity NYC Motor Collision Collisions Data Warehousing & BI Sep 2022 – Dec 2022

• Designed star and snowflake schemas for dimensional modeling and addressed data quality gaps

• Built 10+ workflows using Alteryx and Talend for data profiling transferring data from GCP to EDW

• Created 10 interactive dashboards with Tableau and Power BI facilitating Variance Analysis and reporting EDUCATION

Master of Science, Information Systems

Northeastern University, Boston, MA Jan 2022 – Dec 2023 Coursework: Designing Advanced Data Architecture for Business Intelligence, Database Management Systems Bachelor of Engineering, Computer Engineering

St. Francis Institute of Technology, Mumbai, India Jul 2015 – May 2019 Coursework: Data Mining, Distributed Databases, Data Science Engineering Methods and Tools



Contact this candidate