Post Job Free
Sign in

Data Engineer

Location:
Boston, MA
Posted:
February 09, 2026

Contact this candidate

Resume:

SAI GEETA ACHARYA

Boston, MA 617-***-**** ******************@*****.*** LinkedIn

EDUCATION

Northeastern University, Boston, MA Aug 2023 - Aug 2025 Master of Science in Computer So2ware Engineering

• Relevant Coursework: Database management and Design, Agents of AI, Python Programming, Agile TECHNICAL SKILLS

• Programming & ScripDng: SQL, PL/SQL, Python, Shell ScripUng, Java

• ETL & Data Tools: Snowflake, InformaUca PowerCenter, IICS, Oracle, Qdrant

• Scheduling & DevOps: Control-M, CA7, GitHub, ServiceNow, JIRA, Confluence, Agile WORK EXPERIENCES

Street Care (Volunteer), Boston, MA Data Engineer Oct 2025 – Present

• CollaboraUng with program coordinators to understand data needs for community outreach, donaUons, and service tracking, translaUng requirements into analyzable data structures.

• Engineered a centralized data repository for community outreach, automaUng manual spreadsheet workflows into a structured SQL environment.

• DocumenUng data definiUons, assumpUons, and query logic to improve transparency, data accuracy, and future. Dana-Farber Cancer InsDtute, Boston, MA Data Engineer May 2024 - Dec 2024 Technologies: PL/SQL, SQL, Snowflake, IICS, Python, Git, Control-M

• Developed ETL pipelines in IICS enabling seamless data migraUon between Oracle and Snowflake for clinical data.

• Architected fact and dimension tables in Snowflake to support modernizaUon of legacy systems.

• OpUmized Snowflake SQL queries and warehouse usage to improve query performance

• Implemented CDGC (Cloud Data Governance & Catalog) for metadata, lineage and quality scoring.

• Created Python automaUon scripts for ingesUon validaUons, audits, and ETL triggers.

• Implemented Git version control and automated validaUon rules to improve data quality and anomaly detecUon for ETL assets. Wipro Technologies, Pune, India Developer Dec. 2020 - Oct. 2022 Technologies: PL/SQL, InformaUca PowerCenter, Unix Shell Scripts, Control-M, CA7, OFSAA

• Delivered 20+ SDLC releases for systems for a major US bank client, ensuring high-quality data ingesUon.

• OpUmized ETL performance by 30% using Shell scripUng and PL/SQL enhancements, while reducing processing Ume by 15% through advanced package and cursor tuning.

• Customized fraud detecUon scenarios, uUlizing PL/SQL to reduce false posiUves by 30%.

• Owned the root-cause analysis for recurring data issues, implemenUng permanent PL/SQL fixes to prevent future downUme.

• Performed comprehensive dataset reconciliaUon and validaUon across mulUple source systems to ensure data reliability for regulatory pipelines.

• Conducted InformaUca KT sessions to accelerate team workflow development and improve overall ETL efficiency. ACADEMIC PROJECTS Northeastern University, Boston, MA Smart Document Q&A System Java, Akka Cluster, Qdrant, OpenAI May 2025 – Aug 2025

• Designed and implemented a distributed document Q&A plakorm using Akka Cluster with mulU-node architecture.

• Integrated OpenAI GPT-3.5-turbo for semanUc understanding, delivering context-aware answers from uploaded PDFs with

~85% accuracy.

• Developed error-resilient data pipelines with robust excepUon handling for database failures. Dine Ease PL/SQL Jan. 2024 - Apr 2024

• Led development of a Restaurant Management System, order processing, inventory tracking, and table management to enable data-driven decision-making.

• Designed an Oracle PL/SQL database with tailored business rules and access roles to ensure customer data confidenUality.

• Created database views, complex SQL queries, and stored procedures to opUmize resource allocaUon and management.



Contact this candidate