SAI GEETA ACHARYA
Boston, MA 617-***-**** ******************@*****.*** LinkedIn
EDUCATION
Northeastern University, Boston, MA Aug 2023 - Aug 2025 Master of Science in Computer So2ware Engineering
• Relevant Coursework: Database management and Design, Agents of AI, Python Programming, Agile TECHNICAL SKILLS
• Programming & ScripDng: SQL, PL/SQL, Python, Shell ScripUng, Java
• ETL & Data Tools: Snowflake, InformaUca PowerCenter, IICS, Oracle, Qdrant
• Scheduling & DevOps: Control-M, CA7, GitHub, ServiceNow, JIRA, Confluence, Agile WORK EXPERIENCES
Street Care (Volunteer), Boston, MA Data Engineer Oct 2025 – Present
• CollaboraUng with program coordinators to understand data needs for community outreach, donaUons, and service tracking, translaUng requirements into analyzable data structures.
• Engineered a centralized data repository for community outreach, automaUng manual spreadsheet workflows into a structured SQL environment.
• DocumenUng data definiUons, assumpUons, and query logic to improve transparency, data accuracy, and future. Dana-Farber Cancer InsDtute, Boston, MA Data Engineer May 2024 - Dec 2024 Technologies: PL/SQL, SQL, Snowflake, IICS, Python, Git, Control-M
• Developed ETL pipelines in IICS enabling seamless data migraUon between Oracle and Snowflake for clinical data.
• Architected fact and dimension tables in Snowflake to support modernizaUon of legacy systems.
• OpUmized Snowflake SQL queries and warehouse usage to improve query performance
• Implemented CDGC (Cloud Data Governance & Catalog) for metadata, lineage and quality scoring.
• Created Python automaUon scripts for ingesUon validaUons, audits, and ETL triggers.
• Implemented Git version control and automated validaUon rules to improve data quality and anomaly detecUon for ETL assets. Wipro Technologies, Pune, India Developer Dec. 2020 - Oct. 2022 Technologies: PL/SQL, InformaUca PowerCenter, Unix Shell Scripts, Control-M, CA7, OFSAA
• Delivered 20+ SDLC releases for systems for a major US bank client, ensuring high-quality data ingesUon.
• OpUmized ETL performance by 30% using Shell scripUng and PL/SQL enhancements, while reducing processing Ume by 15% through advanced package and cursor tuning.
• Customized fraud detecUon scenarios, uUlizing PL/SQL to reduce false posiUves by 30%.
• Owned the root-cause analysis for recurring data issues, implemenUng permanent PL/SQL fixes to prevent future downUme.
• Performed comprehensive dataset reconciliaUon and validaUon across mulUple source systems to ensure data reliability for regulatory pipelines.
• Conducted InformaUca KT sessions to accelerate team workflow development and improve overall ETL efficiency. ACADEMIC PROJECTS Northeastern University, Boston, MA Smart Document Q&A System Java, Akka Cluster, Qdrant, OpenAI May 2025 – Aug 2025
• Designed and implemented a distributed document Q&A plakorm using Akka Cluster with mulU-node architecture.
• Integrated OpenAI GPT-3.5-turbo for semanUc understanding, delivering context-aware answers from uploaded PDFs with
~85% accuracy.
• Developed error-resilient data pipelines with robust excepUon handling for database failures. Dine Ease PL/SQL Jan. 2024 - Apr 2024
• Led development of a Restaurant Management System, order processing, inventory tracking, and table management to enable data-driven decision-making.
• Designed an Oracle PL/SQL database with tailored business rules and access roles to ensure customer data confidenUality.
• Created database views, complex SQL queries, and stored procedures to opUmize resource allocaUon and management.