Phone: 551-***-****
Address: *** ********* ***, ****** ****, New Jersey
Email: ******@****.**.***
LinkedIn: linkedin.com/in/sweetysaha
About:
With 7 years of experience, I excel in turning intricate requirements into elegant insights. Proficient in ETL, Python, and cloud technologies like AWS and Google Cloud, I've guided teams to success and worked with clients in the US, UK, and renowned companies like Google for 3 years. I've consistently delivered results for clients like Reckitt, showcasing a commitment to data quality and innovation.
Professional Experience:
Data Analysis and Data Modeling Professional: Specialized in applied information technology as a data analysis and data modeling professional.
ETL Expertise: Proficient in ETL processes, especially with Informatica, encompassing Designer, Workflow Manager, Repository Manager, ETL, and Data Warehouse.
AWS Proficiency: A strong foundation in Amazon Web Services (AWS) concepts, particularly EMR and EC2, for efficient data analytics.
End-to-End Data Analysis: Skills cover data mining, acquisition, preparation, data manipulation, feature engineering, machine learning, validation, and visualization.
Project Success: Developed a Plant Disease Detection system with a remarkable 98% accuracy rate and a Heart Disease Detection system with an 89% accuracy rate.
Created a budget tracking dashboard for Publicis Groupe, enabling the comparison of calculated budgets with actual expenditures.
Led the development of "Vault," an interface for candidate background checks, and achieved a 98% accuracy rate in stock prediction and analysis. Additionally, established multiple databases with customized roles and referential integration to enhance data management.
Database Versatility: Proficient with diverse databases, including Oracle, SQL Server, MySQL, and PostgreSQL.
ETL and Data Warehousing: Experienced in designing, developing, documenting, and testing ETL jobs and mappings, including Data Stage, for data warehousing.
Python ETL Framework: Hands-on experience includes the creation of ETL frameworks with Python.
OLAP and OLTP Environments: Effective work in OLAP and OLTP environments, encompassing data transformations and analysis.
Cloud-Based Data Pipelines: Proficiency in building data ingestion, ETL, and data processing pipelines using cloud services.
Data Quality Governance: Establishment and execution of the Data Quality Governance Framework to ensure data suitability.
Parallel Job Design: Expertise includes designing parallel jobs with various stages.
Leadership and Collaboration: Demonstrated strong leadership and collaborative abilities through effective leadership of a team of 15 dedicated professionals to deliver innovative solutions.
Technical Skills:
Programming: Python, TensorFlow, YOLO5, NumPy, Pandas, LLM, SQL, C++, C, Java
Cloud Technologies: AWS, Google Cloud
Databases: Oracle, SQL Server, MySQL, PostgreSQL, Oracle Querying PL/SQL
Data Warehouses: Snowflake, Amazon Redshift
ETL/Reporting: Informatica, Tableau, PowerBi, Advanced Excel
Education:
Master’s in Data Analytics and Visualization from Yeshiva University, New York [ AUG 2022 - DEC 2023]
Master’s in Computer Application from Guru Gobind Singh Indraprastha University [ AUG 2014 - SEP 2017]
Bachelor’s in Computer Application from Punjab Technical University [ AUG 2010 - SEP 2013]
Certifications:
•AWS Cloud Practitioner
AWS Solution Architecture Associate
•Microsoft PowerBi
•Machine Learning
•PMP Project Management
Professional Experience:
Aug 2022 - Present
Katz School of Science and Health New York
Role: Student Employee (Intern Analyst)
Responsibilities:
•Research and evaluate academic and student data to identify trends and predict end-of-semester appointment totals as well as survey student post-graduation outcomes.
•Developed a dashboard for Analysis using the PowerBi and Tableau Tools. Check discrepancies in the data frame Using SQL command and build a report.
•This dashboard has significantly improved data accessibility and visualization, resulting in more informed decision-making within the organization.
•Through diligent data analysis, I have identified trends and successfully predicted end-of-semester appointment totals. This proactive approach has helped the institution efficiently allocate resources and meet student needs.
•My meticulous use of SQL commands to check data discrepancies has enhanced data quality, reduced the risk of errors, and ensured the reliability of analytical results.
•The created reports have become instrumental in communicating data insights to stakeholders, enabling them to make data-driven decisions. These reports have played a crucial role in shaping the institution's strategies and policies.
Yeshiva University New York. Aug 2022 - Present
Role: Teaching Assistant
Responsibilities:
•Assist in structuring data Management courses, resolving student inquiries, and providing course support.
•Through my efforts, I have enabled students to overcome hurdles and grasp intricate concepts, resulting in enhanced performance and a deeper understanding of structured data management principles.
Shevet Glaubach Center New York.
Role: Internship
Responsibilities:
Led the assessment of data for inconsistencies, providing invaluable insights into data quality. This effort resulted in data cleaning and updating activities, which, in turn, enhanced the integrity and reliability of Tableau by an impressive 90%.
Spearheaded the development of user-friendly visualizations that significantly improved data accessibility for team members. This not only made data more comprehensible but also facilitated data-driven decision-making.
Collaborated with the team to formulate strategies for collecting and reporting meaningful data points. These efforts streamlined data collection processes and empowered team members with the insights they needed for informed decision-making.
Designed and implemented a model for seamlessly integrating FDS (Financial Data System) data with other user data. This integration enabled a comprehensive analysis of students' career strategies and engagement by academic year, leading to valuable insights that informed strategic decisions and improved outcomes.
Client: Deloitte India
Role: Deputy Manager
Responsibilities:
Developed custom process KPIs and dashboards within Celonis, providing real-time insights to stakeholders and enabling data-driven decision-making.
Utilized Celonis to analyze and optimize complex business processes, resulting in a 20% reduction in operational costs and a 15% increase in process efficiency.
Implementation of multi-node clusters on AWS EC2, combined with ETL process optimization, results in enhanced data processing efficiency and quality.
Ensuring data security with user versioning in S3 buckets while possessing in-depth knowledge of Snowflake Database structures assures data integrity and accessibility.
Leveraging Python libraries for machine learning empowers the development of advanced algorithms for data-driven insights and predictions.
Created Data Integration templates and report creation using Tableau and Power BI, leading to accurate data analysis and actionable insights.
Performed DDL operations, database management, and precise data extraction and transformation to facilitate reliable data analysis, decision-making, and business success.
Client: Publicis Sapient India
Role: Media Analyst
Responsibilities:
Orchestrated end-to-end data pipelines, ETL, and ELT processes, resulting in streamlined data ingestion and transformation.
Demonstrated a track record of effective data mining, validation, and predictive modeling on large datasets, driving data-driven decisions.
Crafted REST APIs for enhanced data integration, improving overall system efficiency.
Leveraged PowerBi and SQL to analyze media program performance and campaign data, producing actionable insights that informed successful strategies and outcomes.
Built a dashboard that allowed Publicis Groupe for medical products, and pharmaceutical strategies to monitor and assess the differences between estimated budget amounts and actual spending, offering insightful financial data.
Client: AuthBridge Research Services Pvt India
Role: Senior Associate Analyst
Responsibilities:
Seamlessly integrated data from MySQL and AWS DB through proficient ETL processes, resulting in improved data accessibility and management.
Supervised the creation of "Vault," a candidate background check interface. To improve data management, many databases with unique roles and referential integration were also created.
Generated and validated SQL codes to produce accurate reports, facilitating data-driven decision-making.
Successfully oversaw ETL solutions and implemented process automation, leading to enhanced operational efficiency, and reduced manual workload.
Executed a seamless migration of on-premises databases to a MySQL data warehouse, resulting in improved data centralization and streamlined data management.
Conducted rigorous data validation using SQL Server Integration Services, ensuring data accuracy and reliability, which positively impacted data-driven insights.
Client: GlobalLogic Technologies Ltd (Client-Google) India
Role: Data Analyst
Responsibilities:
Successfully developed an end-to-end Data Warehouse on Google Cloud, optimizing data management and accessibility.
Developed and implemented Data Quality frameworks and automated operational processes, leading to a 30% increase in data accuracy and a 20% reduction in manual efforts.
Conducted thorough data analysis and validation, resulting in data-driven insights that informed strategic decisions.
Efficiently maintained databases and automated reports, reducing reporting times by 40% and improving data availability.
Collaborated with global teams to resolve issues and streamline operations, fostering a culture of cross-functional efficiency and problem-solving.
Sweety Saha
JUN 2023- Aug 2023
JUN 2022- Aug 2022
May 2021- Jun 2022
Jun 2019 - Apr 2021
Sep 2016 - May 2019