Raveena Sankasani
Data Analyst
*******@************.***
Experienced Data Analyst with 4 years of expertise in data and statistical analysis, data mining, and data wrangling, specializing in maintaining data pipelines, data warehousing, and visualizations.
Optimized data processing workflows by migrating data writing operations to PySpark, reducing processing times.
Proficient in Python, R, and SQL for scripting, and experienced with data analysis packages including NumPy, Pandas, and SciPy and with visualization through ggplot2 and Matplotlib.
Hands-on experience with cloud technologies including AWS, Azure, and Snowflake, and with ETL tools such as Hadoop, SSIS, and Talend for efficient data integration.
Performed data visualization using Tableau and Power BI, complemented by a solid understanding of modeling techniques such as regression analysis and data flow diagrams.
KEY SKILLS
Methodology: SDLC, Agile, Scrum, Kanban, Waterfall
Language: SQL, Python, R
Packages: NumPy, Pandas, Matplotlib, Seaborn, SciPy, Scikit-Learn, TensorFlow, ggplot2
Machine Learning: Supervised & Unsupervised Learning, Reinforcement Learning, Time Series Analysis
Cloud Technology: AWS, Azure, Snowflake
Databases: MS SQL Server, MySQL, Oracle
EDA: Data Warehousing, Data Modelling, Data Mining, Data Manipulation, Statistical Analysis, Visualization
ETL & Other Tools: Hadoop, SSIS, Talend, NiFi, ADLS, Databricks, SAP, SAS, Salesforce, Google Analytics, Tableau, Power BI, MS Project, MS Access
Modeling Techniques: Classification Models, Data Flow Diagrams, ER-Diagrams, Root Cause Analysis, Regression Analysis, Power Query
EXPERIENCE
Data Analyst Jan 2024 – Present
State Street NY, USA
Led the migration of data analysis processes, achieving a 40% increase in data processing speed and delivering actionable insights to upper management for informed strategic decision-making in financial operations.
Automated routine data extraction and transformation tasks with Python, reducing manual processing time by 50% and increasing productivity.
Leveraged Informatica for comprehensive data validation, eliminating data duplication and implementing robust data ingestion practices, which mitigated a 20% revenue loss per transaction and increased revenue by 8% per transaction.
Transitioned over 15 financial products and automated workflows by integrating AWS EMR, eliminating SQL Server bottlenecks and enhancing overall system performance for banking applications.
Streamlined the ETL process by merging raw financial data from 5 sources using advanced SQL techniques, resulting in a 35% improvement in processing time for financial reporting.
Developed SQL triggers for Transaction Tracker, capturing data at 15-minute intervals to generate insights on transaction fluctuations and customer payment preferences, leading to accurate transaction predictions and an 8% revenue boost.
Designed and developed enterprise-wide operational reports for senior management using Power BI, and generated weekly and monthly reports using MS Excel features (charts, graphs, pivot tables).
Adopted Agile to facilitate collaboration across departments, migrating and documenting 10+ deployed SQL processes to PySpark in EMR, improving workflows, and reducing redundancy in data analysis processes.
Data Analyst Jan 2020 – Dec 2022
SeaGate India
Leveraged Python libraries such as NumPy and Pandas for data cleansing, pre-processing, and analysis of healthcare and pharmacy data, resulting in a 20% improvement in data quality and processing speed.
Facilitated cross-functional collaboration by adopting an Agile framework, improving team communication and enhancing the reliability of data pipelines, resulting in a significant increase in data accuracy.
Conducted in-depth exploratory data analysis using SQL to derive actionable insights from complex datasets, leading to measurable improvements in marketing effectiveness and customer engagement metrics.
Managed and analyzed healthcare data using MySQL, implementing data integrity checks and validation processes that improved the efficiency of patient record retrieval and overall data reliability.
Optimized Power BI reports by developing efficient data models and leveraging DAX for advanced calculations, reducing report loading times and improving end-user experience.
Created and maintained interactive dashboards in Power BI, enabling real-time data visualization and driving a 25% increase in data-driven decision-making across departments.
Implemented ETL pipelines using Azure Data Factory, with error handling mechanisms and automated data quality checks to identify and resolve issues during the ETL workflow.
Managed ETL operations for the extraction of big data from 18 social service systems, advancing assessments in mental health, electronic health records (EHR), and community programs, while optimizing data governance to save $12,000 annually.
Delivered strategic insights through data mining and statistical analysis, informing decision-making processes and supporting key business initiatives in healthcare operations.
EDUCATION
Master's in Computer Information Systems Jul 2024
New England College, Henniker, NH, USA