Srijani Roy Choudhury
Santa Clara, CA ***** 469-***-**** *******.************@*****.*** linkedin.com/in/srijani-roy-choudhury-4a5832114
Data Analyst
Enthusiastic, data-driven professional with 6+ years of overall professional experience, including 3+ years of data engineering and analysis, and hands-on experience in data mining, modeling, and interpretation. Specialized in advanced SQL, Python, and Tableau. Expertise includes conducting research, mining data, cleaning data and verifying its integrity, creating data visualizations, developing automated data anomaly detectors, and building APIs and algorithms. Superior communication, innovative presentation, and analytical skills.
Areas of Expertise
Data Engineering Pipelines
Program Management
Data Architecture frameworks
Statistical Analysis
Problem Solving
Attention to Detail
Technical Documentation
Innovative Data Visualization
Efficient Team Player
SKILLS
Domain Knowledge: Data Engineering Pipelines, Data Analysis, Data Architecture, Databases, Numerical/Statistical Analysis, System Design, Machine Learning, Data Structures, AWS Cloud, TensorFlow, Spark, Product Management, Pattern Recognition
Technology & Tools: Python 3.0, SQL, NoSQL (MongoDB), Anaconda 1.9 (Jupyter), PyCharm, Neo4j 3.5, JMP 15, MS Excel, Google Sheets, R, HIPAA, CCPA, Snowflake
Visualization & Prototyping Skills: Tableau, Canva, ProtoIO, XLOOKUP, Data Engineering, Agile, Ris
EXPERIENCE HIGHLIGHTS
Ascend Technology Inc, Sunnyvale, CA June 2020 – Present
Data Analyst
Project: Covid – Information Services & Prevention
The project entails collation and dissemination of public data via Pandemic – Information
Write queries to compute statistical and technical data quality metrics such as mean, median, variance, and null counts, using Python (NumPy, Pandas, SciPy, Matplotlib, scikit-learn).
Write SQL queries to subset and anonymize data as needed.
Created a Tableau dashboard to present real-time insights into Covid spread and contact tracing.
Ongoing optimization for better performance.
University of San Francisco Nov 2019 – March 2020
Data Analyst, Association of Information Systems
Updated databases daily according to data storage requirements.
Wrote complex SQL queries to gather student information such as first name, last name, email, course registration, and primary address.
Cleaned, explored, and prepared data using Python libraries such as NumPy, Pandas, SciPy, Matplotlib, and scikit-learn to make it fit for training and analysis.
Created interactive dashboards, histograms, and pivot tables in Tableau and spreadsheets to analyze trends in the school's admission rate.
Thermo Fisher Scientific, Fremont, CA June 2018 – June 2019
Data Quality Analyst, Labeling
Gained a 360-degree view of packaging, labeling, and the overall clinical/non-clinical data architecture, in compliance with Quality, by collaborating with teams from R&D to Manufacturing and mapping with data flow diagrams (DFDs) how it affected final labeling changes on work orders.
Used SQL extensively to retrieve information such as BOM, kit number, product ID, order ID, hazard code, hazard ID, SDS number, and the address associated with a particular kit or bulk label, for processing change orders in Agile PLM and MasterControl (electronic document management systems).
Wrote, analyzed, and redlined design control documents, Safety Data Sheets, SOPs, validation documents, and manufacturing and testing documents per GHS to update labels with the correct hazardous chemicals, hazard codes, hazard signs, symbols, and precautionary statements, in compliance with GDPR/CCPA, the FDA's Quality System Regulation (QSR), and HIPAA.
Presented meaningful insights on labeling changes in Tableau.
Apple Inc, Cupertino, CA Jan 2018 – March 2018
Siri Grading Analyst, Siri
Gained insight into machine learning/AI concepts by analyzing, parsing, and categorizing Siri responses with NLP.
Analyzed and parsed Siri's responses by breaking each sentence into subject and predicate and categorizing verbs, adverbs, and other parts of speech with NSLinguisticTagger.
Translated and ranked Siri audio responses to improve Siri's learning curve.
Created interactive Tableau dashboards to present data to management.
Dell R&D, Bengaluru, India Jan 2015 – Dec 2015
Information Technology Analyst
Facilitated the development of roadmaps for several Dell products from start to completion by defining scope and conducting benchmarking and market analysis.
Elicited stakeholder requirements by collaborating with cross-functional teams and querying MySQL for information on a product line.
Applied UML modeling (activity, use case, and domain class diagrams) to describe the system's functional and non-functional requirements.
Created storyboards for various Dell products during each design phase to address stakeholders' concerns and aid visualization.
Technamic Solution Group, Kolkata, India Aug 2010 – Nov 2014
Network Engineer
Gained insight into network analysis, feasibility, and post-sales operations through network layout planning and design.
Ran root cause analysis (RCA) on the network implementation for the Unified Ticketing Network of Indian Railways (E. Railway).
Troubleshot TCP/IP, EIGRP, and OSPF on Cisco routers for the Freight Operations Information System (FOIS).
Ran root cause analysis on the network to reduce troubleshooting ticket time.
Created, updated, and queried (SQL) Microsoft Access databases holding information on Layer 2 and 3 switches, routers, modems, etc. (part ID, spec, order ID, BOM), in collaboration with the Sales teams.
RELEVANT COURSES & RESEARCH PROJECTS
Data Science and Data Warehousing: Hands-on data mining and analysis of the Titanic dataset, predicting passenger survival rate based on sex, class (the ship's 3 tiers), age, and family, using Jupyter Notebooks and the statistical packages NumPy, Pandas, SciPy, Matplotlib, and scikit-learn.
Coding for Analytics: Hands-on data analysis of the Spotify '17 dataset to answer critical business questions by identifying key patterns and predicting "The top 5 artists", using Spyder, Google Colaboratory, and Python libraries such as Pandas and NumPy.
Data Systems: Created a SQL database listing all major credit cards tailored to travel, hotel, flight, etc. with Microsoft Workbench and implemented CRUD operations on it. Gained exposure to NoSQL document databases such as MongoDB.
Social Media as a Tool: Designed a graph database with Neo4j for Social Media Profiling for Psychological Analysis with NLP (Natural Language Processing). Scikit,
Business Analytics: Predicted the sales, profit margin, and trends of Luna Farm by creating pivot tables and dashboards with Tableau. Used JMP for histograms and for multivariate and outlier analysis on the same project.
Data Architecture and Management: Analyzed the non-clinical data architecture at Thermo Fisher Scientific using fishbone analysis with regard to data quality, security, and governance, in compliance with TOGAF and GDPR.
EDUCATION
Master of Science in Information Systems, GPA: 3.88/4.00 University of San Francisco, San Francisco, CA (2020)
Master of Science in Applied Electronics and Instrumentation, GPA: 3.77/4.00 WBUT, Kolkata, India (2013)