Mehreen Ali Gillani
DATA SCIENTIST
Residential address: 888 Main Street. New York City
E-mail: *******.*********@*******.****.***
GitHub: https://github.com/mehreengillani
Results-driven and detail-oriented Data Science Master’s candidate with a robust foundation in statistical modeling, machine learning, and data visualization. Adept at transforming complex data into actionable insights using tools like Python, R, and SQL. Known for strong analytical thinking, effective communication, and a proactive approach to problem-solving. I am eager to apply academic knowledge and hands-on project experience to drive data-informed decision-making in dynamic business environments.
QUALIFICATION
Degree
INSTITUTION
DATE
CGPA
Master's in data
Science
City University of New York (CUNY), School of Professional Studies (SPS), NY
2025-2027
In process
MS (IT) Specialization in Networks
National University of Science and Technology (NUST), Islamabad, Pakistan
2010-2013
3.6
BS (IT)
Punjab University, Lahore, Pakistan
2005-2009
3.52
SKILLS
Languages & Core ML: Python, Scikit-learn, Pandas, NumPy, SQL, R
Deep Learning & NLP: PyTorch, TensorFlow
Visualization: Tableau, Matplotlib, Seaborn, Power BI
Databases & Tools: PostgreSQL, MySQL, Git, Docker, Jupyter
Methodologies: A/B Testing, Statistical Inference, Time Series Analysis
Data Science Projects:
Analyzed Netflix's catalog and user data to challenge core business assumptions on pricing and content success.
https://github.com/mehreengillani/DATA607_project3/blob/main/DATA607-Project3-Part2-Extended%20EDA_Visualizations_Thorough_Analysis-MASTER.rmd Key Finding 1: Discovered negligible correlation between subscription tier and user engagement, challenging the basis for tiered pricing models. Key Finding 2: Identified that high production budget explains less than 10% of rating variance, revealing content success relies more on genre and critical reception than spend. Process: Engineered the analysis pipeline in R (tidyverse, ggplot2) through data cleaning, feature engineering, and statistical testing on a dataset of 10,000+ titles.
Comparative Sentiment Analysis with Custom Lexicons: https://rpubs.com/Mehreen/1362288 Executed a comparative text mining analysis in R (tidytext) to evaluate the impact of lexicon choice on a novel product review dataset.
Nobel Prize Data Analysis & Pipeline Engineering Engineered a scalable R pipeline to parse and consolidate nested JSON from multiple APIs into a structured SQL database, enabling analysis of 120 years of laureate data https://rpubs.com/Mehreen/1364827
Airbnb Market Analysis for host strategy, price prediction. Analyzed 50,000+ listings using Python (Pandas, Seaborn, Scikit-learn) to identify key pricing drivers and amenity, Implemented and compared multiple regression models (Linear Regression, Random Forest, Gradient Boosting) to predict price https://github.com/mehreengillani/Data602/blob/main/AirBnB_price_prediction.ipynb
WORK EXPERIENCE
TeraData (Sep 2013-Oct 2013)
Internship in Application Development Dept.
Responsibilities were:
Bugs identification in TeraData web Portal
NUST School of Electrical Engineering and Computer Sciences (July 2011-Jan 2012)
Worked as Research Assistant
Responsibilities were:
Configuration of NOX, flowvisor, Wireless router
Practical implementation of openflow wireless network
NUST School of Electrical Engineering and Computer Sciences (April 2011-June 2011)
Worked as Research Assistant
Research topics:
How to plan capacity when mobility introduces temporary congestion in LTE-Advanced/4G networks?
How to perform traffic prioritization in LTE-Advanced?
Nokia Siemens Network (December 2009– August 2010)
Worked as Network Performance Engineer at Network Performance Management (NPM) Dept.
Responsibilities were:
Preparation of network analysis reports on a daily basis using NetAct
Customized hourly, busy hour and daily Cell, BSC and Network level reports to help solve problems in the network.
Identification of all Congested Cells, Cells with poor CSSR, cells with high Call Drop Rate in the network, and finding solutions to solve the problem
Performance analysis of newly integrated sites.
Creation of Performance Management Reports, Root Cause Analysis of
Packet switched, circuit switched KPI’s Trending & KPI degradation of Telenor network
Good knowledge of RF, Optimization and monitoring tools i.e. Optima, NetAct, MapInfo, TEMS.
Worked on Telenor swap project (Siemens BTS was swapped with Nokia Flexi EDGE BTS).
Undergraduate Projects:
"Development of Islamic Web Portal in PHP ", Jan 2009
"Web based project on ASP.NET", Nov 2008
"Development of Address Book using MVC and Struts Design Pattern", July 2008
"J2ME based game", Oct 2007
"Developed Online Airline Reservation System in J2EE", June 2007
"Project based on tree Data Structure", Dec 2006
Graduate Projects:
"Backup path management in Cognitive Radio Networks", Network Switching and Routing: CSE870
"Dynamic Capacity planning in LTE-A / 4G Networks", Wireless Networks: EE834
“Post Summarization of Micro-blogs”, master's Thesis
Publication:
Post Summarization of Microblogs of Sporting Events
https://dl.acm.org/doi/10.1145/3041021.3054146
Gillani, M.A.; Ali, A.; Ilyas, H.; Sultan, A.; "Multi-path traffic grooming in DOCS," High-Capacity Optical Networks and Enabling Technologies (HONET), 2011, vol., no., pp.71-75, 19-21 Dec. 2011