Post Job Free
Sign in

Data Analyst Power Bi

Location:
Richardson, TX, 75080
Posted:
November 21, 2024

Contact this candidate

Resume:

Aditya Jain

Data Analyst

***************@*****.*** 469-***-**** Dallas, TX Tableau GitHub LinkedIn SUMMARY

Data Analyst with over 3 years of experience in analyzing, interpreting, and transforming complex data into actionable insights. Proficient in using Python, R, and SQL for data manipulation, along with advanced analytics through popular libraries like NumPy, Pandas, and TensorFlow. Expertise in leveraging data visualization tools such as Tableau, Power-BI, and Excel to create impactful dashboards and reports, aiding in strategic decision-making. Skilled in cloud technologies like AWS and Azure, database management (MySQL, SQL Server, PostgreSQL), and collaborating in Agile teams. Adept at utilizing version control tools like Git and GitHub for collaborative development. Proven ability to streamline processes and deliver data-driven solutions to optimize business performance. WORK EXPERIENCE

Data Analyst JP Morgan Chase & Co., TX Mar 2024 – Present

● Implement Agile/Scrum methodologies to manage project timelines, ensuring the successful delivery of data-driven solutions that align with organizational goals.

● Design and develop interactive dashboards and reports for senior management by leveraging Tableau, Power BI (Power Query, DAX), and Excel (Pivot Tables, VBA, VLOOKUP), driving data visualization and business intelligence.

● Conduct in-depth analysis of large datasets using Python (SciPy, PySpark) and R to generate actionable insights that support strategic decision-making.

● Optimize data processing by developing and automating custom scripts in PyCharm and Jupyter Notebook, reducing data preparation time by 30% through automation of data cleansing and transformation processes.

● Build and manage robust data pipelines using SQL to streamline data extraction, transformation, and loading (ETL) processes for mission-critical reporting systems.

● Automate data workflows using Python and version control tools like Git, enhancing data accuracy and eliminating manual data handling tasks, leading to increased operational efficiency. Jr.Data Analyst Spectrum Webapps, Ahmedabad - INDIA Mar 2019 – Jul 2022

● Led data analysis in Agile and Scrum environments, collaborating with teams to ensure insights aligned with business goals.

● Analysed datasets using Python, R, and SQL, identifying trends that improved business processes and informed decision- making strategies.

● Designed and maintained interactive dashboards using Power BI to delivering data insights to drive key business initiatives.

● Transformed raw data using Pandas, NumPy, and Matplotlib for efficient cleaning and visualization, enhancing data accuracy and usability.

● Automated routine reporting tasks with VBA in Excel, cutting manual labour by 20% and improving reporting speed.

● Executed ETL processes using SQL databases like MySQL, PostgreSQL ensuring data integrity and seamless system integration.

● Enhanced cloud-based data pipelines using AWS services (e.g., S3, RDS, Glue), improving data accessibility and workflow efficiency.

● Streamlined code versioning with GitHub, ensuring seamless collaboration and code management across projects. SKILLS

Methodologies : SDLC (Agile, Scrum)

Languages : Python, R, SQL

IDE’s : Visual Studio Code, PyCharm, Jupyter Notebook Python Packages : NumPy, Pandas, Matplotlib, SciPy, ggplot2, Scikit learn, PySpark, Keras, TensorFlow Visualization Tools : Tableau, Power BI (Power Query, DAX), Excel (Pivot Table, Pivot Chart, VLOOKUP,XLOOKUP, VBA, Conditional Formatting),

Database : MySQL, SQL Server, Mongo DB, PostgreSQL Cloud Technologies : AWS (S3, RDS, Glue), AZURE Fundamentals Version Control Tools : Git, GitHub

Operating System : Windows, Linux

PROJECTS

Credit Card Churn Analysis (Python, Tableau, Machine Learning)

● Cleaned data and conducted feature engineering. Categorical variables were encoded according to 3 levels.

● Used logistic regression, clustering, decision tree, neural network, and gradient boosting model pipeline with hyperparameter optima, classification and analyzed the dataset of credit card customers. Machine Learning (ML) model were built to predict which customers will churn and achieved a ROC AUC score of 0.993 in the final model. COVID-19 and Airbnb Data (SQL, Tableau, ETL, SQL Server, Microsoft SQL Management Studio, Excel)

● Analyzed COVID-19 data, resolving data type issues and filtering data to enhance accuracy and relevance.

● Created a SQL Server database, efficiently managing and querying large datasets.

● Designed an interactive Tableau dashboard to display key metrics and trends, like vaccination impact, and pricing.

● Joined multiple data sources in Tableau to present a holistic view of the data, using advanced visualization techniques.

● Showcased insights through a drill-down dashboard, allowing detailed exploration of data by continent and country. Bike Sale Analysis (MS Excel, Pivot Table, XLOOKUP, VLOOKUP)

● Cleaned and transformed data by addressing inconsistencies and used Excel to create pivot tables to summarize data on marital status, income, and bike purchase status. Developed an interactive dashboard visualizing purchase behavior.

● Provided insights into how factors influence bike purchasing decisions, aiding in targeted marketing strategies. Data Professional Survey Analysis (Power BI, Power Query, DAX)

● Created a Power BI dashboard to analyze survey data. Transformed and processed data within Power BI using DAX and Power Query. Designed interactive report displaying key trends relevant to the data professional field. Amazon Web Scraper Project (Python, Beautiful Soup, API)

● Developed a web scraper using Python to extract product data from Amazon, including prices and ratings. Utilized library Beautiful Soup to efficiently perform data scraping and parsing. Processed and cleaned the scraped data.

● Implemented automated scripts to collect and update data regularly, for real-time accuracy.

● Created visualizations and reports to analyze pricing trends and provide actionable insights for market research. Movie Correlation Project (Python, Pandas, Matplotlib, Seaborn, Jupyter Notebook)

● Conducted in-depth data cleaning and preparation by importing and merging datasets containing movie budgets, gross revenues, and features like company, country, and genre.

● Converted categorical data to numerical formats using label encoding and one-hot encoding, enhancing dataset readiness for quantitative analysis.

● Performed exploratory data analysis (EDA) by creating scatter plots and applying visual enhancements, improving the readability of key relationships such as budget vs. revenue.

● Applied correlation analysis using Pearson, Kendall, and Spearman methods, identifying strong positive relationships, such as a 0.71 Pearson correlation between budget and gross revenue. Automatic Crypto Data Extraction Project (Python, Pandas, Seaborn, Matplotlib)

● Extracted and processed real-time cryptocurrency data from an API using Python, focusing on metrics such as price changes over various intervals (1 hour, 24 hours, 7 days, etc.).

● Utilized Pandas for data grouping and aggregation, applying df.groupby to compute the average percentage changes for different cryptocurrencies, allowing for deeper insights into market fluctuations.

● Applied data transformation techniques such as stacking and pivoting (df.stack and df.pivot, reshaping datasets for better usability in analysis and visualization.

Airbnb Pricing Analysis Project (Tableau, Excel)

● Developed an interactive Tableau dashboard to analyze Airbnb pricing trends and provide insights for potential hosts on optimizing revenue and understanding market competition.

● Conducted zip code pricing analysis by visualizing average Airbnb prices using heat maps for strategic pricing decisions.

● Performed competition analysis by assessing the distribution of listings by bedroom count, helping identify areas with higher or lower competition for better market positioning.

Internship, Big Data, Cloud, Apache, Hive, Hadoop, Spark, Sqoop, Impala, Google Cloud, Machine learning, Time-to-event analysis, Cancer therapy studies, Progression-free survival (PFS), Overall survival (OS), Regression models, R shiny app, Data Science, Statistics, Python, Pandas, Pyspark, AI solutions, Statistical modeling, Data insights, SQL, Large datasets, Data manipulation, Scikit-learn, TensorFlow, Keras, Git, Data analysis, Reporting, Data visualizations, Dashboards, Data storytelling, Charts, Graphs, Python scripting, Machine Learning, Data Science Pipeline (Cleaning, Wrangling, Modeling), Hypothesis Testing, Business Intelligence, ETL, Supervised and Unsupervised Learning, SciPy, Data Mining, Random Forest, Predictive modeling, Statistical testing, Quantitative decision-making support, Data engineering, Data processing, Data modeling, Distributed data systems, Scala, Grant Thornton, Golang, Shell Scripting, JavaScript, Full-stack, React, AngularJS, NodeJS, Systems engineering, Data lifecycle, Normalization, Hadoop, Hive, Spark, Streaming, Data APIs, GraphQL, Database systems, NoSQL, Organizational skills, Analytical skills, Written communication skills, Data Science, Business Data Analyst, data analysis, data visualizations, dashboards, data storytelling, charts, graphs, spreadsheet, communication skills, analytical skills, written skills, presentation skills, Six Sigma, White Belt, Advance Statistics for Data Science, Database Foundation for Data Science, Business Analytics with R, Data Visualization, Applied Econometrics, Big Data, BI tools, mathematics, business analysis, merchandising, competitive intelligence, attention to detail. EDUCATION

Master of Science in Business Analytics The University of Texas at Dallas - Richardson, TX Bachelor of Commerce (B.Com.), Accountancy & Finance University of Rajasthan - Jaipur, RJ



Contact this candidate