Post Job Free
Sign in

Data Analyst Analysis

Location:
Brooklyn, NY
Posted:
September 18, 2024

Contact this candidate

Resume:

Ja-Yuan Pendley

347-***-**** ***.***********@*****.*** http://www.linkedin.com/in/jayuan-pendley https://jayuan101.github.io https://public.tableau.com/app/profile/ja.yuan.pendley/vizzes Brooklyn, NY

SUMMARY

Data Analyst with extensive experience in data analysis, business analysis, and technical project management. Proven ability to leverage advanced analytical techniques and programming skills to drive operational efficiency and enhance decision-making. Expertise in utilizing Python, SQL, and various data visualization tools to extract actionable insights and present findings to stakeholders. Demonstrated success in implementing data-driven methodologies, automating ETL processes, and improving system performance through AI and machine learning strategies.

PROFESSIONAL EXPERIENCE

Data Analyst Outlier November 2023 – Present

Data Analysis and Insights:

Led data analysis efforts to identify anomalies and uncover valuable insights.

Conducted root cause analyses, statistical data analyses, and other analyses related to platform performance.

Designed and implemented new statistical analyses to address key business questions.

Data Extraction and Efficiency:

Implemented sophisticated data extraction methods using Python, resulting in a 35% increase in operational efficiency.

Led a major data transformation initiative, implementing AI strategies that improved data processing efficiency by 50%.

Collaboration and Communication:

Collaborated with cross-functional teams of data scientists, data engineers, and software engineers to deliver analyses aligned with business objectives.

Presented analyses to technical and non-technical stakeholders.

Wrote explanations of how data science can solve problems and evaluate various solution approaches.

Product and Performance Enhancement:

Implemented new data-driven methodologies to enhance product performance and capabilities, in coordination with product management and software development teams.

Monitored and maintained existing prediction, optimization, and simulation engines to ensure optimal performance.

Jr. Business Analyst Cyquent Remote September 2022 - November 2023

Executed detailed data analysis and cleaning: This refers to performing comprehensive examination and processing of data to identify trends, patterns, or issues. Cleaning the data means removing inaccuracies, correcting errors, and handling missing values to ensure high-quality data.

Gathering and Documenting Requirements: Jr. BAs work with stakeholders, clients, and business teams to gather and document the requirements for projects, systems, or processes. They assist in creating business requirements documents, user stories, and other documentation.

Data Analysis: Jr. BAs help analyze data to identify trends, opportunities for improvement, and business needs. They might use spreadsheets, databases, or other tools to support decision-making processes.

Process Mapping and Improvement: They may help in mapping out current business processes, identifying inefficiencies, and suggesting improvements. This can involve creating flowcharts, diagrams, and process models.

Stakeholder Communication: Jr. BAs assist in coordinating and facilitating communication between business teams, IT teams, and other stakeholders. They ensure that everyone is aligned with project goals and timelines.

Assisting in Solution Design: Jr. BAs contribute to brainstorming and proposing potential solutions to business problems. They may assist senior analysts in evaluating the feasibility of different options.

Reporting: They may be responsible for generating and maintaining reports that provide insights into project progress, business performance, and other key metrics.

TECHNICAL PROJECT EXPERIENCE

ats_scanner_keywords SQL Server, SSIS, Python, Banking, Finance, DBT, Mulesoft, Alation, Data Analysis, Cloud Computing, Business Analyst, Data Analyst, Process Management, Problem-Solving, Interdepartmental Coordination

Airbnb Data Analysis November 2022 November 2022

•Applied SQL and Python for data extraction, transformation, and loading (ETL) processes to analyze Airbnb data.

•Identified trends and patterns through data analysis, contributing to strategic decision-making.

•I used Tableau for data visualization, facilitating easy comprehension and usage in business discussions.

• App link: Airbnb Website

Song Recommendation System using Spotify’s API November 2022

•Performed correlation analysis, data visualization, and k mean clustering, to uncover patterns and insights within Airbnb’s open dataset.

•Visualized the analysis using Folium and Plotly.

•App link: Spotify Recommendation API

NYC Taxi and Limo Data Analyst November 2022

•Performed correlation analysis, data visualization, and k-mean clustering, to uncover patterns and insights within NYC Taxi and limo.

•Visualized the analysis using Seaborn.

•A/B Testing and Linear Regression

•App link: NYC Taxi and Limo Website

Web application using Kubernetes December 2023

Deployed a web application on AWS Elastic Kubernetes Service by creating a Docker container.

Performed cost-benefit analysis of using ECS (Elastic Container Service) vs EKS (Elastic Kubernetes Service).

Created a repository for the Docker containers in AWS Elastic Container Registry.

Predicting Stocks August 2024

Stock Data & Model Training: The app allows users to input a stock ticker, fetch historical stock data, preprocess it, and train a TensorFlow model with CNN and LSTM layers for time series forecasting.

News Feed Integration:Fetches and displays recent news articles related to the stock ticker using the NewsAPI, showing article titles, descriptions, and links.

Interactive Interface: Provides a user-friendly interface with input fields for stock symbols and date ranges, and buttons for model training, with visualizations of stock data, model training history, and news articles.

App link: Predicting Stocks

ETL Pipeline using Azure Data Factory, Databricks, and Snowflake February 2023

Created a complex ETL pipeline to extract data from Azure Blob Storage Container.

Imported a Databricks Notebook in Data Factory to perform the complex transformations and perform Machine.

Learning Tasks on the data.

Loaded the data into Snowflake Data Warehouse for further analysis.

Made chat bot using Open AI September 2024

Users can specify what they want to eat (breakfast, lunch, or dinner) in a text area.

Upon clicking a button, the app generates personalized food suggestions using OpenAI's GPT model.

Input is limited to under 1000 characters.

Displays a warning if the input exceeds the character limit.

Acts as a personal chef, providing customized meal recommendations based on user input.

App link: OpenAI chatbot

EDUCATION

Master of Science, Data Science New York Institute of Technology

Bachelor of Computer Systems Technology New York City College of Technology

CERTIFICATION

• Google Advanced Data Analytics is authorized by Google.

• AWS Cloud Technical Essentials authorized by Amazon Web Services

• Architecting Solutions on AWS authorized by Amazon Web Services

• Data Wrangling, Analysis, and AB Testing with SQL authorized by UC Davis

• Microsoft Azure Databricks for Data Engineering (DP-203) authorized Microsoft

Technical Skills

Programming Languages: Python, R, Java, PySpark, R, JavaScript, C++, SQL, PL/SQL, QlikView

Libraries and Frameworks: Django, Flask, sci-kit-learn, Xgboost, Keras, TensorFlow, PyTorch, Pandas, NumPy, SciPy, Kubernetes, Spark, Beautiful Soup,

AWS Cloud Services: AWS Lambda, Secrets Manager, IAM, AWS Athena, Redshift, AWS Elastic Beanstalk, AWS EKS, AWS EC2, AWS DynamoDB, AWS ECS, AWS S3, SQS, Event bridge, API Gateway, AWS CloudFormation

Azure Cloud Services: Azure Synapse, Cosmos DB, Azure Databricks, Azure AKS, Azure Blob Storage, Step Functions, Azure Data Factory (ADF), Azure Logic Apps, Azure Function Apps

Databases: Redshift, Snowflake, PostgreSQL, MySQL, Azure Synapse, SQL Server, MongoDB, DynamoDB, Aurora DB, Cloud Bigtable

SDLC Methodologies: Agile, TDD, Waterfall, SCRUM, Portfolio Management

Data Pipeline Orchestration: Docker, GitLab CI/CD, Airflow, Jenkins, Kubernetes, Quality assurance,

Data Visualization: Matplotlib, Looker, Tableau, PowerBI, Qlik Sense, Visio

Machine Learning Methods: A/B testing, Natural Language Processing (NLP), Classification, SVM, Regression, Prediction, Statistical Modeling, Recommendation System, Time Series Forecasting, Computer Vision, Transformers, Large Language Models (LLM)

ETL, ML and Big Data Tools: Machine Learning, Deep Learning, Databricks, ML Flow, MLlib, HBase, HDFS, Yarn, MapReduce, Hive, UAT, Data Cleaning, Data Reporting



Contact this candidate