
Data Analyst SQL Server

Location: Hoffman, NJ, 08831
Posted: December 18, 2024


Resume:

RAJESH P

Location: NJ Phone: 216-***-**** Email: ********************@*****.***

SUMMARY

Data Analyst with a Master's degree in Computer Information Science and over 3 years of experience handling structured and unstructured data. Proficient in Python libraries such as NumPy, Pandas, Seaborn, and Matplotlib for preparing, analyzing, and visualizing data. Skilled in SQL for identifying trends and analyzing temporal data. Experienced in ETL processes with SSIS and managing databases like MS SQL Server, PostgreSQL, MySQL, and MongoDB. Strong at creating dashboards and reports with tools like Power BI, Tableau, and SSRS. Demonstrated success in improving workflows, building predictive models, and delivering actionable insights to boost operational efficiency.

TECHNICAL SKILLS

Programming Languages: Python, SQL, R

Libraries: Pandas, NumPy, Matplotlib, Seaborn, MapReduce

Database: MySQL, MongoDB, PostgreSQL, MS SQL Server

Methodologies: Agile (SCRUM)

Analytical Skills: Data Mining, Cleansing, Statistical Analysis, Visualization, Text Mining, ETL, SSIS/SSRS

Visualization Tools: Tableau, Power BI, MS Excel

Cloud Technologies: AWS (EC2, S3, Lambda, QuickSight), Snowflake

EDUCATION

Master of Science in Computer Information Science 08/2023

Cleveland State University, Ohio

Bachelor’s in Computer Science & Engineering 05/2021

JNTUK, India

CERTIFICATIONS

Machine Learning with Python

PROFESSIONAL EXPERIENCE

Data Analyst Jan 2024 - Current

Michaels, TX, USA

Supported the development of a Customer Feedback Analysis System, transforming unstructured survey responses into actionable insights using Python and SQL.

Developed automated scripts with the Python-MySQL connector to extract, analyze, and validate customer satisfaction data, reducing manual effort by 40%.

Designed robust ETL pipelines using Azure Data Factory to extract data from cloud platforms, clean it, and load it into SQL Server for analysis.

Leveraged Pandas and NumPy libraries for data wrangling tasks, ensuring efficient data cleaning and validation for business reporting.

Built Power BI dashboards to monitor customer churn rates and track user engagement, integrating SQL Server and Excel as data sources for real-time analytics.

Created DAX calculations to develop dynamic measures, enabling detailed insights into sales metrics, revenue growth, and product performance.

Optimized complex SQL queries for analyzing transactional data, utilizing joins, CTEs, and aggregation functions to reduce query execution time.

Deployed data pre-processing techniques such as scaling, transformation, and encoding using Scikit-learn to prepare datasets for advanced analytics.

Utilized Azure Blob Storage for centralized data storage and retrieval, enabling scalable handling of survey and log files.

Applied statistical models to identify patterns, correlations, and key drivers of customer satisfaction, helping product teams prioritize feature enhancements.

Automated repetitive reporting tasks in Excel using VBA macros, pivot tables, and advanced formulas, improving productivity.

Assisted in implementing Power BI row-level security for user-specific access to reports and visualizations.

Followed Agile methodologies, participating in sprint planning and review sessions to ensure timely delivery of insights and reports.

Data Engineer Aug 2023 - Dec 2023

Deloitte, CA, USA

Designed and implemented robust data pipelines using Python and SQL to analyze large-scale datasets, driving key business decisions and improving data accessibility.

Extracted and transformed data from cloud-based data warehouses using advanced SQL queries, enabling accurate reporting and operational insights.

Optimized and maintained PostgreSQL and MySQL databases to ensure high availability, performance, and data reliability across healthcare systems.

Built interactive Tableau dashboards incorporating advanced features like parameter controls, blended data sources, and drill-down capabilities to visualize patient demographics and healthcare trends effectively.

Utilized AWS Glue and PySpark to orchestrate ETL workflows, automating the ingestion, transformation, and loading of unstructured data into BigQuery for further analysis.

Streamlined data processing workflows using Python libraries such as Pandas and NumPy to reduce execution time for critical operations like data cleaning and validation.

Adopted Agile principles by participating in sprint planning, retrospectives, and daily stand-ups to ensure effective collaboration and on-time project delivery.

Applied Matplotlib and Seaborn for creating clear, insightful charts to showcase patterns, trends, and KPIs in healthcare performance metrics.

Collaborated with cross-functional teams to implement AWS S3 and SNS for secure data storage and event notifications, automating system alerts and monitoring.

Software Engineer Trainee Oct 2019 - Dec 2021

VV Global Solutions

Provided technical support for a Supply Chain Management (SCM) system, troubleshooting bugs, resolving data inconsistencies, and assisting in production issue fixes.

Collaborated with senior developers to analyze logs, debug Python scripts, and identify the root cause of failures in backend systems.

Wrote SQL queries to support data extraction and resolve discrepancies within the PostgreSQL and MySQL databases.

Assisted in monitoring and maintaining ETL pipelines using SQL Server Integration Services (SSIS), ensuring smooth data flow between systems.

Supported the integration of AWS services like S3 for data storage and Lambda to automate error notifications and event triggers.

Contributed to the development of JIRA dashboards for tracking production issues, response times, and resolution status for internal teams.

Created reports in Power BI to visualize system performance and identify trends in supply chain operations, reducing downtime by 15%.

Actively participated in daily stand-ups and knowledge-sharing sessions to improve understanding of system architecture and workflows.

Updated Python-based data validation scripts to check file integrity and alert the team about missing records, improving data accuracy.

Assisted in testing and validating RESTful APIs for system integration, documenting test results and bug reports for development teams.

Assisted in code deployments under supervision using Git, ensuring smooth releases without disruptions to production.

Used Jupyter Notebooks to validate small Python fixes for data extraction and basic analysis tasks.

ACADEMIC PROJECTS

Data Engineer Customer Purchase Behavior Analysis using Python and Tableau

Developed a data pipeline to analyze customer purchase behavior using Python for cleaning and transforming raw sales data.

Extracted data from multiple sources (CSV, APIs, and Excel files), then cleaned and formatted the datasets using Pandas and NumPy for analysis.

Built interactive dashboards in Tableau to visualize customer segmentation, purchase frequency, and revenue trends.

A Novel Approach for Optical Character Recognition (OCR) of Handwritten Telugu Alphabets using Convolutional Neural Networks

Developed a Convolutional Neural Network (CNN) model to accurately recognize handwritten Telugu alphabets, addressing the challenges posed by their complex and overlapping structures.

Created a comprehensive dataset of handwritten Telugu characters to improve model training, ensuring it effectively handles visual similarities between alphabets.

Achieved an accuracy of 80% to 95%, significantly improving OCR performance.


