Sonti Subramanyam Prasad
Email: ***********.*******@*****.*** Phone: +1-940-***-****
SUMMARY
Data Analyst with around 2 years of experience in large-scale data gathering, database management, and statistical analysis using Python with the PyData Stack (NumPy, Pandas, Matplotlib). Proficient in creating and automating reports and dashboards using Tableau and Power BI, transforming raw data into actionable business insights. Demonstrated expertise in developing and managing automated data pipelines, ensuring data integrity, and improving processing efficiency. Adept at collaborating with cross-functional teams to drive data-driven decision-making and providing insights through advanced data analysis techniques. Proven ability to manage and optimize large datasets while ensuring high data quality.
WORK EXPERIENCE
Data Analyst
Kantar Sep 2020 – Dec 2021
Led large-scale data gathering efforts, ensuring accurate and timely data collection across multiple datasets for strategic business decision-making.
Performed data analysis and statistical programming using Python with the PyData Stack (NumPy, Pandas, Matplotlib) to identify trends and generate actionable insights.
Developed and implemented automated data pipelines using Python, reducing manual data processing by 30% and improving overall data handling efficiency.
Conducted advanced statistical analysis and data modeling using Python's PyData Stack, providing key insights that informed business strategies and operational decisions.
Managed and optimized large datasets using Oracle DB, applying rigorous data cleaning, and preprocessing techniques, resulting in a 20% improvement in data quality and reliability.
Designed and built custom dashboards using Tableau and Power BI, providing real-time reporting and visualization of critical metrics for stakeholders, enhancing data-driven decision-making.
EDUCATION
Master of Science in Computer Science
University of North Texas Jan 2022 – Dec 2023
Focused on Data Science, Machine Learning, and Advanced Database Systems.
Relevant coursework: Big Data Analytics, Statistical Modeling, Machine Learning, Data Visualization.
Bachelor of Engineering in Computer Science
Anurag University Aug 2016 – Aug 2020
Specialized in Database Management, Data Structures, and Algorithms.
Completed a capstone project on Predictive Analytics using Machine Learning, improving prediction accuracy by 15%.
ACADEMIC PROJECTS
Sentiment Analysis Using the Binary Naive Bayes Classifier Sep 2023
Developed a sentiment analysis model in Python, achieving 85% accuracy in classifying customer reviews.
Optimized dataset preparation and model algorithms for enhanced performance and insights.
Spam Detector Language Model Using Logistic Regression Nov 2023
Created a high-accuracy spam detection model using Logistic Regression, achieving 98% accuracy in email classification.
Improved model reliability through thorough data preprocessing and feature engineering.
ETL-Homeless Shelter Management System May 2023
Conducted ETL operations on large CSV datasets, structuring them into a MySQL database.
Analyzed shelter resident trends, identifying key factors contributing to homelessness and developing data-driven solutions.
Utilized Python for advanced data analysis and visualization, generating actionable insights for non-profits.
SKILLS
Programming Languages: Python (NumPy, Pandas, Matplotlib, Scikit-learn), SQL
Data Technologies: MySQL, T-SQL, Oracle DB
Business Intelligence Tools: Tableau, Power BI, Microsoft Excel (Advanced Functions, Pivot Tables, Power Query)
Data Analysis & Processing: Large-scale Data Gathering, Data Scrubbing, Data Modeling, Statistical Analysis, Data Engineering
Automation: Development of Automated Data Pipelines using Python
Database Management: Database Administration, Data Quality Management, Data Integrity
Visualization: Data Visualization and Reporting with Tableau, Power BI, and Matplotlib
Cloud Technologies: AWS, Azure, VMware