Sampreeth Chindam
Data Analyst Software Developer
Address Orlando Tampa
Phone 331-***-****
E-mail *********.*****@*****.***
Experienced IT professional with over three years of expertise in Cloud Technologies and Data Science, holding a master's degree in Data Analytics Engineering. Proficient in programming languages such as Python, Java, and SQL, with a good understanding of functional programming. Hands-on experience with various AWS services, including Amazon EC2, Amazon S3, and IAM roles. Skilled in predictive modeling, statistical analysis, and applying machine learning models. Strong experience in data cleaning, reshaping, building ETL pipelines, and generating segmented subsets using Databricks, NumPy, Pandas, and PySpark. Microsoft Certified Azure Administrator Associate (AZ-100) professional with expertise in DevOps methodologies, Jenkins, Docker, and Splunk.
Technical Skills
Statistical Analysis, Data Visualization, Data Modeling, Data Mining, Ad-hoc Data Analysis, Scalable Automation, Data Integration, Data Validation, Data Warehousing, Interactive Visualizations, ETL pipeline, Machine Learning, Cloud Computing, Data Structures and Algorithms, and Object-Oriented Programming.
Tools and Technologies
Java, Python, R programming, SQL (MySQL, PostgreSQL), C programming, JavaScript, HTML/CSS.
AWS EC2, S3, Lambda, SageMaker, Microsoft Azure
Databricks, Snowflake, PySpark, NumPy, Jupyter Notebook, Hadoop, Hive, Matplotlib, scikit-learn, Anaconda, PyCharm, GitHub, Splunk, Docker, Tableau, Power BI, Apache Spark, QuickSight, Salesforce Marketing Cloud
Work History
2023-02 - Current
Data Analyst
Sea Coast Bank
Built ETL pipelines for marketing and servicing campaigns for US Partnership Cards using Databricks, PySpark, and Salesforce Marketing Cloud.
Conducted A/B tests to increase customer engagement by analyzing business requirements, performing data segmentation, integrating customer data into emails, enforcing compliance approvals, and analyzing performance using Databricks, Snowflake, PySpark, and AWS S3.
Identified 1 million inactive accounts, developed a campaign to close them, and reclaimed 2 billion USD in inactive exposure by collaborating with cross-functional teams.
Simplified reporting by creating aggregated data sets and developed Tableau dashboards to visualize key performance metrics, enabling leadership to make informed business decisions.
Ensured data integrity and correctness by validating metrics and end-user reports.
Developed and executed business reports and performed ad-hoc data analysis for performance monitoring.
Created Databricks scripts to extract data from Snowflake tables and upload files to one lake S3 buckets using PySpark and Python
Manipulated data and built new features using Python and PySpark.
Trained new team members on Fractal (in-house tool), QA testing, A/B testing, and Databricks scripts.
2019-06 - 2021-08
Data Scientist
Thrive Digital Health LLP, Hyderabad, India
Collaborated with healthcare analytics team to develop predictive models using regression to analyze total charges incurred by hospitals and length of stay for patients diagnosed with lung cancer and mental illness, using Python, AWS SageMaker, EC2, and S3
Conducted detailed data analysis using SQL and Python to determine data structure, content, and quality.
Utilized SPSS to analyze, load, and convert large ASCII files into CSV format and used the DataWig library to impute missing values for categorical and continuous data.
Designed dashboards using ggplot, Matplotlib, Power BI, and Tableau to analyze important features and model performance and to present findings from the analysis.
2018-10 - 2019-05
Software Developer
Cliff.AI, Hyderabad
Containerized Drupal web applications and implemented CI/CD pipelines in Jenkins.
Secured and troubleshot Azure Web App for Containers.
Created SharePoint Site Collections, Site Pages, Lists, and Nintex Forms, and utilized REST APIs to retrieve data from SharePoint lists and display JSON data in Site Pages
Automated data entries using Power Apps
Monitored multiple applications using Microsoft Azure Cloud Services (PaaS, IaaS), Application Insights, and Splunk
Provided initial assessment and possible workarounds for production issues
Collaborated with application developers to guide the development and implementation of Cloud applications, systems, and processes using DevOps methodologies.
Education
2021-01 - 2023-05
Master's: Data Analytics Engineering
Eastern Illinois University – Charleston, IL
GPA: 3.87
2015-06 - 201-05
Bachelor's: Computer Science
Jawaharlal Nehru Technological University - Hyderabad
GPA: 3.54
Projects
Fake News Classification on Twitter (Python, Flume, N-gram analysis, Decision Tree ML) Nov 2022
•Trained a decision tree machine learning model to classify tweets as real or fake based on the extracted features.
•Collected live-stream data from Twitter using Flume and used n-gram analysis to perform feature extraction.
•Achieved 86% accuracy, 86.6% precision, 91.9% recall, 89% F1 score.
Airbnb Price Predictions (Tableau, Python) May 2022
•Performed EDA to find influential variables and developed a forecasting tool and ML model that predicted property prices with 85% accuracy.
•Used Seaborn and Matplotlib for data set visualizations, Pandas and NumPy for data cleaning, and PySpark for data exploration.
•Interpreted the results with MLlib in PySpark and used Seaborn to visualize the parameters that affect the property price.