Chandra Data Analyst
Email: *******.******@*****.*** Experience: 3+ Years
Contact No: +1-913-***-****
Objective
As an Experienced Software Engineer with over 3 years of expertise in designing and optimizing data pipelines, performing data analysis, and delivering actionable insights, I specialize in leveraging advanced technologies. Proficient in building ETL workflows, working with Cloud, and big data tools like Apache Spark and Hadoop. Skilled in database management, data modelling, and ensuring data integrity through automation. Expertise in Python, SQL, and data visualization tools (Tableau, Power BI) to clean, process, and present data. Strong communicator, bridging technical and business teams to drive data-driven decisions and efficiency. Passionate about leveraging data to support strategic growth.
Skills
Programming Languages: SQL, Python(Pandas, Numpy, matplotlib), R
Data Visualization: Tableau, Power BI, Matplotlib, Seaborn
Databases: MySQL, PostgreSQL, Microsoft SQL Server
Tools: Excel, Google Sheets, Jupyter Notebooks
Statistical Methods: Hypothesis testing, Regression analysis, A/B testing
Data Reporting: Automated reporting, Dashboard creation
Big Data Technologies: Hadoop, Spark, Kafka, Hive, Flink
ETL Tools: Apache NiFi, Airflow, Talend, Informatica
Cloud Platforms: Amazon Web Services (S3, Lambda, Redshift), Azure (Data Factory, SQL)
Data Warehousing: Google Big Query
Version Control: Git, GitHub
Data Modelling & Warehousing: Kimball methodology, Star/Snowflake schema design
Work Experience
Humana, USA
Data Analyst Jan 2024– Present
Assisted in reviewing data ingestion from sources (APIs, databases, flat files), ensuring 99% data accuracy.
Cleaned records using Python and Pandas, improving data consistency by 30%.
Supported data transformation by creating ETL scripts using SQL and Apache Airflow to filter, aggregate, and merge datasets.
Monitored data pipelines, ran daily validation checks, reducing processing errors by 20%.
Optimized SQL queries for large datasets, reducing query times by 35% for faster data retrieval.
Documented database structures and schemas for efficient querying.
Generated reports identifying bottlenecks, improving pipeline efficiency by 20%.
Worked with data engineers to improve data quality, boosting pipeline efficiency by 15%.
Automated data cleaning for records using Python, reducing manual work by 40%, and suggested ETL improvements, cutting pipeline bottlenecks by 25%.
Mind Tree, Bengaluru
Associate Software Engineer Sep 2020 – Aug 2022
Extracted and integrated data from retail systems into a central data warehouse using SQL and Python.
Set up API integrations with external sources like supplier databases using Python and Apache Kafka.
Developed ETL pipelines in Apache Airflow, Python automating the processing of transactions.
Cleaned and standardized records using Pandas and NumPy, ensuring data consistency.
Designed relational and NoSQL databases (MySQL, MongoDB) for inventory data and optimized queries by 30% using indexing.
Integrated demand forecasting models with data scientists, improving forecast accuracy by 15% and developing strategies to reduce stockouts by 20% and overstock by 10%.
Created real-time dashboards in Tableau to track stock levels and sales trends.
Collaborated with teams to ensure data is structured for business needs using Jira.
Integrated healthcare claims data from sources using Python and SQL improving data quality by 25%.
Designed NoSQL (MongoDB) and relational (PostgreSQL) databases to store claims data.
Developed real-time dashboards in Tableau/Power BI, created reports on fraud using Python, SQL, and Excel, and engineered features for fraud detection models, pattern recognition accuracy by 15%.
Ensured HIPAA compliance in data handling, safeguarding patient privacy.
Cleaned healthcare claims using SQL, Python ensuring accuracy by handling missing values and duplicates.
Generated statistics and analysed trends in claims using Excel, R, SQL and Power BI, identifying a 10% increase in claims from specific providers.
Education
Masters in Computer Science, 2023
University of Central Missouri, Warrensburg, MO
Bachelor of Technology in Electronics and Communication Engineering, 2021
Rajeev Gandhi Memorial College of Engineering and Technology.
Related Courses completed:
oAdvanced Operating Systems
oDatabase Theory & Apps
oArtificial Intelligence
oCloud Computing
oStatistical Foundations
oAdvanced Web Applications
oAdvanced Computer Networks
Certifications
Python and SQL for Data Science (Scalar Topics).
DBMS-Master the Fundamentals and Advanced Concepts (Scalar Topics).
Programming for everybody (Python) and AI for everybody (Coursera).
The Joy of Computing using Python (NPTEL).
Fundamentals of AWS (Scalar Topics).