Post Job Free

Resume

Sign in

Data Quality Engineer

Location:
Austin, TX
Posted:
February 02, 2024

Contact this candidate

Resume:

Sahithi Panchumarthi

Austin, TX ***** 402-***-**** ad3bmr@r.postjobfree.com LinkedIn

SUMMARY

Experienced data professional with three years of expertise in data processing, ETL development, database management, and engineering efficient data pipelines. Proficient in SQL, Python, and ETL tools, adept at optimizing data structures, ensuring data quality, and extracting valuable insights through statistical analysis and visualization. Experienced in AWS, passionate about leveraging data to drive innovative business solutions, seeking a dynamic environment to apply hands-on expertise and contribute to impactful data-driven projects.

SKILLS

Programming Languages: Python, SQL, R

Database: MySQL, PostgreSQL, MongoDB, HBase, Cosmos DB, Snowflake Big Data Ecosystem: Apache Hadoop, Apache Spark, YARN, Hive, Sqoop, Kafka Visualization tools: Tableau, Power BI, Matplotlib Cloud Platform: AWS, Azure

Tools and Software: Microsoft Excel, Google Analytics, Pandas, Git, NumPy, Jenkins, Jira, Docker, Bitbucket EXPERIENCE

Data Engineer Devoir Software Solutions LLC Chesterfield, MO Jan 2023 – Present

Contributed to on-premises migration to AWS, leveraging services like AWS Glue, S3, Redshift, Lambda and EC2, reducing costs by 37% while ensuring data integrity and security compliance.

Integrated AWS Glue for ETL, Lambda for event-driven processing, and Apache Spark for parallel data transformation, ensuring a smooth transition to AWS Cloud and reduced data processing time by 40%.

Implemented SQL and MySQL indexing strategies for on-premises databases, enhancing query speed by 45%; replicated success in Snowflake and Amazon Redshift, optimizing overall system performance during migration and cloud transition.

Developed Python-based automated scripts, including PySpark, for data quality checks, leveraging Jenkins for CI/CD and AWS Lambda for serverless execution, reducing manual intervention by 25% and ensuring data reliability.

Proactively resolved post-migration bottlenecks, achieving a 30% reduction in error incidents, and ensuring system stability.

Collaborated cross-functionally to design scalable, business-aligned data architecture using Snowflake's cloud-native services. Utilized Agile and SDLC methodologies, integrating Jenkins and AWS for end-to-end CI/CD pipeline and VM provisioning.

Collaborated on Tableau visualizations for actionable insights. Streamlined app deployment using Docker for consistency. Managed code efficiently with Git for version control and team coordination. Data Analyst Avisirah Technologies Pvt Ltd Hyderabad, India Sept 2019 – July 2021

Proficiently collected and refined diverse datasets using Python (Pandas), Excel, and SQL, ensuring data accuracy. Additionally, leveraged databases like SQL, NoSQL, PostgreSQL, MongoDB/Cassandra, Hadoop, and Hive.

Utilized Python (Pandas, NumPy) and R for exploratory data analysis, regression, hypothesis testing, and predictive modeling, improving forecasting accuracy by 20%. Used Excel, RStudio, and Jupyter Notebooks for detailed statistical analysis.

Executed optimized SQL queries, contributing to database optimization using MySQL, PostgreSQL, and managing databases in Azure cloud services, helped reduce processing time by 30%, enhancing system performance.

Developed and implemented comprehensive dashboards and reports in Power BI and Tableau, leveraging dynamic visualizations to effectively communicate insights to stakeholders.

Collaborated effectively in cross-functional teams, addressing data queries, and contributing to problem-solving while maintaining meticulous attention to detail, ensuring data integrity, and enhancing data-driven decisions by 25%.

Pursued ongoing education in advanced analytics, showcasing adaptability by transitioning from on-premises tools to Azure cloud-based solutions, demonstrating readiness to integrate cutting-edge technologies for improved analysis. Data Intern Avisirah Technologies Pvt Ltd Hyderabad, India May 2019 – Aug 2019

Assisted in preliminary statistical analysis, spotting trends within datasets, and created simple visualizations and charts in Excel, reducing workload by 25%.

Executed fundamental SQL queries for data extraction, database management, and integrity checks, contributing to overall data robustness.

Transformed and cleaned raw data using Python to create fact and dimension tables reducing redundancy by 35%.

Actively engaged in team discussions, providing valuable input for data-related problem-solving and strategy development. EDUCATION

University of Missouri Kansas City, Kansas City, MO Aug 2021-Dec 2022 Master of Science in Computer Science

B V Raju Institute of Technology, Hyderabad, India June 2016- May 2020 Bachelor of Technology

ACADEMIC PROJECTS

Twitter Sentimental Analysis [Twitter API, Python, NLTK, MySQL, Tableau, Pandas, NumPy] Conducted sentiment analysis on Twitter data pertaining to Elon Musk's controversies, extracting, and analyzing sentiments from user-specific tweet IDs to gauge public reactions and sentiment distribution.

Student Portfolio Platform(Web app) [JavaScript, CSS, Angular, PHP, MySQL, REST API, Node.js, Django] Developed a student portfolio platform as a team, offering secure login, curriculum tracking, grade management, assignment submissions, and professor feedback for academic progress monitoring and streamlined communication.



Contact this candidate