Prudhvi Alaparthi
Data Engineer
************@*****.*** +1-937-***-**** NJ, USA LinkedIn
SUMMARY
Value-driven Data Engineer and Data Analyst with 4+ years of experience in data modeling, ETL pipeline development, and cloud-based data solutions. Expertise in designing scalable data architectures, optimizing data workflows, and leveraging SQL, Python, and cloud platforms
(Azure, AWS) to drive efficient data processing. Adept at building automated reporting solutions, developing insightful dashboards in Tableau and Power BI, and collaborating with stakeholders to translate business needs into actionable insights. Strong background in cost transparency, data governance, and anomaly detection, ensuring data accuracy and operational efficiency across diverse industries. SKILLS
Methodologies: SDLC, Agile, Waterfall
Programming Languages: SQL, Python, R, MATLAB
IDEs: Visual Studio Code, PyCharm
Visualization Tools: Tableau, Power BI, MS Excel, Alteryx Cloud Technologies: AWS, Azure, Databricks
Databases: Oracle, MySQL, SQL Server, MongoDB
Other Tools: Git, Snowflake, Power Query, Macro, SAS, Apptio, TBM Studio, Apptio One, Cost transparency Operating System: Linux, Windows, Mac
EXPERIENCE
American Express, NY Data Engineer Oct 2024 - Present
• Spearheaded the Cost Transparency Project, optimizing infrastructure cost breakdowns for application owners, increasing cost accountability, and improving strategic financial planning by 30%.
• Collaborated with product owners and finance teams to integrate critical data from multiple sources, increasing cost model accuracy and streamlining financial reporting.
• Designed, built, and automated ETL pipelines leveraging PySpark, Python, SQL, and AWS services (S3, Lambda, Glue, Redshift) to streamline data processing and reporting, improving efficiency by 40%.
• Utilized Apptio One for cost modeling and reporting, creating interactive dashboards that enhanced cost visibility across applications and infrastructure, reducing manual reporting efforts by 35%.
• Worked extensively with on-premises databases and data warehouses, implementing scalable data integration solutions that enhanced processing efficiency. Optimized workflows to reduce latency and improve query performance. Achieved a 25% increase in data integration efficiency while ensuring seamless interoperability between systems.
• Defined and standardized units of measure (UoMs) for cost reporting, improving alignment across infrastructure and financial teams.
• Applied DevOps practices for pipeline deployment and monitoring, leveraging Git, Jenkins, and Airflow for version control, orchestration, and automation.
Pfizer, NY Data Analyst Jan 2024 - Sep 2024
• Applied Agile techniques to projects, ensuring objectives were met and delivering high-quality outcomes on schedule.
• Utilized data visualization tools such as Matplotlib, Seaborn, and ggplot2 to create captivating visualizations, improving the clarity and impact of data presentations by 25%.
• Led extensive data analysis initiatives using SQL queries and Excel, extracting actionable insights to address evolving business needs and support strategic decision-making processes.
• Developed engaging Tableau dashboards to enable intuitive data exploration and empower stakeholders to make informed decisions.
• Identified and resolved data anomalies through rigorous validation tests in Azure Data Factory pipelines, achieving a 99.5% accuracy rate and improving operational efficiency by 15%.
• Conducted routine maintenance and backups of MS SQL Server, ensuring data availability and readiness for disaster recovery scenarios.
• Performed detailed data cleaning and preparation, producing high-quality datasets for analysis and increasing accuracy by 20%. Mindtree, India Data Analyst Jun 2019 - May 2022
• Utilized data visualization libraries such as Matplotlib and Seaborn to create visually compelling data visualizations, enhancing the communication of insights and facilitating the understanding of complex datasets.
• Designed and launched interactive Power BI dashboards and reports, enabling cross-departmental data-driven decision-making and increasing operational efficiency by 25% within the first quarter.
• Leveraged AWS Redshift for data warehousing, establishing a high-performance environment for querying and analytics, thereby enabling efficient business intelligence and data-driven decision-making processes.
• Employed Power Query to clean, transform, and merge data from multiple sources for analysis and reporting.
• Developed and deployed machine learning models across various domains to solve complex business problems, optimizing decision- making and operational processes.
• Conducted SQL-based data extraction, transformation, and analysis, resulting in 30% faster query execution. EDUCATION
Masters in Business Analytics
University of Dayton, Dayton, Ohio May 2024
Bachelors in Mechanical Engineering
KKRANDKSR Institute of Technology and Sciences, (JNTUK), Guntur May 2019