MEGHANA MACHA
*******.*********@*****.*** www.linkedin.com/in/meghana-macha-30567b333 740-***-****
Professional Summary:
Experienced Data Engineer with 4 years of expertise in designing distributed data pipelines, implementing scalable data solutions, and optimizing cloud-based systems. Proficient in Spark, Python, and AWS ecosystems with a strong focus on big data technologies, workflow orchestration tools like Apache Airflow, and data governance practices. Adept at metadata management, improving data quality, streamlining data acquisition strategies, and collaborating with cross-functional teams to deliver innovative solutions aligned with business goals.
Certification:
Certified in Azure Data Engineering Associate
Certified in AWS Data Engineering Associate
Technical Skills:
Programming & Scripting: Python (proficient in OOP concepts), Spark, SQL, Unix
Data Platforms: Snowflake (including SnowSQL, SnowPipe), Teradata, Cassandra, MongoDB, Oracle, SQL Server, ADLS, Oracle Hexadata
BI & Analytics Tools: Tableau, Power BI, Grafana, Alteryx, Denodo, Cognos
Data Warehousing & ETL: IBM DataStage, Informatica, Azure Data Factory, Azure Databricks, Hadoop, Hive
Azure Cloud Technologies: Azure EventHub, Azure Blob Storage, Azure Data Lake, Azure Functions, Azure Power Apps, Power BI
CI/CD & Development Frameworks: Jenkins, Azure DevOps, GitHub, Terraform, Agile, DevOps
Data Governance & Privacy: Informatica Axon, EDC, BigID
Other Skills: Data Modeling, ODS (Operational Data Store) concepts, ETL pipeline development
Work Experience
Data Engineer/Power BI January 2023 – Present
Lithia Motors, United States
Responsibilities:
Designed and implemented ETL pipelines to process and analyze data, ensuring integrity, accuracy, and consistency across systems.
Utilized SQL to write, optimize, and troubleshoot queries for data extraction, transformation, and reporting tasks.
Applied SQL joins and created views to combine and simplify access to complex datasets, improving query performance and enabling efficient reporting workflows.
Debugged and resolved data issues by analyzing SQL logs, performing root cause analysis, and validating data flows.
Conducted data validation and profiling using SQL to identify anomalies, ensuring clean, high-quality data for reporting purposes.
Developed interactive dashboards and ad-hoc reports in Power BI to present trends and insights for informed decision-making.
Environment: SQL, Python, Power BI, IBM DataStage, Tableau.
Data Analyst July 2021 – July 2022
Tata Consultancy Services, Hyderabad
Responsibilities:
Designed and maintained dynamic dashboards using Power BI to provide actionable insights and improve operational decision-making.
Developed and optimized SQL queries for data extraction, validation, and analysis to ensure accuracy and efficiency in reporting.
Built and managed ETL pipelines to automate data ingestion, transformation, and storage processes, improving workflow efficiency.
Implemented CI/CD pipelines using Jenkins to automate deployment of data processes, ensuring smooth and consistent workflows.
Conducted root cause analysis to troubleshoot and resolve data anomalies, enhancing data quality and reporting accuracy.
Utilized SQL joins and views to combine data from multiple sources, enabling comprehensive reporting and analysis.
Automated reporting tasks using Python scripts and SQL, streamlining repetitive processes and reducing manual effort.
Performed data profiling and validation to identify inconsistencies and ensure clean, reliable datasets for business users.
Environment: SQL, Python, Power BI, ETL (DataStage), Jenkins (CI/CD), Tableau.
SQL Developer October 2020 – July 2021
Cognizant, Hyderabad
Responsibilities:
Designed and developed SQL queries for data extraction, transformation, and reporting, ensuring optimal performance and accuracy.
Created and managed views to simplify access to complex data structures and support streamlined reporting processes.
Applied joins and other SQL techniques to combine datasets from multiple tables for analysis and reporting.
Debugged data issues by analyzing query performance and identifying root causes, ensuring clean and accurate data.
Developed ad-hoc reports to support stakeholders in analyzing trends and making informed decisions.
Collaborated with cross-functional teams to understand data requirements and deliver efficient solutions.
Environment: SQL, Power BI, Python.
Education:
Bachelor’s in computer science, Jawaharlal Nehru Technological University
Master of Science, Webster University
Additional Projects
Automotive Data Analysis
Conducted an in-depth analysis of automotive datasets to identify trends and insights, leveraging Python for data manipulation and visualization.
Utilized Pandas and Matplotlib to clean, process, and analyze structured data, presenting findings through interactive visual dashboards.
Skills Used: Python, Pandas, Matplotlib, Data Visualization
Dice Job Scraping
Developed a web scraping tool to extract job postings from the Dice website using Python.
Demonstrated expertise in data extraction, cleaning, and storage, with applications in data engineering and analytics.
Skills Used: Python, Beautiful Soup, Web Scraping, Data Cleaning
View Project on GitHub