Data Engineer Governance

Location:
Carrollton, TX
Posted:
April 21, 2025

Resume:

SRI HARSHA DONDA

+1-817-***-**** **************@*****.*** Texas, US

SUMMARY

Experienced Data Governance Consultant and Data Engineer with expertise in designing and implementing business data lineage solutions, ensuring data quality, and driving governance activities. Proficient in managing cloud environments, including AWS, Azure, and GCP, while leveraging tools like Collibra, Ataccama, and Informatica to enforce data policies and improve decision-making. Skilled in developing scalable reporting platforms, automating data workflows, and optimizing ETL pipelines using Python, SQL, and AWS services. Proven ability to lead cross-functional teams, collaborate with stakeholders, and deliver complex solutions on time and within scope.

PROFESSIONAL EXPERIENCE

Data Governance Consultant (Discover Financial Services) Jun 2023 – Dec 2024, US

Designed and implemented a business data lineage solution using Collibra and Ab Initio across all UK and EU business units for the Merchant Services, Consumer Loans, and Corporate Card divisions. The solution components were built using SQL Server and PowerShell

Engaged with customers across industries to assist with the administration and management of their Informatica environments

Completed complex environment upgrades of Informatica products on on-prem Windows and Linux systems, using both parallel and in-place techniques

Helped customers migrate their on-prem Informatica environments to cloud-based Amazon Web Services (AWS) and Microsoft Azure instances

Advised senior executive management on the design, architecture, and development of a new Risk Reporting Business Intelligence solution

Ensured data quality and compliance by using the data catalog to manage metadata, enforce data policies, and monitor data lineage in line with regulations and organizational standards

Developed custom data quality rules in Ataccama to address specific business requirements, improving data trustworthiness and decision-making

Implemented scripts that load Google BigQuery data and run queries to export it

Designed and developed a proof of concept (POC) of an adaptable, extensible, metadata-driven financial reporting data warehouse with a data virtualization layer, using Markit EDM for ETL, SQL Server, SSAS, and Tableau. The solution supports the early-arriving facts scenario

Analyzed the existing data repositories as data sources for the BI reporting solution

Managed Collibra DGC across the enterprise, driving governance activities for all participating business units, ensuring all work was completed on time and to standard, and mitigating risks as needed

Responsible for the intake process, ensuring all requests or issues are handled in a timely manner and assigned to appropriate parties for review, resolution and escalation

Worked with the business on RACI activities and incorporated them into Collibra

Established and governed enterprise data governance implementation as a strategic priority for developing information-based capabilities

Rolled out an enterprise-wide data governance framework focused on improving data quality and protecting sensitive data through changes to organizational behavior, policies and standards, principles, governance metrics, processes, and related tools

Defined roles and responsibilities related to data governance and ensured clear accountability for stewardship of the company’s principal information assets

Facilitated the development and implementation of data quality standards, data protection standards, and adoption requirements across the enterprise

Data Engineer (Sify Technologies) Jul 2020 – Aug 2021, IN

Designed and implemented a scalable reporting platform that enabled the generation of customized merchant reports by leveraging AWS Glue, Step Functions, and DynamoDB. This architecture facilitated seamless data processing and transformation, ensuring that reports could be tailored to specific merchant requirements in a timely manner

Developed a microservice-based reporting engine that aggregated data from DynamoDB and automated the storage of processed data in Amazon S3. Integrated AWS Lambda to trigger SNS notifications, which automatically converted the data into CSV and PDF formats, streamlining the report generation process for merchants

Automated serverless workflows using AWS Step Functions, significantly improving the efficiency of data processing. This innovation reduced infrastructure costs by 30% by eliminating the need for dedicated servers and ensuring more efficient use of cloud resources

Created AWS Glue Crawlers to automatically infer table schemas from raw data stored in Amazon S3. This approach enabled efficient querying using Amazon Athena, enhancing data retrieval speeds and minimizing the need for manual schema definition

Enhanced data pipelines by developing Python scripts for ETL workflows, which improved pipeline reliability and reduced data processing times by 40%. This optimization led to faster reporting cycles and better resource utilization

Utilized Terraform to provision AWS resources, ensuring consistent and repeatable infrastructure deployments across development, testing, and production environments. This approach standardized resource management and minimized configuration drift between environments

Reduced report generation time by 50% by optimizing data processing pipelines and leveraging automation techniques, enabling quicker turnaround times for on-demand reports and increasing overall system efficiency

Successfully onboarded multiple high-profile clients by implementing customizable reporting capabilities, which allowed clients to tailor reports to their specific needs. This flexibility not only improved client satisfaction but also contributed to increased business adoption of the platform

ACADEMIC PROJECTS

1. Merchant Services Cloud Reporting Platform

Developed a scalable reporting platform for merchants, automating the generation of customized reports based on merchant requests

Used AWS Glue to process and transform raw data into structured formats, enabling efficient querying through Athena

Designed serverless workflows using AWS Step Functions, reducing manual intervention and improving processing efficiency

2. Fraud Detection System for Loan Applications

Engineered microservices for fraud detection in loan applications, integrating third-party APIs to fetch real-time credit scores and analytics

Optimized data processing pipelines, reducing fraud detection time by 30% while ensuring accuracy

3. Online Food Ordering System

Developed a web application with Django, implementing dynamic front-end features with JavaScript and backend APIs with Spring Boot

Streamlined the order placement process, improving transaction speed and reducing errors

EDUCATION

Master of Science – Business Analytics, University of Texas at Arlington, US Aug 2021 – May 2023

Bachelor of Technology – Computer Science, GITAM University, IN Jun 2016 – Jun 2020

SKILLS

Hadoop Technologies - HDFS, MapReduce, YARN, Hive, Pig, HBase, Impala, ZooKeeper, Sqoop, Oozie, Apache Cassandra, Flume, Spark, AWS, EC2

Data Governance Tools - Collibra DGC v5.x, Ataccama, Informatica Axon, EDC, Alation

Cloud Technologies - AWS, GCP, Azure, Lambda, Athena, EBS, DMS, BigQuery, Elasticsearch, SQS, SNS, KMS, QuickSight, ELB

Programming Languages - Python, PySpark, Spark, SQL, Java, PHP, PL/SQL, Scala, Shell Scripts

Databases - NoSQL, Oracle, DB2, MySQL, SQL Server, MS Access, HBase

Data Modeling & Data Quality - Erwin R9.x, ER/Studio, Snowflake, Informatica Developer / Analyst (IDQ), Ataccama Data Quality Analyzer

ETL Tools - Apache NiFi, Apache Airflow, Talend, Informatica, SSIS

Big Data Tools - Apache Hadoop, Apache Spark, Apache Kafka, Kubernetes, Alteryx, Apache Hive, Apache Cassandra, Apache Flink, Apache Pig

Reporting Tools - Jaspersoft, Qlik Sense, Tableau, JUnit, Adobe Analytics

IDEs - Eclipse, NetBeans, JDeveloper, IntelliJ IDEA

CERTIFICATIONS

AWS Certified Solutions Architect – Associate

Advanced Python Programming (Coursera)

Machine Learning Bootcamp (Udemy)

Data Governance in Azure by Microsoft

Data Governance by University of Washington (Coursera)


