SRI HARSHA DONDA
+1-817-***-**** **************@*****.*** Texas, US
SUMMARY
Experienced Data Governance Consultant and Data Engineer with expertise in designing and implementing business data lineage solutions, ensuring data quality, and driving governance activities. Proficient in managing cloud environments, including AWS, Azure, and GCP, while leveraging tools like Collibra, Ataccama, and Informatica to enforce data policies and improve decision-making. Skilled in developing scalable reporting platforms, automating data workflows, and optimizing ETL pipelines using Python, SQL, and AWS services. Proven ability to lead cross-functional teams, collaborate with stakeholders, and deliver complex solutions on time and within scope.
PROFESSIONAL EXPERIENCE
Data Governance Consultant (Discover Financial Services) Jun 2023 – Dec 2024, US
Designed and implemented business data lineage solution using Collibra and Ab Initio across all UK and EU business units for Merchant Services, Consumer Loans, and Corporate Card divisions. The components of the solution have been built using SQL Server and PowerShell
Engaged with customers across industries to assist with the administration and management of their Informatica environments
Successfully completed complex environment upgrades of Informatica products on Windows as well as Linux on prem systems utilizing both parallel and in-place techniques
Helped customers to migrate their on-prem Informatica environments to the cloud-based Amazon Web Services (AWS) and Microsoft Azure instances.
Advise senior executive management on design, architecture and development of a new Risk Reporting Business Intelligence solution
Responsible for ensuring data quality, and compliance. I use the catalog to manage metadata, enforce data policies, and monitor data lineage to ensure adherence to regulations and organizational standards
Developed custom data quality rules in Ataccama to address specific business requirements, improving data trustworthiness and decision-making.
Implemented scripts that load Google Big Query data and run queries to export data
Designed and Developed a POC of an adaptable and extensible metadata-driven financial reporting Data Warehouse with the data virtualization layer, using Markit EDM for ETL, SQL Server, SSAS and Tableau. The solution provides for the early-arriving facts scenario
Analyze the existing data repositories as data sources for the BI reporting solution
Managed Collibra DGC across the enterprise, driving governance activities for all participating business units and ensuring all work activity is completed on time and to standards; while mitigating risks as needed
Responsible for the intake process, ensuring all requests or issues are handled in a timely manner and assigned to appropriate parties for review, resolution and escalation
Responsible to work with the business for RACI activities and incorporate them in Collibra
Establish and govern an enterprise data governance implementation strategic priority for development of information-based capabilities
Roll out an enterprise-wide data governance framework, with a focus on improvement of data quality and the protection of sensitive data through modifications to organization behavior policies and standards, principles, governance metrics, processes, related tools
Define roles and responsibilities related to data governance and ensure clear accountability for stewardship of the company’s principal information assets
Facilitate the development and implementation of data quality standards, data protection standards and adoption requirements across the enterprise
Data Engineer (Sify Technologies) Jul 2020 – Aug 2021, IN
Designed and implemented a scalable reporting platform that enabled the generation of customized merchant reports by leveraging AWS Glue, Step Functions, and DynamoDB. This architecture facilitated seamless data processing and transformation, ensuring that reports could be tailored to specific merchant requirements in a timely manner
Developed a microservice-based reporting engine that aggregated data from DynamoDB and automated the storage of processed data in Amazon S3. Integrated AWS Lambda to trigger SNS notifications, which automatically converted the data into CSV and PDF formats, streamlining the report generation process for merchants
Automated serverless workflows using AWS Step Functions, significantly improving the efficiency of data processing. This innovation reduced infrastructure costs by 30% by eliminating the need for dedicated servers and ensuring more efficient use of cloud resources
Created AWS Glue Crawlers to automatically infer table schemas from raw data stored in Amazon S3. This approach enabled efficient querying using Amazon Athena, enhancing data retrieval speeds and minimizing the need for manual schema definition
Enhanced data pipelines by developing Python scripts for ETL workflows, which improved pipeline reliability and reduced data processing times by 40%. This optimization led to faster reporting cycles and better resource utilization
Utilized Terraform to provision AWS resources, ensuring consistent and repeatable infrastructure deployments across development, testing, and production environments. This approach standardized resource management and minimized configuration drift between environments
Reduced report generation time by 50% by optimizing data processing pipelines and leveraging automation techniques, enabling quicker turnaround times for on-demand reports and increasing overall system efficiency
Successfully onboarded multiple high-profile clients by implementing customizable reporting capabilities, which allowed clients to tailor reports to their specific needs. This flexibility not only improved client satisfaction but also contributed to increased business adoption of the platform
ACADEMIC PROJECTS
1.Merchant Services Cloud Reporting Platform
Developed a scalable reporting platform for merchants, automating the generation of customized reports based on merchant requests
Used AWS Glue to process and transform raw data into structured formats, enabling efficient querying through Athena
Designed serverless workflows using AWS Step Functions, reducing manual intervention and improving processing efficiency
2.Fraud Detection System for Loan Applications
Engineered microservices for fraud detection in loan applications, integrating third-party APIs to fetch real-time credit scores and analytics
Optimized data processing pipelines, reducing fraud detection time by 30% while ensuring accuracy
3.Online Food Ordering System
Developed a web application with Django, implementing dynamic front-end features with JavaScript and backend APIs with Spring Boot
Streamlined the order placement process, improving transaction speed and reducing errors
EDUCATION
Master of Science – Business Analytics, University of Texas at Arlington, US Aug 2021 – May 2023
Bachelor of Technology – Computer Science, GITAM University, IN Jun 2016 – Jun 2020
SKILLS
Hadoop Technologies - HDFS, MapReduce, YARN, Hive, Pig, HBase, Impala, Zookeeper, Sqoop, OOZIE, Apache Cassandra, Flume, Spark, AWS, EC2
Data Governance Tools - Collibra DGC v5.x, Ataccama, Informatica Axon, EDC, Alation
Cloud Technologies - AWS, GCP, Azure, Lambda, Athena, EBS, DMS, Big Query, Elastic Search, SQS, SNS, KMS, QuickSight, ELB
Programming Languages - Python, PySpark, Spark, SQL, Java, PHP, PL/SQL, Scala, Shell Scripts
Databases - NoSQL, Oracle, DB2, MySQL, SQL Server, MS Access, HBase
Data Modeling & Data Quality - Erwin R9.x, E R Studio, Snowflake, Informatica Developer / Analyst (IDQ), Ataccama Data Quality Analyzer
ETL Tools - Apache NiFi, Apache Airflow, Talend, Informatica, SSIS
Big data Tools - Apache Hadoop, Apache Spark, Apache Kafka, Kubernetes, Alteryx, Apache Hive, Apache Cassandra, Apache Flink, and Apache Pig
Reporting Tools - Jaspersoft, Qlik Sense, Tableau, Junit, Adobe Analytics
IDEs - Eclipse, NetBeans JDeveloper, IntelliJ IDEA
Reporting Tools - Jaspersoft, Qlik Sense, Tableau, Junit, Adobe Analytics
CERTIFICATIONS
AWS Certified Solutions Architect – Associate, Advanced Python Programming (Coursera), Machine Learning Bootcamp (Udemy)
Data Governance in Azure by Microsoft
Data Governance by University of Washington (Coursera)