Data Engineer with over ** years of experience driving large-scale data initiatives and delivering complex technical programs. Adept at col- laborating with cross-functional and global teams to align technical solutions with business strategies, ensuring timely delivery and measur- able business impact.
WORK EXPERIENCE
Lead Data Engineer, PwC, Boston July 2020-Oct 2024
• POC : Designed and deployed an AI-powered chatbot on Azure Databricks using LLMs, integrating enterprise knowledge bases and data lakes for real-time, context-aware responses, reducing support ticket volume by 30%.
• Led migration of a legacy asset management platform to Azure Databricks, consolidating 150+ Spark pipelines, improving query performance by 40% and reducing compute costs by 25% through optimized auto-scaling and caching.
• Developed a Python-based document digitization service using Airflow, Azure Functions, Event Hubs, replacing a legacy ETL tool
(formerly Datawatch Monarch), leading to $4.5M in annual licensing savings and a 15% boost in data quality.
• Led the development of a high-performance microservice leveraging RabbitMQ and Snowflake API, enabling real-time data extrac- tion and improving data accessibility by 30%.
• Integrated Veracode and SonarQube into Azure DevOps CI/CD pipelines, using Azure Functions for automated policy enforcement and Azure Key Vault for secret management—boosting security and code quality coverage by 40% across 100+ repositories.
• Led a global data governance initiative across 20+ cross-functional teams, using Azure Purview and Data Factory to improve data lineage tracking, enhance regulatory compliance, metadata management for Tax and Accounting SaaS applications. Senior Data Engineer, PwC, Boston July 2017-July 2020
• Engineered a microservice hosted on Azure App Services using C# .NET, LINQ, SQL Server, and REST APIs to automate audit testing workflows within the Asset Management Data Platform, significantly improving audit quality and operational efficiency.
• Built a scalable data pipeline using Fivetran, DBT, and Snowflake to automate financial transaction reporting; reduced data latency by 80% and enabled near real-time insights in Power BI for compliance and fraud analytics
• Developed an enrichment service for the Master Data Management (MDM) platform by integrating third-party REST APIs from Bloomberg and Refinitiv, enhancing data accuracy and financial instrument coverage Data Engineer, PwC, Boston Feb 2016-July 2017
• Utilized Python (Pandas) and SQL to analyze customer feedback and support tickets, identifying product pain points and introducing new features, reducing customer tickets by 40%.
• Delivered high-quality Tableau dashboards for KPI tracking, driving product improvements and supporting business growth.
• Implemented Robotic Process Automation (RPA) using UiPath and Alteryx to automate custodian reports, enhancing data stand- ardization and improving report accuracy and efficiency. Business Intelligence Engineer Co-op, Houghton Mifflin Harcourt, Boston June 2014-Dec 2014
• Leveraged Google Analytics, SiteCatalyst, and R programming to analyze customer behavior and develop a data-driven product roadmap, leading to a 23% increase in conversion rates.
• Conducted A/B testing and collaborated with cross-functional stakeholders to optimize the design and functionality of the eCom- merce site, significantly enhancing user experience. Data Engineer, Bank of America, Mumbai July2010-July 2013
• Automated key ETL components, including scheduling, data extraction, and transformation, achieving a 20% improvement in batch performance by optimizing SQL queries and streamlining long-running processes.
• Designed a Shell-based archival framework, saving 100+ hours annually and reducing operational costs by 18%, while optimizing SQL ETL processes to improve data throughput and system scalability. SKILLS
• Cloud Platforms: Azure (Databricks, Event Hubs, Azure DevOps), AWS (Redshift, RDS), Snowflake
• Languages: SQL, Python, C#, Java, R, JavaScript, Spark, Shell, HTML, CSS
• Databases: Microsoft SQL Server, MySQL, PostgreSQL, Oracle, Cassandra, Amazon Redshift, RDS, Airtable
• Tools & Orchestration: Airflow, Jenkins, Docker, Kubernetes, GitHub, Veracode, SonarQube, Alteryx
• Data Visualization: Tableau, Power BI, MicroStrategy, Excel, R, Google Analytics, Adobe SiteCatalyst
• Project Management: SAFe, Agile, Scrum, Kanban, Azure DevOps, JIRA, Confluence, ServiceNow EDUCATION
Master of Science, Information Systems: Northeastern University, Boston, MA Sep 2013 - Dec 2015 Bachelor of Engineering (B.E.), Information Technology: University of Mumbai, India Aug 2006 - Jun 2010 RISHI IDNANI 18 Curtis Road Natick, Massachusetts 617-***-**** **********@*****.*** www.linkedin.com/in/rishiidnani **********@*****.*** www.linkedin.com/in/rishiidnani