Anusha Pamali Dsilva
******************@*****.***(385-371-9710) linkedin.com/Anusha_Dsilva github.com/Anushadsilva San Jose Summary
Results-oriented Data Engineer with 3+ years of experience delivering impactful data-driven solutions. Successfully led a team of developers to implement a system that enhanced data accessibility by 50% and improved data accuracy by 30%. Skilled in cloud platforms like Azure and AWS, Python, SQL, and API development, with a proven ability to collaborate with cross-functional teams. Adept at integrating software development practices into data engineering workflows, ensuring scalable, secure, and efficient systems that drive business insights and operational excellence. Experience
Data Engineer, Rio Tinto May 2023 - Current
• Architected and developed a Flask API-driven data pipeline to ingest transformer oil data into Azure SQL Database, secured via Azure API Management. Integrated CI/CD pipelines using GitHub Actions for automated deployments, collaborating with cross-functional teams to ensure seamless integration and alignment with business needs.
• Designed and optimized ETL workflows, increasing operational efficiency by 85% and significantly reducing costs.
• Implemented ETL solutions for the Operational Data Platform, enabling real-time data availability in the data warehouse and BI layer, adding 25+ KPIs to improve productivity reporting and decision-making by 98%.
• Conducted API testing with Postman and worked with cross-functional partners to validate JSON payloads and ensure data integrity. Configured secure SSL certificates, reducing data retrieval times by 50%.
• Extracted oil data from legacy source systems using ODBC connections, configured SSIS packages for scheduled data loads from source to target, improving efficiency by 90% and driving cost savings through generated reports.
• Created Power BI reports for KPIs and automated workflows with Power Automate, streamlining productivity tracking and enabling actionable insights.
• Resolved data issues in condition monitoring reports by modifying or writing complex SQL queries, views and stored procedures.
Data Science & Engineering Intern, Rio Tinto June 2022 - Jan 2023
• Created a dashboard to predict engine health based on oil sample data, reducing the cost and frequency of failures by 85%
• Resolved issues in the existing ETL setup for extracting oil data and implemented an alternate solution that significantly improved performance and efficiency by 90%.
• Developed Python scripts to establish ODBC connections to source data, efficiently extracting and pushing delta records to the destination, ensuring continuous data synchronization and updates.
• Designed and optimized complex SQL queries to split downtime records by shift and accurately determine the production day. Senior Software Engineer, Wipro Limited May 2014 - Aug 2021
• Build a utility in python to handle reimbursements for customers wrongly charged on their credit card there by increasing process efficiency to 88%
• Developed python scripts to encrypt files containing customers details and store it in S3
• Automated policy creation for life insurance product using Python and improved processing time by 85%
• Analyzed insurance product usage data in R Studio to get insights on products extensively used by customers
• Build and maintained data extraction tool using SQL queries, Python libraries (pandas) to extract and consolidate data from different third-party vendors
• Developed and executed SQL queries to retrieve information adhering to business requirements
• Documented various insurance and credit card flows to ease up training and development for new hires in the team
• Collaborated with cross-functional teams using Agile methodologies, such as Scrum and Kanban, to deliver high-quality software products within tight deadlines
Education
Master's in Information Systems University of Utah, David Eccles School of Business, USA May 2023 Master’s in Software Engineering Birla Institute of Technology and Science, Pilani, India Dec 2018 Bachelor’s in computer application Mangalore University May 2014 Certifications
Data Engineering Nano Degree, Udacity, 08/21
• Developed ETL pipelines for Apache Cassandra and Postgres, processing event data and loading into fact and dimension tables.
• Built data pipelines to extract, stage, and transform data into Redshift for user insights.
• Implemented Spark-based ETL pipeline for processing and storing data in S3.
• Automated and monitored production data pipelines using Apache Airflow.
• Skills: Python, Postgres, Apache Cassandra, AWS, Redshift, Spark, Apache Airflow SKILLS
Programming & Scripting: Python, SQL, R, PowerShell Cloud Platforms & Services: AWS (Glue, Redshift, Aurora, S3), Azure (SQL Database, API Management, DevOps), GCP (Big Query) Database/Visualization: SQL Server (SSMS), PostgreSQL, Cassandra, Amazon Redshift, Azure SQL Database, Big Query, Power BI, ERD