Surya Gande
Email: ***********@*****.*** Mobile: 763-***-****
PROFESSIONAL SUMMARY
5+ years of experience as a Senior Data Engineer specializing in AI data platforms, with a proven track record in developing scalable data solutions and enhancing data-driven decision-making processes for large organizations.
Expertise in designing and implementing data pipelines using advanced technologies, ensuring high data quality and availability for analytics and reporting.
Proficient in cloud platforms such as AWS and Azure, leveraging services to optimize data storage and processing capabilities in line with organizational goals.
Strong background in big data technologies, including Apache Spark and Hadoop, enabling efficient processing of large datasets for actionable insights.
Demonstrated ability to collaborate with cross-functional teams to define data requirements and deliver solutions that meet business needs and enhance operational efficiency.
Committed to adhering to best practices in data governance, security, and compliance, ensuring alignment with organizational policies and industry standards.
Recognized for driving innovation in data engineering practices, contributing to improved performance metrics and successful project outcomes.
SKILLS
Programming Languages: Python, SQL, Java, Scala, R, JavaScript, C#, Go
Data Engineering Tools: Apache Spark, Apache Kafka, Apache Airflow, Databricks, Talend, Informatica, AWS Glue, Azure Data Factory
Cloud Platforms: AWS, Azure, GCP, EC2, S3, Lambda, Azure Functions, Snowflake
Database Technologies: MySQL, PostgreSQL, MongoDB, Redshift, DynamoDB, Oracle, Cassandra, Teradata
Data Warehousing: Snowflake, Amazon Redshift, Google BigQuery, Teradata, Azure Synapse Analytics, Data Lakes, ETL Processes, Data Modeling
Big Data Technologies: Hadoop, Apache Hive, Apache HBase, Apache Flink, Apache NiFi, Spark Streaming, Flume, Kafka Streams
Data Visualization: Tableau, Power BI, Looker, Matplotlib, Seaborn, D3.js, Google Data Studio, QlikView
Soft Skills: Communication, Teamwork, Problem-Solving, Adaptability, Critical Thinking, Collaboration, Time Management, Conflict Resolution
Project Management: Agile, Scrum, Kanban, Waterfall, Stakeholder Management, Risk Management, Resource Allocation, Performance Tracking
Data Governance: Data Quality, Data Privacy, Compliance Standards, Metadata Management, Data Lineage, Data Stewardship, Data Security, Ethical Data Use
WORK EXPERIENCE
UnitedHealth Group - Minnetonka, MN
Senior Data Engineer - AI Data Platforms - Aug 2024 to Present
Spearheaded the development of AI-driven data platforms utilizing Amazon Web Services (AWS) and Apache Spark, enhancing data processing efficiency by 30% and supporting real-time analytics for healthcare solutions.
Engineered scalable data pipelines using Python and Apache Kafka, facilitating the integration of diverse data sources and ensuring high availability for data-driven decision-making across the organization.
Collaborated with cross-functional teams to implement data governance frameworks, ensuring compliance with HIPAA regulations and enhancing data security protocols.
Optimized ETL processes by leveraging Azure Data Factory and Databricks, resulting in a 25% reduction in data processing time and improved data accuracy for reporting purposes.
Led a team of data engineers in adopting Agile methodologies, conducting daily stand-ups and sprint planning sessions to enhance project delivery timelines and team collaboration.
Developed and maintained data models in Snowflake, improving data accessibility for business intelligence tools and enabling stakeholders to derive actionable insights.
Implemented machine learning algorithms in data analysis workflows, contributing to predictive analytics initiatives that improved patient outcomes and operational efficiency.
Conducted training sessions for junior engineers on best practices in data engineering and cloud technologies, fostering a culture of continuous learning and knowledge sharing.
Analyzed system performance metrics and identified bottlenecks, implementing solutions that enhanced system reliability and reduced downtime by 15%.
Presented project outcomes and data insights to senior management, effectively communicating complex technical concepts to non-technical stakeholders.
Technologies Used: AWS, Azure, Databricks, Snowflake, Apache Spark, Python, Apache Kafka, ETL, Agile, Machine Learning, Data Governance, HIPAA, Data Modeling, Business Intelligence
Wayfair - Boston, MA
Advanced Data Engineer - Jan 2021 to Dec 2023
Designed and implemented robust data pipelines using Apache Airflow and Python, streamlining data ingestion processes and improving data availability for analytics teams by 40%.
Optimized data storage solutions in Google BigQuery, reducing storage costs by 20% while enhancing query performance for large datasets utilized in e-commerce analytics.
Collaborated with product teams to define data requirements and develop data models that supported real-time inventory tracking and customer behavior analysis.
Leveraged machine learning techniques to analyze customer data, resulting in personalized marketing strategies that increased conversion rates by 15%.
Established data quality metrics and monitoring systems, ensuring data integrity and compliance with internal standards and external regulations.
Led initiatives to migrate legacy data systems to cloud-based architectures, improving scalability and reducing operational overhead by 30%.
Facilitated workshops on data visualization tools such as Tableau, empowering business users to create self-service reports and dashboards for enhanced decision-making.
Conducted performance tuning of SQL queries and data processing jobs, achieving a 35% improvement in response times for critical business reports.
Mentored junior data engineers, providing guidance on technical challenges and fostering a collaborative team environment focused on innovation.
Engaged with stakeholders to gather feedback on data solutions, iterating on designs to better meet business needs and enhance user satisfaction.
Technologies Used: Apache Airflow, Python, Google BigQuery, Machine Learning, SQL, Tableau, Data Quality, Cloud Migration, Data Modeling, E-commerce Analytics, Data Visualization, Performance Tuning
PayPal - San Jose, CA
Big Data Engineer - Oct 2019 to Dec 2020
Developed and maintained big data processing frameworks using Apache Hadoop and Spark, enabling the analysis of large-scale transaction data and improving fraud detection capabilities.
Collaborated with data scientists to implement machine learning models for predictive analytics, enhancing transaction monitoring and reducing false positives by 20%.
Engineered data ingestion processes using Apache NiFi, ensuring seamless data flow from multiple sources into the data lake for comprehensive analysis.
Conducted performance optimization of data processing jobs, resulting in a 30% reduction in processing time and improved resource utilization across the cluster.
Implemented data security measures in compliance with PCI-DSS standards, safeguarding sensitive financial information and maintaining customer trust.
Participated in Agile ceremonies, contributing to sprint planning and retrospectives to continuously improve team processes and project outcomes.
Analyzed user engagement metrics and provided insights to product teams, driving enhancements in user experience and increasing customer retention rates.
Documented data engineering processes and created technical specifications, ensuring knowledge transfer and adherence to best practices within the team.
Assisted in the migration of on-premises data solutions to cloud environments, improving scalability and flexibility for future growth.
Presented technical findings and project updates to stakeholders, effectively communicating complex data concepts and fostering informed decision-making.
Technologies Used: Apache Hadoop, Apache Spark, Machine Learning, Apache NiFi, PCI-DSS, Agile, Data Lake, Data Security, Performance Optimization, User Engagement Analytics, Cloud Migration, Technical Documentation
CERTIFICATIONS
AWS Certified Data Engineer - Associate
Azure Data Engineer Associate
Databricks Certified Data Engineer Professional
Snowflake SnowPro Core Certification
EDUCATION
Masters in Business Analytics University of Massachusetts Amherst GPA 3.95
Masters in Economics University of Hyderabad 79%
Bachelors in Economics University of Hyderabad 79%