Srinivasarao Goli
Data Engineer
E-Mail: *********@*********.*** Phone: +1-913-***-**** Location: USA
SUMMARY
• Data Engineer with around 4 years of experience in designing, developing, and maintaining scalable big data solutions to meet complex business needs.
• Adept at working with Agile and Waterfall methodologies to drive the successful execution of data engineering projects, consistently delivering high-quality solutions on time and aligned with organizational goals.
• Proficient in building high-performance data pipelines and optimizing data workflows using Hadoop, Apache Spark, and MapReduce, ensuring efficient processing of large datasets across various platforms.
• Skilled in programming with Python, R, and SQL, and leveraging industry-leading tools like Scikit-learn for developing machine learning models that enhance data-driven decision-making and improve business intelligence.
• Experienced in designing and automating ETL processes using tools such as SSIS, NiFi, and Kafka, with a strong focus on delivering scalable solutions across AWS and Azure environments.
• Adept at visualizing data using tools like Tableau and Power BI, enabling key stakeholders to access actionable insights and drive business strategy.
• Experienced in managing relational and NoSQL databases such as MySQL, SQL Server, MongoDB, and PostgreSQL, ensuring high availability, data integrity, and optimal performance across systems.
SKILLS
Methodologies: SDLC, Agile, Waterfall
Programming Languages: Python, R, SQL
IDEs: PyCharm, Jupyter Notebook
Big Data Ecosystem: Hadoop, MapReduce, Hive, Apache Spark, Pig, Sqoop, PySpark, Snowflake
ETL Tools: SSIS, Apache NiFi, Apache Kafka, Talend, Apache Airflow, Informatica
Cloud Technologies: AWS, Azure, GCP, Databricks
Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn, Seaborn, TensorFlow
Reporting Tools: Tableau, Power BI, SSRS
Databases: MongoDB, MySQL, SQL Server, PostgreSQL
Version Control: Git, GitHub, GitLab
Other Skills: Data Cleaning, Data Wrangling, Critical Thinking, Communication Skills, Presentation Skills, Problem-Solving
Operating Systems: Windows, Linux, macOS
EXPERIENCE
MetLife USA
Data Engineer Jul 2023 - Present
• Led the adoption of Agile practices across multiple data engineering projects, improving project timelines and fostering better communication between teams.
• Engineered scalable, high-performance data pipelines using Apache Spark and Kafka, increasing processing throughput by 65%.
• Established end-to-end ETL workflows using Talend, automating data extraction, transformation, and loading, which improved data accuracy by 35% and enhanced data accessibility across internal systems by 30%.
• Designed interactive and insightful dashboards using Tableau, translating complex datasets into actionable visualizations that facilitated better business decisions.
• Configured and optimized data processing workflows on Azure Databricks, increasing processing speed by 70%.
• Streamlined collaboration among data engineering teams, resulting in a 40% increase in overall team productivity.
• Managed and optimized Snowflake data warehouses, improving data retrieval processes and storage efficiency.
• Worked closely with cross-functional teams to refine and optimize MySQL database architecture, improving system scalability and performance.
KPIT Business Solutions India
Jr. Data Engineer Feb 2020 - Jul 2022
• Implemented structured project management using Waterfall methodology to guide the development of data engineering projects, ensuring detailed planning and execution, which resulted in a 20% reduction in project delays and a more transparent development lifecycle.
• Optimized SQL queries for data retrieval and reporting, achieving a 40% boost in query performance and reducing system latency by 30%.
• Managed and processed large datasets (over 10 terabytes) using Apache Hadoop and MapReduce, delivering robust data processing capabilities while maintaining flexibility in storing both structured and unstructured data using HDFS.
• Collaborated with cross-functional teams to enhance Amazon Redshift data warehouse designs, leading to a 50% improvement in report generation speed.
• Developed a prototype application for e-commerce management using Python, integrating interactive Power BI visualizations to provide real-time insights into sales performance, enhancing decision-making and user engagement with the application.
• Set up and maintained Kubernetes clusters for deploying and scaling data applications, improving the scalability of data-centric systems and contributing to a 30% improvement in system uptime.
• Used Git for version control across multiple data pipeline development projects, improving team collaboration through effective branching, merging, and code review practices.
EDUCATION
University of Central Missouri Aug 2022 - Dec 2023
Master’s in Computer Science
Narasaraopet Engineering College, India Jul 2017 - Jul 2021
Bachelor’s in Electronics and Communication Engineering