Post Job Free
Sign in

Data Engineer Analyst

Location:
Tampa, FL
Posted:
November 13, 2024

Contact this candidate

Resume:

Deepika Kolluru

+1-945-***-**** ****************@*****.*** https://www.linkedin.com/in/deepika-kolluru/ SUMMARY

As a Data Engineer with 4 years of experience, I specialize in designing scalable data architectures and managing data ingestion, transformation, and orchestration using Apache Spark and Airflow. I have hands-on experience with Python and SQL, and I am proficient in working with various cloud platforms, including GCP, AWS and Azure. Additionally, I have a strong background in data visualization, utilizing tools such as Tableau, Looker, and Power BI. WORK EXPERIENCE

Data Analyst, Client: Ohio State, US (2024 Jan – Present)

• Revolutionized incident data accuracy by 35% through seamless integration of ingestion and processing pipelines in GCP's BigQuery and Dataflow, empowering a 25% boost in decision-making efficiency for stakeholders.

• Fast-tracked model deployment timelines by 30% using PySpark on Dataproc, enabling real-time analytics and increasing forecasting precision for operational insights.

• Instituted data governance protocols with Google Cloud Composer, achieving a 35% improvement in data compliance and integrity across department.

Data Engineer, LTIMindtree, India (2020 Dec – 2022 Jul)

• Engineered scalable data warehousing solutions on GCP's BigQuery, driving a 40% uplift in operational efficiency and ensuring streamlined integration of diverse data sources.

• Amplified query performance by 40% through advanced SQL tuning and indexing, reducing data retrieval times and enhancing business intelligence for critical operations.

• Spearheaded ETL automation in Python with Google Cloud Dataflow, accelerating data processing by 50% to deliver high- accuracy, timely insights.

Data Analyst, Tectoro, India (2019 Sep – 2020 Dec)

• Transformed data transformation processes with PySpark in Databricks, enabling 30% faster processing of large datasets and enhancing report accuracy for stakeholders.

• Crafted dynamic, real-time Tableau dashboards, boosting user engagement and alignment by 30% for improved strategic planning.

• Elevated data quality and governance by 30% by implementing meticulous controls and collaborating across teams, ensuring compliance and accuracy in reporting.

TECHNICAL SKILLS

Cloud Platforms: GCP (Big Query, Dataflow, Data proc, Google Cloud Storage, Google Cloud Composer), AWS (S3, Redshift, Glue, EMR, Lambda, IAM), Azure (Azure Synapse, Data Lake, Databricks)

Data Engineering: Data modeling, warehousing, ETL pipelines, distributed computing, Informatica DQ, Snowflake

Big Data Technologies: Apache Spark, Hadoop, Hive, EMR, Kafka ETL & Data Integration Tools: Apache Spark, AWS Glue, Apache Airflow Query Languages: SQL, PL/SQL, HiveQL, Spark SQL

Scripting Languages: Python, R, Java, Scala

Database Systems: AWS Redshift, MySQL, PostgreSQL, Oracle, Teradata, SQL Server Visualization Tools: Power BI, Tableau, Google Data Studio CERTIFICATIONS

• Azure Data Engineer: Associate Microsoft (DP-203)

• Google cloud certified: Professional Data Engineer EDUCATION

University of North Texas (UNT) TX, USA GPA: 3.5/4.0 Master of Science in Data Science May 2024

GITAM University Hyderabad, India Jun 2020

Bachelor of Technology in Computer Science and Engineering AWARDS

• Awarded Spot Award by Mindtree for effective and efficient delivery of multiple projects.



Contact this candidate