Deepika Kolluru
+1-945-***-**** ****************@*****.*** https://www.linkedin.com/in/deepika-kolluru/ SUMMARY
As a Data Engineer with 4 years of experience, I specialize in designing scalable data architectures and managing data ingestion, transformation, and orchestration using Apache Spark and Airflow. I have hands-on experience with Python and SQL, and I am proficient in working with various cloud platforms, including GCP, AWS and Azure. Additionally, I have a strong background in data visualization, utilizing tools such as Tableau, Looker, and Power BI. WORK EXPERIENCE
Data Analyst, Client: Ohio State, US (2024 Jan – Present)
• Revolutionized incident data accuracy by 35% through seamless integration of ingestion and processing pipelines in GCP's BigQuery and Dataflow, empowering a 25% boost in decision-making efficiency for stakeholders.
• Fast-tracked model deployment timelines by 30% using PySpark on Dataproc, enabling real-time analytics and increasing forecasting precision for operational insights.
• Instituted data governance protocols with Google Cloud Composer, achieving a 35% improvement in data compliance and integrity across department.
Data Engineer, LTIMindtree, India (2020 Dec – 2022 Jul)
• Engineered scalable data warehousing solutions on GCP's BigQuery, driving a 40% uplift in operational efficiency and ensuring streamlined integration of diverse data sources.
• Amplified query performance by 40% through advanced SQL tuning and indexing, reducing data retrieval times and enhancing business intelligence for critical operations.
• Spearheaded ETL automation in Python with Google Cloud Dataflow, accelerating data processing by 50% to deliver high- accuracy, timely insights.
Data Analyst, Tectoro, India (2019 Sep – 2020 Dec)
• Transformed data transformation processes with PySpark in Databricks, enabling 30% faster processing of large datasets and enhancing report accuracy for stakeholders.
• Crafted dynamic, real-time Tableau dashboards, boosting user engagement and alignment by 30% for improved strategic planning.
• Elevated data quality and governance by 30% by implementing meticulous controls and collaborating across teams, ensuring compliance and accuracy in reporting.
TECHNICAL SKILLS
Cloud Platforms: GCP (Big Query, Dataflow, Data proc, Google Cloud Storage, Google Cloud Composer), AWS (S3, Redshift, Glue, EMR, Lambda, IAM), Azure (Azure Synapse, Data Lake, Databricks)
Data Engineering: Data modeling, warehousing, ETL pipelines, distributed computing, Informatica DQ, Snowflake
Big Data Technologies: Apache Spark, Hadoop, Hive, EMR, Kafka ETL & Data Integration Tools: Apache Spark, AWS Glue, Apache Airflow Query Languages: SQL, PL/SQL, HiveQL, Spark SQL
Scripting Languages: Python, R, Java, Scala
Database Systems: AWS Redshift, MySQL, PostgreSQL, Oracle, Teradata, SQL Server Visualization Tools: Power BI, Tableau, Google Data Studio CERTIFICATIONS
• Azure Data Engineer: Associate Microsoft (DP-203)
• Google cloud certified: Professional Data Engineer EDUCATION
University of North Texas (UNT) TX, USA GPA: 3.5/4.0 Master of Science in Data Science May 2024
GITAM University Hyderabad, India Jun 2020
Bachelor of Technology in Computer Science and Engineering AWARDS
• Awarded Spot Award by Mindtree for effective and efficient delivery of multiple projects.