Post Job Free
Sign in

Engineer Data

Location:
Alexandria, VA
Salary:
140000
Posted:
January 06, 2021

Contact this candidate

Resume:

POOJITHA VAGALE DUGGAPPA

**************@*****.*** 312-***-**** 1226 Koi Terrace, Fremont, CA 94536 EDUCATION

• Illinois Institute of Technology, Chicago IL Jan 2016 – Dec 2017 M.S. Computer Science

Coursework: Advanced Database Management Systems, Big Data Technologies, Advanced Data mining, Social Network Analysis, Software Project Management, Software Engineering, Unified Modelling Language, Software System Architecture.

• Visvesvaraya Technological University, Bangalore, India Aug 2011 – May 2015 B. Eng in Information Science

Coursework: Cloud Computing, Parallel Programming, Computer Networks, Data Structures, Operating Systems, OOP concepts, Unix. SKILLS

• Programming: Python, JavaScript, Java, React, HTML5, CSS3

• Database : MySQL, Oracle, Postgres, Redshift, Snowflake, Hive

• Tools : Tableau, ETL, Jupyter, Eclipse, PyCharm

• Other : AWS, Kubernetes, Jenkins, Airflow, Confluence PROFESSIONAL EXPERIENCE:

Nexmo Inc (Vonage API Platform), San Francisco CA Oct 2018 – Present Data Engineer

• Architected complex pipelines on Airflow using DAGs which vastly reduced manual intervention of scheduled jobs and re-runs by 80%, which led to the eventual termination of older and obsolete ETL processes.

• Improved existing ETL processes and optimized the queries to speed up daily Billing and CRM data by ~40%.

• Leveraged the highly sought-after Kimball Approach/Dimensional model to design a multi-functional Data Warehouse.

• Devised several Airflow Operators to enable data transfer from Salesforce to Snowflake and also from Redshift to S3 Buckets.

• Built innovative solutions to achieve different types of Billing models, which helped accommodate multiple varied parameters defined by the Customers.

• Heavily involved in building the infrastructure to optimally extract, transform and load data from a variety of sources using Hive and Snowflake.

• Championed mining of raw data using Python and transforming meta data into understandable metrics which give a pattern into Customer thinking which helps in making predictions based on that data.

• Introduced logic to transform the raw logs ingested from upstream systems into our S3 buckets, which were later aggregated into a more consumable form to further use this data to bill Customers.

• Pioneered the creation of Data Workbooks using Tableau by understanding the Organization’s fundamental business model, which was utilized by Marketing and Sales team to reduce the outstanding tickets by ~45%.

• Simplified the Reporting and Analytic processes through creative solutions which helped clear the backlogs and reduce the SLA breach by ~35%.

• Created Tableau reports to help understand the growth metrics, operational efficiency and other key business performance metrics which was instrumental in powering Business decisions on Product Pricing.

• Orchestrated various Data Analysis and Integrity workshops across the Organization which were of significant help for resources to build new ad- hoc reports from scratch.

• Identified cost reduction of ~$20,000 per month by spotting Terabytes worth of data in S3 buckets, which were past the current retention policy, and moving it to AWS Glacier.

• Practiced Agile processes with weekly sprints and code reviews with heavy involvement in all process documentation through Confluence.

• Designed scalable solutions to a wide variety of problems by integrating Airflow worker components into Kubernetes pods whilst simultaneously achieving CI/CD using Jenkins.

Minds At Work, Chicago IL

Data Engineer Jan 2017 – Oct 2018

• Involved in mining unstructured data and pre-processing it into meaningful scripts using Python and Pandas library.

• Highly optimized the performance of query functions that pull data from large tables by developing Teradata SQL scripts using OLAP functions.

• Became proficient in system Analysis, ER Dimensional Modelling, Data Design and implementing RDBMS specific features.

• Utilized views, stored procedures and custom SQL queries to validate data from Spreadsheets and SQL server to Tableau.

• Managed and designed the reporting environment which included modelling data sources, creating and scheduling reports and creating metadata.

• Proficient in Scrum Methodology (Agile) to implement project life cycles of reports design and development. Energy Labs

Data Analyst, Bangalore, India Jun 2015 – Dec 2015

• Actively involved in developing and maintaining the data reporting architecture which resulted in automating 6 critical reports thereby saving close to 15-man hours per week.

• Participated in requirements gathering and translating these requirements into insight generation and report visualization.

• Designed and developed KPIs, dashboards and reports by utilizing various reporting tools including Tableau Dashboards.

• Documented business processes, functional requirements, user roles, developmental notes, testing and release authorization for the complete Software Development Lifecycle process.

PROJECTS:

Chicago Bike Share Analysis Jul 2016 – Aug 2016

• Performed analysis of Chicago’s bike share system (Divvy) raw data to gather and generate actionable points such as discovering most/least exploited Divvy stations, docking capacity, average duration of each trip, favorable weather conditions. Generated visualizations using ggplot2 to aid in understanding the metrics. These insights are to help Divvy optimize docking capacity by 30% at each station ensuring higher availability to users at most exploited stations.

Feedback app Oct 2016 – Dec 2016

• Developed a web application utilizing responsive UI that allows students to register their questions/feedback regarding a course. Created a tracker that displays stats such as counts of questions/feedback received by the professor, status, and tools used in the solution such as HTML5, CSS, JavaScript, React JS, Bootstrap, MongoDB.



Contact this candidate