Post Job Free
Sign in

Data Engineer

Location:
Dallas, TX
Posted:
January 15, 2021

Contact this candidate

Resume:

EXPERIENCE

Data Engineer (contractor) Banking and Financial Services September 2019 - Present

Bank of America, Dallas, TX

Programming Languages:

Databases:

BI and Reporting Tools:

Big Data Analytics:

Cloud Technology:

SQL, Python (NumPy, Pandas, Scikit - Learn), R (caret, Dplyr), Java, Linux/Unix RDBMS - MS SQL Server, MySQL, Oracle, PostGreSQL, Cassandra

Tableau Desktop, SQL Server, Microsoft Excel

Hadoop, HDFS, Map Reduce, Hive, Impala, Hue, NoSQL - HBase, Spark, Basics Scala AWS(EC2, EMR, S3, Redshift, QuickSights, Athena), Azure ML Studio,Basics Azure data Factory THANUJHAA SRIEE AMMANARUL DANASEKARAN

*********.*****@*****.*** 214-***-**** www.linkedin.com/in/adts Dallas, TX Open to Relocate Tableau: https://public.tableau.com/profile/adts Github: https://github.com/ThSrAd SUMMARY

5 years of professional experience in engineering and data analysis, proficient in ETL, developing data warehouses

Data Engineering experience in distributed systems, managed 'big data' data pipelines and architecture

Certified ‘Tableau Desktop specialist’ built business intelligence reports in Tableau/ data visualization tools to track metrics

Strong analytical skills, knowledge of SQL querying for data analysis and coding in Python, SQL

Hands-on experience in Data Science project life cycle including Data Acquisition processes, Data Cleansing, Feature Engineering, implemented machine learning Models using Python Scikit - learn

Experience in Software Development Lifecycle (SDLC) requirements analysis, programming, testing with Agile and Waterfall SDLC

Highly motivated team player with excellent multitasking skills, time management, and Leadership Quality

Committed, proven track record of successfully delivering results fast-paced, ability to communicate analytical findings EDUCATION

Master’s Degree Information Technology Management, The University of Texas at Dallas, TX, GPA 3.48 May 2019 Courses: Statistics, Databases, Data Visualization, Business Data Warehousing, Big Data Analytics, Data Mining, Machine Learning Bachelor’s Degree Electronics and Communication Engineering, Visvesvaraya Technological University, India, GPA 3.80 May 2014 CERTIFICATIONS

Tableau Desktop Specialist, Dell EMC Associate Data Science Certificate, Scaled Agile Framework Practitioner (SAFE) TECHNICAL SKILLS

Data engineering team, built ETL pipelines/ big data applications to aid risk data management, analytics, reporting

Developed ETL processes in large scale data warehousing application on Hadoop ecosystem to collect, transform and standardize data from multiple sources for real-time and offline analytic processing and generate insights on data

Designed automated data ingestion process to collect data from REST APIs 33% reduction in manual effort, built custom code in Python and Spark for data transformation from unstructured JSON to structured format

Data Transformation and Integration: created data marts in Hive for data analysis and business intelligence reporting, built Sqoop jobs to extract data from relational databases to HDFS, created schema to load data into Hive tables

Data Modeling: Evaluated existing design, developed conceptual, physical, and logical data models for data warehouse/marts

Defined risk metrics benefits 700+ business users supporting critical business decisions, collaborate with stakeholders to scope requirements for Tableau Dashboards, managed datasets and built visualizations

Optimized Tableau dashboard performance, 4x faster rendering of visualizations by using accelerated view and data extracts

Data Governance: Initiated data quality improvement in ETL processes, wrote test cases QA rules to detect duplicate records, remediate data anomalies/data integrity issues, ensuring accurate data is available for stakeholders and business processes

Performance tuning and optimizing SQL queries to reduce table scans, created indexes, used CTE to replace slow sub-queries

Identified system integration challenges, developed optimal data processing architecture for new data and ETL pipelines, provided recommendations for improvements and effectively communicated strategy to team/leadership Technology/Tools: MySQL, Hadoop, Python, Linux, Spark, Tableau, Hive, Git, Visual Studio, Jira, Putty, WINSCP Software Engineer Intern, Data Science January 2019 - May 2019

Hoonuit, Minneapolis, MN

Data Science Team, built machine learning models in a data analytics product used to manage school district data

Research large data sets (~ 1M) student records, conducted exploratory data analysis, summarized descriptive statistics used Matplotlib, Seaborn visualizations to discover patterns, trends in data

Built a machine learning model with 81% accuracy to predict the risk of school dropouts, identified cost-effective model in terms of efficient memory and CPU times - Random Forest, Gaussian Naïve Bayes, and Support Vector Machines(SVM)

Wrote Python code, used NumPy, Pandas library for data cleansing and pre-processing, saving 15 hours of manual effort per week

Experience in Data preparation, cleansing, feature identification and prioritization using NumPy and Pandas packages, prepared analytical datasets, aggregated data from multiple data sources MySQL databases, API and CSV for student data analysis

Hands-on experience with Data Science project life cycle, Data Acquisition, Data Cleansing, Modeling, Evaluation (confusion matrix, ROC curve, RMSE, F-Score) Model Tuning, partner with data scientists to prototype predictive models and test models on cloud Technology/Tools: SQL (Structured Query Language), Python 3.x, Pandas, NumPy, Sci-Kit Learn, Machine Learning algorithms (Regression, Decision Trees, Clustering), Microsoft Excel(VLOOKUP, Pivot Tables), Matplotlib, Seaborn Visualizations, Predictive Modeling, SQL, Statistical Analysis,Azure ML studio,Data Factory

Data Analyst Intern May 2018 - July 2018

Virtual Tech Gurus, Dallas, TX

Data Analytics, generated insights on Cloud Infrastructure, service usage, provided insights to improve user engagement

Created data models, relational database tables from multi-data CSV reports(customer data) Flat file, XML using bulk insert utility, aggregated data, performed necessary transformations (ETL) supporting Business Intelligence

Wrote DDL, DML, DCL Relational Database queries, complex queries for data analysis and retrieval involving self-join, sub-queries, analytical/windows functions for data analysis and reporting

Developed interactive dashboard with cohort analysis report in Tableau with filters, calculated fields and parameters helped improve retention of customers by 17% for specific cloud data storage services offered by the organization

Understanding and knowledge of Tableau dashboard design, brainstormed requirements gathering sessions for dataset consolidation, prepared data reports to interpret customer behavior and provide recommendations on market opportunities

Analyzed disparate data structures; performed data blending, wrote SQL queries to extract data sources on Tableau Server

Strong understanding of advanced Tableau data visualization features including calculated fields, parameters, table calculations, row- level security, R integration, joins, data blending, and dashboard actions Technology/Tools: Relational Databases, Tableau Desktop, SQL Reports, Microsoft Excel Senior Systems Engineer - Financial services October 2014 - July 2017 Infosys Limited, Bangalore, India

IT Services consulting worked for a Fortune 500 Client in Banking and Finance Domain to build and maintain applications

Experience in Software Development Life cycle (SDLC) for banking/financial services including requirements analysis, business use case/user story preparation, coding, test script preparation and application maintenance with Agile scrum

Business Intelligence report quality: 15% improvement in data quality, investigated data modeling issues, presented root cause analysis for data mismatch, perform data accuracy checks, validity of data calculations, report drilldowns and data format

Led offshore product issue resolution, provide root causes analysis, code fix for 300+ critical tickets for USA customers

Adopt best practices in software quality assurance, achieved 98% quality compliant product delivery test design, analysis, validation of test results (Regression, System, UAT), and documentation for 3 major projects adopting Agile standards

Project Management: Facilitate weekly project meetings with product owners/project managers and other stakeholders communicate project progress, track milestones achieved milestones, document, report project status Technology/Tools: Toad Datapoint, Microsoft Visual Studio, Java/J2EE, Swagger Tool, Microsoft Excel, MS Visio, IBM Rational Team Concert (Project Management), A/B Testing, Protofluid Responsive Design Testing, Relational Databases PROJECTS

Real time fleet monitoring for a ride share company (Aws Lambda, Amazon Kinesis Firehose, S3, Amazon DynamoDB, Athena)

Built a serverless app data solution on cloud to process real-time data streams for a fictional ride-sharing company Market Data Analysis for Mobile Application (Python, NumPy, Pandas, Jupyter Notebook, Data Wrangling, S3, Microsoft Excel)

Data analysis for building mobile apps, helped understand what type of apps are likely to attract more users and generate revenue Streaming Pipelines with Kafka (Kafka, REST proxy, PostgreSQL, docker)

Constructed a streaming event pipeline in Apache Kafka using public data from Chicago Transit Authority Retail sales data Analysis (Aws QuickSights, S3, Microsoft Excel)

Analyzed retail sales created visualizations and stories, built business intelligence dashboards to generate insights LEADERSHIP EXPERIENCE AND ACTIVITIES

Women in Technology – Member, University of Texas at Dallas, AnitaBorg GraceHopper – Volunteer ADDITIONAL INFORMATION

Languages: Fluent in English, Telugu, Tamil, Hindi, Kannada; Interests: Volunteering for Tech Events, Analytics, Tech Podcasts



Contact this candidate