EXPERIENCE
Data Engineer (contractor) Banking and Financial Services September 2019 - Present
Bank of America, Dallas, TX
Programming Languages:
Databases:
BI and Reporting Tools:
Big Data Analytics:
Cloud Technology:
SQL, Python (NumPy, Pandas, Scikit - Learn), R (caret, Dplyr), Java, Linux/Unix RDBMS - MS SQL Server, MySQL, Oracle, PostGreSQL, Cassandra
Tableau Desktop, SQL Server, Microsoft Excel
Hadoop, HDFS, Map Reduce, Hive, Impala, Hue, NoSQL - HBase, Spark, Basics Scala AWS(EC2, EMR, S3, Redshift, QuickSights, Athena), Azure ML Studio,Basics Azure data Factory THANUJHAA SRIEE AMMANARUL DANASEKARAN
*********.*****@*****.*** 214-***-**** www.linkedin.com/in/adts Dallas, TX Open to Relocate Tableau: https://public.tableau.com/profile/adts Github: https://github.com/ThSrAd SUMMARY
5 years of professional experience in engineering and data analysis, proficient in ETL, developing data warehouses
Data Engineering experience in distributed systems, managed 'big data' data pipelines and architecture
Certified ‘Tableau Desktop specialist’ built business intelligence reports in Tableau/ data visualization tools to track metrics
Strong analytical skills, knowledge of SQL querying for data analysis and coding in Python, SQL
Hands-on experience in Data Science project life cycle including Data Acquisition processes, Data Cleansing, Feature Engineering, implemented machine learning Models using Python Scikit - learn
Experience in Software Development Lifecycle (SDLC) requirements analysis, programming, testing with Agile and Waterfall SDLC
Highly motivated team player with excellent multitasking skills, time management, and Leadership Quality
Committed, proven track record of successfully delivering results fast-paced, ability to communicate analytical findings EDUCATION
Master’s Degree Information Technology Management, The University of Texas at Dallas, TX, GPA 3.48 May 2019 Courses: Statistics, Databases, Data Visualization, Business Data Warehousing, Big Data Analytics, Data Mining, Machine Learning Bachelor’s Degree Electronics and Communication Engineering, Visvesvaraya Technological University, India, GPA 3.80 May 2014 CERTIFICATIONS
Tableau Desktop Specialist, Dell EMC Associate Data Science Certificate, Scaled Agile Framework Practitioner (SAFE) TECHNICAL SKILLS
Data engineering team, built ETL pipelines/ big data applications to aid risk data management, analytics, reporting
Developed ETL processes in large scale data warehousing application on Hadoop ecosystem to collect, transform and standardize data from multiple sources for real-time and offline analytic processing and generate insights on data
Designed automated data ingestion process to collect data from REST APIs 33% reduction in manual effort, built custom code in Python and Spark for data transformation from unstructured JSON to structured format
Data Transformation and Integration: created data marts in Hive for data analysis and business intelligence reporting, built Sqoop jobs to extract data from relational databases to HDFS, created schema to load data into Hive tables
Data Modeling: Evaluated existing design, developed conceptual, physical, and logical data models for data warehouse/marts
Defined risk metrics benefits 700+ business users supporting critical business decisions, collaborate with stakeholders to scope requirements for Tableau Dashboards, managed datasets and built visualizations
Optimized Tableau dashboard performance, 4x faster rendering of visualizations by using accelerated view and data extracts
Data Governance: Initiated data quality improvement in ETL processes, wrote test cases QA rules to detect duplicate records, remediate data anomalies/data integrity issues, ensuring accurate data is available for stakeholders and business processes
Performance tuning and optimizing SQL queries to reduce table scans, created indexes, used CTE to replace slow sub-queries
Identified system integration challenges, developed optimal data processing architecture for new data and ETL pipelines, provided recommendations for improvements and effectively communicated strategy to team/leadership Technology/Tools: MySQL, Hadoop, Python, Linux, Spark, Tableau, Hive, Git, Visual Studio, Jira, Putty, WINSCP Software Engineer Intern, Data Science January 2019 - May 2019
Hoonuit, Minneapolis, MN
Data Science Team, built machine learning models in a data analytics product used to manage school district data
Research large data sets (~ 1M) student records, conducted exploratory data analysis, summarized descriptive statistics used Matplotlib, Seaborn visualizations to discover patterns, trends in data
Built a machine learning model with 81% accuracy to predict the risk of school dropouts, identified cost-effective model in terms of efficient memory and CPU times - Random Forest, Gaussian Naïve Bayes, and Support Vector Machines(SVM)
Wrote Python code, used NumPy, Pandas library for data cleansing and pre-processing, saving 15 hours of manual effort per week
Experience in Data preparation, cleansing, feature identification and prioritization using NumPy and Pandas packages, prepared analytical datasets, aggregated data from multiple data sources MySQL databases, API and CSV for student data analysis
Hands-on experience with Data Science project life cycle, Data Acquisition, Data Cleansing, Modeling, Evaluation (confusion matrix, ROC curve, RMSE, F-Score) Model Tuning, partner with data scientists to prototype predictive models and test models on cloud Technology/Tools: SQL (Structured Query Language), Python 3.x, Pandas, NumPy, Sci-Kit Learn, Machine Learning algorithms (Regression, Decision Trees, Clustering), Microsoft Excel(VLOOKUP, Pivot Tables), Matplotlib, Seaborn Visualizations, Predictive Modeling, SQL, Statistical Analysis,Azure ML studio,Data Factory
Data Analyst Intern May 2018 - July 2018
Virtual Tech Gurus, Dallas, TX
Data Analytics, generated insights on Cloud Infrastructure, service usage, provided insights to improve user engagement
Created data models, relational database tables from multi-data CSV reports(customer data) Flat file, XML using bulk insert utility, aggregated data, performed necessary transformations (ETL) supporting Business Intelligence
Wrote DDL, DML, DCL Relational Database queries, complex queries for data analysis and retrieval involving self-join, sub-queries, analytical/windows functions for data analysis and reporting
Developed interactive dashboard with cohort analysis report in Tableau with filters, calculated fields and parameters helped improve retention of customers by 17% for specific cloud data storage services offered by the organization
Understanding and knowledge of Tableau dashboard design, brainstormed requirements gathering sessions for dataset consolidation, prepared data reports to interpret customer behavior and provide recommendations on market opportunities
Analyzed disparate data structures; performed data blending, wrote SQL queries to extract data sources on Tableau Server
Strong understanding of advanced Tableau data visualization features including calculated fields, parameters, table calculations, row- level security, R integration, joins, data blending, and dashboard actions Technology/Tools: Relational Databases, Tableau Desktop, SQL Reports, Microsoft Excel Senior Systems Engineer - Financial services October 2014 - July 2017 Infosys Limited, Bangalore, India
IT Services consulting worked for a Fortune 500 Client in Banking and Finance Domain to build and maintain applications
Experience in Software Development Life cycle (SDLC) for banking/financial services including requirements analysis, business use case/user story preparation, coding, test script preparation and application maintenance with Agile scrum
Business Intelligence report quality: 15% improvement in data quality, investigated data modeling issues, presented root cause analysis for data mismatch, perform data accuracy checks, validity of data calculations, report drilldowns and data format
Led offshore product issue resolution, provide root causes analysis, code fix for 300+ critical tickets for USA customers
Adopt best practices in software quality assurance, achieved 98% quality compliant product delivery test design, analysis, validation of test results (Regression, System, UAT), and documentation for 3 major projects adopting Agile standards
Project Management: Facilitate weekly project meetings with product owners/project managers and other stakeholders communicate project progress, track milestones achieved milestones, document, report project status Technology/Tools: Toad Datapoint, Microsoft Visual Studio, Java/J2EE, Swagger Tool, Microsoft Excel, MS Visio, IBM Rational Team Concert (Project Management), A/B Testing, Protofluid Responsive Design Testing, Relational Databases PROJECTS
Real time fleet monitoring for a ride share company (Aws Lambda, Amazon Kinesis Firehose, S3, Amazon DynamoDB, Athena)
Built a serverless app data solution on cloud to process real-time data streams for a fictional ride-sharing company Market Data Analysis for Mobile Application (Python, NumPy, Pandas, Jupyter Notebook, Data Wrangling, S3, Microsoft Excel)
Data analysis for building mobile apps, helped understand what type of apps are likely to attract more users and generate revenue Streaming Pipelines with Kafka (Kafka, REST proxy, PostgreSQL, docker)
Constructed a streaming event pipeline in Apache Kafka using public data from Chicago Transit Authority Retail sales data Analysis (Aws QuickSights, S3, Microsoft Excel)
Analyzed retail sales created visualizations and stories, built business intelligence dashboards to generate insights LEADERSHIP EXPERIENCE AND ACTIVITIES
Women in Technology – Member, University of Texas at Dallas, AnitaBorg GraceHopper – Volunteer ADDITIONAL INFORMATION
Languages: Fluent in English, Telugu, Tamil, Hindi, Kannada; Interests: Volunteering for Tech Events, Analytics, Tech Podcasts