Post Job Free

Resume

Sign in

Data Engineer

Location:
Redmond, WA
Posted:
April 08, 2021

Contact this candidate

Resume:

JYOTI TURKHIYA

www.linkedin.com/in/jyotiturkhiya

Contact: 832-***-****

adlifg@r.postjobfree.com

PROFESSIONAL SUMMARY

• 5+ years of experience in Data and Software engineering field including requirements gathering, development, and testing.

• Hands on experience in building scalable end to end Data pipelines from structured, semi-structured & unstructured data.

• Proficient in Python, relational databases (Oracle, MySQL), NoSQL databases (MongoDB), and data warehouse (Hive).

• Excellent skills in ETL/ELT, Data wrangling, Big Data technologies, Data orchestration and Cloud technology fundamentals.

• Experience with collaborating with Business and IT partners to leverage Big data technologies for business analytics. PROFESSIONAL EXPERIENCE

CVS Health (TCS) - Woonsocket, RI. Feb. 2020 - Present Data Engineer (Technologies: Spark, Python, SQL, Azure Data Lake, Azure Databricks, Airflow, Jira, Rally)

• Performed ETL process for large sets of structured, semi-structured and unstructured data.

• Created and maintained visual representation of data pipeline for purpose of planning and building.

• Implemented centralized ETL to eliminate redundant work across teams resulting in simplified data wrangling process.

• Designed and implemented a scalable real-time data pipeline on semi-structured and structured data for positive reinforcement messaging via various programs regarding patients’ prescription refills.

• Implemented blast pipeline for prescription refill reminders with configurable and gradual reach out.

• Built data frame for data science team to forecast COVID-19 vaccine requirements.

• Designed and created a real time data pipeline for COVID-19 vaccine distribution by gathering data from external official websites and internal teams for all CVS clinics and stores.

• Implemented COVID-19 vaccines availability reporting module to generate daily summary for each stores and states. Tata Consultancy Services (TCS) - Edison, NJ. Nov. 2019 – Jan. 2020 Data Engineer (Technologies: Python, SQL, Selenium)

• Extracted data from web-pages and transformed into structured data to predict Apartment prices using supervised model.

• Implemented queries and stored procedures for CRUD operations on Apartment attributes.

• Performed data cleaning, analysis and classification on Loan campaign response dataset and generated reports.

• Explored several unsupervised learning models and analyzed outcomes for Loan campaign categories. AT&T (Amdocs) - Dallas, TX. July 2019 – Nov. 2019

Software Developer (Technologies: Oracle, Selenium)

• Automated test data and status report creation reducing suit execution time by 60% (saving 2 dev-hours daily).

• Created and executed test plans for feature launches and lead defect prioritization with Product and Development teams. University of Houston - Houston, TX. June 2018 – May 2019 Data Scientist, Research Assistant (Technologies: Python, Java)

• Built a recommendation system to automatically assign forecasting questions to participants of forecasting survey.

• Created NLP (Stemming and Tokenization process) based clusters and grouped participants based on categorical scores. Amdocs - Pune, India. July 2013 – Aug. 2017

Software Developer (Technologies: Java, Oracle, MySQL, Selenium, HP ALM/QC)

• Implemented case severity feature for CRM tool using core Java for backend and Swing for frontend.

• Developed web-based automation tool for smoke testing which reduced manual efforts by 66%.

• Created and executed test plans and test cases including smoke tests, regression tests, integration tests, and end to end tests.

• Created standard impact assessment documents to evaluate changes against requirements & create feature documentation.

• Performed feature gap analysis based on business requirements and established expectations for the end system state. TECHNICAL SKILLS

• Technologies: SQL, Python, Java, R, Spark, HTML5, CSS3, JavaScript

• Databases: Oracle, MySQL, MongoDB, Azure Data Lake, Hive

• Tools: PyCharm, Airflow, Azure Databricks, Tableau, GitHub, Jira, Anaconda, RStudio, Eclipse, HP ALM/QC

• Certifications: AZ-900 Azure Fundamentals (Microsoft), Tableau 2020 (Udemy), IBM Blue Scholar (DB2, RAD, IBM Project) EDUCATION

Master of Science, Computer Science (Specialization - Data Science) Aug. 2017 – May 2019 University of Houston

Bachelor of Engineering, Computer Science Engineering Aug. 2009 – June 2013 Rajiv Gandhi Technical University

ACADEMIC PROJECTS

Statistical Analysis on Microsurgical Studies (Technologies: R) Apr. 2018 - May 2018

• Performed exploratory data analysis on a microsurgery simulation data using various graphical and statistical methods.

• Applied statistical tests on the data to understand and formulate various hypotheses for it. Mining Frequent Item sets (Pattern Mining) (Technologies: Python) Jan. 2018 – Feb. 2018

• Implemented Apriori algorithm to find common product reviewers across Amazon reviews from 54k+ unique customers.



Contact this candidate