
Assistant Data Summer Intern

Location:
Queens, NY
Posted:
June 09, 2022


Resume:

Anbang Wang

626-***-**** ******@********.*** **-43 Dartmouth St., Forest Hills, NY 11375

Education

Columbia University, New York, NY. Expected Dec 2022

Master of Science in Data Science, GPA 3.42/4.0

Relevant Coursework: Probability and Statistics, Machine Learning, Algorithms in Data Science, Exploratory Data Analysis, Natural Language Processing

Rensselaer Polytechnic Institute, Troy, NY. May 2020

Bachelor of Computer Science, GPA 3.78/4.0

Dean’s Honor List

Relevant Coursework: Intro to Artificial Intelligence, Operating Systems, Database Systems, Network Programming, Programming Languages, Software Design and Documentation

Skills

C/C++, Java, Python3, Oz, Haskell, Google BigQuery, Microsoft SQL, MySQL, Oracle, Redis, MongoDB, Pandas, Scikit-learn, LaTeX, Eclipse, XHTML, Unity, WindowsBuilder (GUI design plugin), Excel, Software, QIIME 1.9.1

Experience

Lenovo (Beijing) Corporation, Ltd. Jun 2018 - Aug 2018

Data Analysis Specialist. Jun 2019 - Aug 2019

Collected and classified a large volume of customer feedback from Facebook using SQL.

Wrote a Python program that finds common words in chat logs, searches for synonyms, and performs sentence expansion to enlarge the phrase database.

Used Pandas, a Python data analysis package, to export a table containing billions of records from Google BigQuery and retrieve data from it.

Studied the meaning of customer input messages in chat logs by applying NLP (Natural Language Processing) techniques.

Started a new venture and built a simple Chinese word segmentation module in Python as a key step toward improving the ability of Moli (a virtual agent) to detect small talk.

Collected questions that Moli could not understand from the database and improved Moli’s comprehension by updating the intent identification database (Tools: Excel, MySQL).

Collaborated with AI teams to train the module with added noise, increasing the accuracy of learning outcomes and reducing overfitting.

Fixed program errors in the data preprocessing procedure (C++).

New York University School of Medicine. Jul 2017 - Aug 2017

Assistant Data Analysis Specialist

Analyzed sequencing data generated on Illumina and other platforms using QIIME, an open-source bioinformatics pipeline for performing microbiome analysis from raw DNA sequencing data through publication-quality graphics and statistics.

Gathered and organized samples using Freezerworks software and Excel.

Isolated DNA using a Qiagen kit and MO BIO’s PowerSoil kit.

Coordinated with all lab members to review data included in a publication submitted to the American Journal of Respiratory and Critical Care Medicine in January 2018 (Excel, Access).

Teamwork Experience in Software Design and Documentation Class

Led a team of five to build a random music generator that lets the user pick an instrument associated with an emotion for each section. After the user picks themes or instruments and their associated emotions, the generator plays a randomly generated song composed of one or more sections (Tools: Eclipse, Java, WindowsBuilder, JFugue).

Interests

Escape rooms, snowboarding, recreational basketball, and playing the bamboo flute.


