Post Job Free

Resume

Sign in

Data Science Scientist

Location:
Harrow, Greater London, United Kingdom
Posted:
April 18, 2024

Contact this candidate

Resume:

Henry Leung Yee Hin, ASA, MSc

Email: ad43kb@r.postjobfree.com; Mobile1: +65-86477419 (only when I am in SG); Mobile2: +44-740******* (UK); WhatsApp: +65-864****** / +44-740*******; CAREER SUMMARY

■ Career evolvements were driven by my ambition to take more sophisticated role.

■ Skills (maths, computer, finance) were proven to be portable across different job functions.

■ Path: Sales Data Operations Investment / Risk Quant & Data Science. SKILLS HIGHLIGHTS

Domain: Data Science, Sports/Betting, Risk Management, Quantitative Finance, Actuarial Science/Insurance. Technical: Deep Learning, Feature Engineering, Quantitative Methods, Cryptography, Unsupervised Learning. Management: Team Management, Project Management, Project Framing & Pitching. PROFESSIONAL AFFILIATIONS

Society of Actuaries (SOA)

■ Associate of the Society of Actuaries (ASA) Aug 2010 attained CFA Institute

■ Level 2 passed. Aug 2010 attained

LANGUAGES

English: Excellent; Cantonese: Native; Mandarin: Excellent Japanese: JLPT – N4 passed Jul 2023 attained

COLLEGE EDUCATION

The Hong Kong Baptist University

MSc in Applied Accounting and Finance. Sept 2009 – Apr 2011 The Hong Kong University of Science & Technology

BEng in Computer Engineering. Sept 2000 – Aug 2003 WORK EXPERIENCE

GreekIsGood Pte. Ltd.

Founder Jun 2022 –

Purposes, Tasks

■ Implement the Data Science & Quant ideas that are too controversial for other companies.

■ Leverage the developed models to manage my own investment portfolio.

■ Create an environment for me to work on after immigration, to fill the gap before receiving an offer.

■ Storage the IPs, and as a model/toy/art display cabinet. Great Eastern Life, Singapore

Data & Strategic Transformation – VP Principal Data Scientist Nov 2021 – Jun 2022 Summary

Assisted the company in the journey of migrating from on-premise to Cloud (AWS). A capstone project was initiated as the 1st attempt in the Cloud environment, the project’s needs have been the major driver of the migration plan.

- DS fundamentals: planned / created relevant tools in Cloud for the project as well as for future concerns.

- Architecture: suggested what the Cloud architectural components should be included.

- Connectivity: defined APIs and sync-up relevant parties for production.

- Projects review: reviewed existing projects to prepare migration in the future. Tokopedia, Singapore

Senior Principal Data Scientist Sep 2019 – Nov 2021 Lead Tasks

■ Framed projects for various teams, thus far the Recom, Risk, and Promo teams.

■ Led DSs to work on the projects and R&D tasks.

■ Led Risk Management personnel and DSs to transform to quantitative risk. Team Based Projects

■ Recom Team: Product Category Clustering.

■ Risk Team: Suspicious Shops Detection, Transaction Fraud Detection, Quantitative Loss Evaluation.

■ Promo Team: Merchant Voucher Optimisation.

Tools

■ Language: Python, Shell Script

■ Cloud Platforms: GCP: DataFlow, BigQuery, AI Platform, AWS.

■ Deep Learning: Tensorflow, Keras; Distributed: Apache Beam; DB: PostgreSQL. APJ Innovation Center, Hewlett Packard Enterprise, Singapore Data Scientist Lead / Manager Oct 2017 – Aug 2019

Senior Data Scientist Aug 2016 – Sept 2017

Lead Tasks

■ Managed team resources, headcounts, etc.

■ Delegated tasks and duties, coached members.

■ Organized team activities, e.g., standups, knowledge sharing, events, etc.

■ Appraised team members’ performances.

Development Tasks

■ Performed data ETL and sampling for text analysis projects

- Techniques: API calls, cron-job, error detection.

- Tools: Python, MySQL, Shell Script.

■ Built a sentiment measuring method on top of various NLP tools

- Techniques: linear regression.

- Tools: CoreNLP, TextBlob, NLTK, Python, Java, Shell Script.

■ Invented a sub-topic detection algorithm from a larger topic.

- Techniques: PCA, k-means clustering, path searching.

- Tools: Python, Spark.

- *Please check the Appendix at the end of the document.

■ Developed a filtering algorithm to filter out trash from the topic detection results

- Techniques: feature extraction, logistic regression.

- Tools: Python.

■ Rationalized highly unstructured, incomplete, obsolete, and various encodings data (Ongoing)

- Tools: Jupyter, Python, R.

■ Developed a system for a certain type of transportation optimization

- Techniques: convolution, linear regression, simulation.

- Tools: Tensorflow, Jupyter, Python, R.

■ Implemented an CNN model for predictive maintenance for a telco

- Techniques: CNN, ARIMA, EMA, STL Decomposition, PCA, etc.

- Tools: Tensorflow, Python, Tableau.

■ Invented an anomaly detection algorithm for a manufacturing solution startup

- Techniques: Fourier transform, probability normalization.

- Tools: Python.

Business Tasks

■ Crafted out business proposals, tenders, etc.

■ Evaluated timeline and resource.

■ Negotiated terms and conditions with clients.

■ Sought for projects funding.

■ Hosted workshops to prepare clients for PoC / projects engagements.

■ Visited clients for sales meetings.

Research Tasks

■ Clustering by applying the Demand & Supply concept

- Techniques: sentence similarity measure, PCA, demand & supply rules, logistic regression, greedy tree.

■ Theory and implementation of root cause analysis under different machine learning frameworks.

- Techniques: backward propagation and various mathematical tools.

■ Text files processing using unsupervised deep learning.

- Techniques: RNN, clustering.

Nugit, Singapore

Data Scientist Jul 2015 – May 2016

TAS II, Hong Kong

Researcher - Modeling Team Jul 2014 – Jul 2015

Summary

Employ StatArb (statistical arbitrage) strategy to bet on horse racing. The strategy is composed of 3 main functional areas:

- Data: Use raw, transformed, and derived data to quantify race information.

- Model: Train our models to reach the best estimates of parameters.

- Backtest: Generate probabilities based on the trained parameters and mimic real bets. The modelling team was formed by PhD, MPhil, and professional designation holders like me, from background of Stats/Maths/A.I./ActSci. We are responsible for model development, as well as to initiate the 2 other areas’ works. We know the input and backtesting requirements well, thus we are also highly involved in prototype development of factors (data) and betting strategies (backtest).

Tasks

■ Developed and modified machine learning models.

- Techniques: collaborative filtering, gradient descent, neural networks, boosting and bagging, PCA, etc.

- Tools: R, C++, SQL.

■ Built a platform that digests and process factor codes/classes created by various programmers.

- Techniques: parallel computing, OOP in Python.

- Tools: Python.

■ Created factors as inputs for the machine learning models.

- Techniques: clustering, Dijkstra's algorithm, clustering, t-test, Chi-square test, correlation test, etc.

- Tools: SAS, R, Python, Matlab, C++, SQL.

■ Devised investment performance analysis and prediction tools.

- Techniques: truncate distribution, convolution, goodness-of-fit test, etc.

- Tools: R, Matlab.

■ Automated backtesting and coped with the constraints (amount, advantage threshold, etc).

- Techniques: brute-force approach.

- Tools: shell script, Java, R, VBA.

■ Performed scenario analysis for the cases in-doubt.

- Techniques: mainly matrix manipulation.

- Tools: R.

Wing Hang Bank, Hong Kong

Middle Office Senior Analyst May 2013 – Jul 2014

■ Developed an in-house R program to calculate equity positions’ Delta Gamma VaR.

■ Re-engineered and validated the the VaR of FX and IR treasury positions (Misys).

■ Validated pricing models for bonds, bonds with options, equity derivatives, etc.

■ Configured treasury system (Misys) to fit the needs of front and back office.

■ Automated data collection and distribution procedures. Hong Kong Zhongcai Finance Investment, Hong Kong

Equity Research Analyst Apr 2011 – Jul 2012.

■ Developed a market trend quantifying model.

■ Developed a portfolio optimization model to mimic the market.

■ Developed an A&H statistical arbitrage strategy to enhance portfolio performance.

■ Traveled to the US for one month to study the trend of the flooring industry. CAI Business Indepth, Hong Kong

Assistant Data Analyst Mar 2010 – Nov 2010

■ Automated information ETL (extract, transform, load) process.

■ Developed analytic software for equities, commodities, debts, etc.

■ Ensured data integrity and enhanced quality of data. Banks and Financial Institutions, Hong Kong

Relationship Manager / Training Officer May 2005 – Feb 2010 Sirius International (HK) Ltd.

Software Engineer Oct 2003 – Apr 2005

■ Designed and implemented Display Content Management and Inventory Control Systems. COMPETITIONS

Dextra (by invitation, private)

Reversible Data Protection Challenge Jan 2016 - Mar 2016

- One of the Top 5 teams, I joined as a single person team.



Contact this candidate