Liping Zhang
Shanghai, China ******
***********@*****.***
Objective: Software Development Engineer (Big Data related)
Skills
l 8+ years experience in Java programming language;
l 5+ years experience in Hadoop Big Data related ecosystem development;
l 4+ years experience in IBM InfoSphere BigInsights development and Agile development;
l 1.5 years experience in Cloudera and Intel Big Data product and solution development;
l Experience in Hadoop, YARN, HBase, ZooKeeper, Phoenix performance tuning,
troubleshooting and cluster management, open source framework, Github development
with Maven, Jenkins and Git technology;
l Experience in Spark MLlib(Decision Tree, Bayes, CF,etc.), Scala, Mahout, Search, Lucene,
Solr, Chinese segmentation, SAS, DataStage, Akka, and Kafka development;
l Assigned as Apache HBase Contributor and Apache Zookeeper Contributor;
l Familiar with Eclipse, LINUX, AWS, SHELL script development and NOSQL.
Professional Experience
9/2014 – Present
Software Engineer for Cloudera Company (Shanghai)
l Development work:
1) “China High-speed Railway(CHR) big data” project: It was mainly for designing
and implementing a big data analysis platform for China High Railway Failure data.
Demonstrated it in BJTU to Doug Cutting, Railway experts and Press.
Responsibility: Designed CHR big data solution architecture with Sqoop, YARN,
HDFS, Hive, Impala, Spark, built system, and implemented machine learning
analysis on railway failure data with CDH and Spark MLlib, used K-means
algorithm to analysis and forecast CHR failure.
2) “Hongkong Bank of Communications Co.” project: Integrated IBM DataStage with
CDH, migrated queries business to CDH.
Responsibility: Wrote source code and configuration files for the functionalities
according requirement, and optimized queries performance with Hive and Impala.
3) “CMCC” project: Designed high availabilities to CMCC big data platform, set up
security authorization and authentication for the system.
Responsibility: Wrote scripts and configuration files for high availabilities and
Kerberos on CDH with YARN, HDFS, HBase, Hive, Kerberos, Sentry, Hue, and
Navigator.
4) “NCI” project: Designed big data analysis platform for life insurance, included
bulk load, secondary index query, and near real time full text search.
Responsibility: Wrote source codes and scripts to implement Ad-hoc HBase
secondary index with HBase, Phoenix, built near real time full text indexes and
search with Flume, Lily HBase, solr, lucene, Cloudera Search, and Hue.
Tools: JDK1.6/1.7, Eclipse, Intellij Idea, Apache Maven, Git, Bash Shell, Java language
07/2013 – 09/2014
Software Engineer, Intel Asia Pacific R&D Center (Shanghai)
Ø Development for BigDataAnalysis Platform Project
Responsibility:
l Designed and implemented 17 big data solution tools for IDH (Intel Distributed
Hadoop) enterprise solutions, including Hadoop HBase ETL tools, Gzip splitting
and file combination tools, HBase bulk load and general load tools, HBase
secondary indexer and paging query tools with HBase Coprocessor, near real time
load tool with Kafka, distributed log collection tool with Akka etc;
l Optimized performance of tools, as bulk load, near real time load, query, etc, with
performance gain, won 4 enterprise projects;
l Implemented LZMA/ORC in Hadoop and HBase, both boosted higher compression
ratio and gained better decompression time.
Tools: JDK1.6, Eclipse, Intellij Idea, Apache Ant, Git, Bash Shell, Java language
03/2010 – 07/2013
Staff Software Engineer, IBM China Development Laboratory (CDL) (Shanghai)
Ø Development of IBM InfoSphere BigInsights Product
Responsibility:
l As a Core Initiator of IBM InfoSphere BigInsights Product, owned infrastructure
design and installation/integration development of 24 components, including hdm,
console, Security (e.g. FlatFile, LDAP authentication), Hadoop, HBase, ZooKeeper,
Jaqlserver, Jaql, bigsheets, Text-analytics, GPFS integration, etc;
l Debugged and fixed bugs in BigInsights Enterprise product, delivered 13 versions
BigInsights releases from beta release to BigInsights 2.1 release;
l Wrote source codes for BigInsights Hadoop/HBase/ZooKeeper upgrading and
enterprise features, as HBase full/incremental backup and restore, HA, etc.,
Backporting open source JIRA to IBM Hadoop 0.20.2, IBM HBase
0.90.4/0.90.5/0.92.0/0.94.0/0.94.3, IBM ZooKeeper 3.3.3/3.3.4/ 3.4.2/3.4.3/3.4.5.
Tools: JDK1.6, Eclipse, Maven, Ant, Git, Jenkins, Bash Shell, Java language
05/2008 – 03/2010
Software Engineer Intern, Microsoft China R&D Group, IBM (Beijing), NSN (Hangzhou)
Ø Internship Development work:
Responsibility:
l As Microsoft China R&D Group’s Postgraduate Campus Ambassador of Zhejiang
University;
l As IBM CDL Extreme Blue Program Software Engineer Intern, acceptance rate:
20/4000 in China, Developed "Cloud-enabled EHR" project EHR data visualization
tool-set widgets platform with cloud computing (Saas) technology as Dev Lead;
l As Nokia Software Engineer Intern, developed functions for JANINA platform.
Tools: JDK1.5, Eclipse, Java programming language, C programming language
Education
Ø Master of Computer Science and Technology from the College of Computer Science
and Technology, Zhejiang University, 2010, Recommended Postgraduate, Rank 1/286
l Team Lead of the two 863 projects. Developed CIM, PIM, PSM UML & SmartC
graphical modeling environment tools, MDA-based Word auto-generation tool,
SmartC editor, code/model au-to-generation based on Eclipse GEF, GMF.
Ø Bachelor of Computer Science and Technology from the College of Computer Science
and Technology, Hunan University, 2007, Rank 1/408
l Team lead, won "SIT" project competition 3rd Prize.
Certification
02/2015 Cloudera CCP: DS (DS-200, Fall 2014 Challenge)
12/2014 Cloudera CCAH, CCDH, CCSHB, Data Analyst
06/2014 PMP
12/2013 SCJP6 (Score: 100 / 100)
10/2011 IBM BigInsights Technical Professional
Awards
IBM “First Patent Award”, IBM IM 2nd Line Mgr “Best Innovation Award”
Zhejiang Province Excellent Master Graduate (1%)
"Kezhen ZHU scholarship" (0.05%, supreme honor award of ZJU)
Zhen JIANG scholarship for outstanding new graduates (0.2%)
Top 10 outstanding graduates of ZJU CCNT Lab & 1st grade scholarship of ZJU and HNU
China 1st college career plan competition champion of Hunan Province division (0.1%)
Hunan Province Excellent University Student Scholarship (0.2%)
Yuanyu ZHONG scholarship (0.1%) & National 2nd grade scholarship (1%)
Paper and Patent
l Published 4 EI indexed papers as 1st author: IEEE BigData 2014 Congress, IASP2009,
ICESS2009, JOURNAL OF ZHEJIANG UNIVERSITY ENGINEERING SCIENCE;
l Filed 3 granted patent as 1st author: 200810061925.1, CN920120163US1, P73299PCT
(the last one is in the legal processing);
l Registered 3 Computer Software Copyright as 1st author: 2008SR06691, 2008SR21765,
2009SR049616; published 1 US disclosure: IPCOM000228209D
References
Available on request.