Post Job Free

Resume

Sign in

Engineer Software

Location:
Shanghai, China
Posted:
March 03, 2015

Contact this candidate

Resume:

Liping Zhang

Shanghai, China ******

+86-150********

acoj96@r.postjobfree.com

Objective: Software Development Engineer (Big Data related)

Skills

l 8+ years experience in Java programming language;

l 5+ years experience in Hadoop Big Data related ecosystem development;

l 4+ years experience in IBM InfoSphere BigInsights development and Agile development;

l 1.5 years experience in Cloudera and Intel Big Data product and solution development;

l Experience in Hadoop, YARN, HBase, ZooKeeper, Phoenix performance tuning,

troubleshooting and cluster management, open source framework, Github development

with Maven, Jenkins and Git technology;

l Experience in Spark MLlib(Decision Tree, Bayes, CF,etc.), Scala, Mahout, Search, Lucene,

Solr, Chinese segmentation, SAS, DataStage, Akka, and Kafka development;

l Assigned as Apache HBase Contributor and Apache Zookeeper Contributor;

l Familiar with Eclipse, LINUX, AWS, SHELL script development and NOSQL.

Professional Experience

9/2014 – Present

Software Engineer for Cloudera Company (Shanghai)

l Development work:

1) “China High-speed Railway(CHR) big data” project: It was mainly for designing

and implementing a big data analysis platform for China High Railway Failure data.

Demonstrated it in BJTU to Doug Cutting, Railway experts and Press.

Responsibility: Designed CHR big data solution architecture with Sqoop, YARN,

HDFS, Hive, Impala, Spark, built system, and implemented machine learning

analysis on railway failure data with CDH and Spark MLlib, used K-means

algorithm to analysis and forecast CHR failure.

2) “Hongkong Bank of Communications Co.” project: Integrated IBM DataStage with

CDH, migrated queries business to CDH.

Responsibility: Wrote source code and configuration files for the functionalities

according requirement, and optimized queries performance with Hive and Impala.

3) “CMCC” project: Designed high availabilities to CMCC big data platform, set up

security authorization and authentication for the system.

Responsibility: Wrote scripts and configuration files for high availabilities and

Kerberos on CDH with YARN, HDFS, HBase, Hive, Kerberos, Sentry, Hue, and

Navigator.

4) “NCI” project: Designed big data analysis platform for life insurance, included

bulk load, secondary index query, and near real time full text search.

Responsibility: Wrote source codes and scripts to implement Ad-hoc HBase

secondary index with HBase, Phoenix, built near real time full text indexes and

search with Flume, Lily HBase, solr, lucene, Cloudera Search, and Hue.

Tools: JDK1.6/1.7, Eclipse, Intellij Idea, Apache Maven, Git, Bash Shell, Java language

07/2013 – 09/2014

Software Engineer, Intel Asia Pacific R&D Center (Shanghai)

Ø Development for BigDataAnalysis Platform Project

Responsibility:

l Designed and implemented 17 big data solution tools for IDH (Intel Distributed

Hadoop) enterprise solutions, including Hadoop HBase ETL tools, Gzip splitting

and file combination tools, HBase bulk load and general load tools, HBase

secondary indexer and paging query tools with HBase Coprocessor, near real time

load tool with Kafka, distributed log collection tool with Akka etc;

l Optimized performance of tools, as bulk load, near real time load, query, etc, with

performance gain, won 4 enterprise projects;

l Implemented LZMA/ORC in Hadoop and HBase, both boosted higher compression

ratio and gained better decompression time.

Tools: JDK1.6, Eclipse, Intellij Idea, Apache Ant, Git, Bash Shell, Java language

03/2010 – 07/2013

Staff Software Engineer, IBM China Development Laboratory (CDL) (Shanghai)

Ø Development of IBM InfoSphere BigInsights Product

Responsibility:

l As a Core Initiator of IBM InfoSphere BigInsights Product, owned infrastructure

design and installation/integration development of 24 components, including hdm,

console, Security (e.g. FlatFile, LDAP authentication), Hadoop, HBase, ZooKeeper,

Jaqlserver, Jaql, bigsheets, Text-analytics, GPFS integration, etc;

l Debugged and fixed bugs in BigInsights Enterprise product, delivered 13 versions

BigInsights releases from beta release to BigInsights 2.1 release;

l Wrote source codes for BigInsights Hadoop/HBase/ZooKeeper upgrading and

enterprise features, as HBase full/incremental backup and restore, HA, etc.,

Backporting open source JIRA to IBM Hadoop 0.20.2, IBM HBase

0.90.4/0.90.5/0.92.0/0.94.0/0.94.3, IBM ZooKeeper 3.3.3/3.3.4/ 3.4.2/3.4.3/3.4.5.

Tools: JDK1.6, Eclipse, Maven, Ant, Git, Jenkins, Bash Shell, Java language

05/2008 – 03/2010

Software Engineer Intern, Microsoft China R&D Group, IBM (Beijing), NSN (Hangzhou)

Ø Internship Development work:

Responsibility:

l As Microsoft China R&D Group’s Postgraduate Campus Ambassador of Zhejiang

University;

l As IBM CDL Extreme Blue Program Software Engineer Intern, acceptance rate:

20/4000 in China, Developed "Cloud-enabled EHR" project EHR data visualization

tool-set widgets platform with cloud computing (Saas) technology as Dev Lead;

l As Nokia Software Engineer Intern, developed functions for JANINA platform.

Tools: JDK1.5, Eclipse, Java programming language, C programming language

Education

Ø Master of Computer Science and Technology from the College of Computer Science

and Technology, Zhejiang University, 2010, Recommended Postgraduate, Rank 1/286

l Team Lead of the two 863 projects. Developed CIM, PIM, PSM UML & SmartC

graphical modeling environment tools, MDA-based Word auto-generation tool,

SmartC editor, code/model au-to-generation based on Eclipse GEF, GMF.

Ø Bachelor of Computer Science and Technology from the College of Computer Science

and Technology, Hunan University, 2007, Rank 1/408

l Team lead, won "SIT" project competition 3rd Prize.

Certification

02/2015 Cloudera CCP: DS (DS-200, Fall 2014 Challenge)

12/2014 Cloudera CCAH, CCDH, CCSHB, Data Analyst

06/2014 PMP

12/2013 SCJP6 (Score: 100 / 100)

10/2011 IBM BigInsights Technical Professional

Awards

IBM “First Patent Award”, IBM IM 2nd Line Mgr “Best Innovation Award”

Zhejiang Province Excellent Master Graduate (1%)

"Kezhen ZHU scholarship" (0.05%, supreme honor award of ZJU)

Zhen JIANG scholarship for outstanding new graduates (0.2%)

Top 10 outstanding graduates of ZJU CCNT Lab & 1st grade scholarship of ZJU and HNU

China 1st college career plan competition champion of Hunan Province division (0.1%)

Hunan Province Excellent University Student Scholarship (0.2%)

Yuanyu ZHONG scholarship (0.1%) & National 2nd grade scholarship (1%)

Paper and Patent

l Published 4 EI indexed papers as 1st author: IEEE BigData 2014 Congress, IASP2009,

ICESS2009, JOURNAL OF ZHEJIANG UNIVERSITY ENGINEERING SCIENCE;

l Filed 3 granted patent as 1st author: 200810061925.1, CN920120163US1, P73299PCT

(the last one is in the legal processing);

l Registered 3 Computer Software Copyright as 1st author: 2008SR06691, 2008SR21765,

2009SR049616; published 1 US disclosure: IPCOM000228209D

References

Available on request.



Contact this candidate