Post Job Free

Resume

Sign in

Data Analysis

Location:
Los Angeles, CA
Posted:
November 21, 2017

Contact this candidate

Resume:

Y IFAN&WEI&

****$Ellendale$Pl,$Los$Angeles$CA****7$

Cell:$213*********$ $Email:$ac3fjp@r.postjobfree.com LinkedIn:$www.linkedin.com/in/yifan8wei$$

$

S UMMARY&&

Data$scientist$with$3+$years$hands8on$analytics/programming$experience.$USC$graduate$with$solid$academic$record$ and$excellent$sense$of$teamwork$and$critical8thinking.$Served$as$team$lead$several$times$with$proven$record$of$high$ quality$delivery.$Looking$for$data$scientist/analyst$opportunity

! Programming Python,$Java,$SAS,$R,$SQL,$MATLAB$

! Tools/Platforms:$$Excel/Office,$Linux,$OSX,$Windows$

! Skills Statistic,$Mathematic$modeling,$Machine$learning,$Analytics$tools$

&

E DUCATION&&

University&of&Southern&California,$Los$Angeles,$CA Feb.$2016$–$August$2017$$ M.S.$in$Applied$Mathematics

• Relevant(Courses:(Analysis(of(Algorithms,(Data(Analysis,(Machine(Learning,(Applied(Probabilities,(Complex( Analysis,(Methods(of(Applied(Mathematics,(Numerical(Analysis(and(Computation( South&China&Normal&University, Guangzhou,$China Sept.$2010–June$2014& B.S.$in$Applied$Mathematics$(Dean's$Honor$List GPA$3.6/4.0$

• Relevant(Courses:(Mathematic(analysis,(Applied(regression(analysis,(Stochastic(process,(Discrete(mathematics,( Applied(time(series,(Multiple(regression(analysis,(Logistic(analysis,(Survival(analysis

$

E XPERIENCE&

Shenzhen&LanDYou&Technology,$Guangzhou,$China Data&Scientist&(Intern July&2016&–&August&2016&

• Tools$used:$$SQL,$Python,$Machine$learning$

• Worked$in$multiple$areas$(data$analysis$and$application,$automotive$IT$and$smart$cars$business$support)$

• Developed$software$that$could$automatically$classify$the$customer$reviews$into$negative$or$positive$categories$

• Wrote$Unit$Test$for$the$application,$increased$test$coverage$from$<50%$to$~80%,$maintained$and$improved$the$ existing$application$to$meet$new$business$demands$

• Apply$Bayesian&Theorem$to$do$word$segmentation$for$each$customer$review/evaluation$

• Built$positive$and$negative$dictionaries$by$analyzing$and$mining$big$data$

• Completed$the$project$four$days$ahead$of$deadline,$with$~83%$correctness$ratio$

&

P ROJECTS &

Stock&Forecast&based&on&Machine&Learning&(Team$Lead Jan&2017&–&May&2017&

• Applied$program$to$forecast$the$stock$trend$using$four$machine$learning$algorithms:$decision&tree,&logistic& regression,&neural&network$and$support&vector&machine&

• Sampled$2000$listed$stocks$in$2016$US$stock$market,$selected$nice$indicators$for$each$stock$to$do$the$classification$

• Applied$training$and$testing$with$85%$and$15%$of$sample$data$correspondingly,$developed$forecast$with$Python$

• Evaluated$the$four$methods,$and$SVM$won$with$~76%$accuracy$ Linear&Regression&Analysis&for&Disease&Datasets Oct.&2016–Dec.&2016$

• Preprocessed$three$coronary$artery$disease$datasets$(each$has$3500$records$with$11$attributes)$from$research$ centers$in$California.$Completed$data$verification,$P&value$calculation,$variance&analysis,&TDtest$etc.$$

• Built$a$linear$regression$model,$cleaned$noise$points$and$evaluated$whether$to$add$spline$or$higher&order$term$in$ the$model,$$applied$model&selection$method$from$SAS$

• Performed$the$analysis&of&variance&and&covariance,$checked$confounder$possibility$in$the$model$ Data&Structure&and&Algorithm&Design$(Individual August&2016–Dec.&2016$

• Applied$greedy$algorithm,$dynamic$programming,$divide$and$conquer$algorithm$to$solve$the$sorting$problem,$$ solved$the$maze$and$other$classic$programming$problems$

• Used$and$compared$core$data$structure:$HashMap,&HashTable,&LinkedList,&ArrayList,&stack&in$Java$and$Python$ Numerical&Analysis&Course&Experiments$(Individual May2016–June&2016$

• Applied$numerical$approximations$to$differential$equations$by$the$Finite&Difference&Method$(e.g.$use$Poisson$ matrix$to$approximate$the$second$derivative$

• Utilized$the$GaussDSeidel$and$Chasing&Method$to$solve$linear$equations,$developed$the$program$in$MATLAB$



Contact this candidate