Sign in

Big Data Engineer

Beyoglu, Istanbul, Turkey
August 13, 2018

Contact this candidate




*+ years of project experience in Big Data Ecosystem and Java, Scala development

• 12+ years of Software Engineering, System Engineering and Analysis experience

• 5+ years of extensive IT experience in all phases of Business Process Management Systems, Document Management Systems, Core Banking Frameworks (coding, architecting, designing)

• Project experience in Python for OpenStack Cloud System development

• Clear understanding of Scrum (Certified Scrum Master), Software Development Life Cycles (SDLC), Requirements Analysis.

Big Data Hadoop, HDFS, HBase, Hive, Impala, Kafka, Kafka Connect, Spark, ElasticSearch, Zookeeper, Sqoop, Cloudera(CDH), Hortonworks, Zeppelin, Nutch, SolrCloud, Docker, Impala, Hive

PLs & Frameworks Java, Python, Scala, C#, WCF, IIS, Remoting, Nhibernate, MQ, Hibernate Databases Oracle, MS SQL, MySQL, HBase, MongoDB, Impala, Hive Configuration Tools Git, Jira, Bitbucket, Maven, Doors, Enterprise Architect, SVN, Confluence, TFS Experience

Big Data Analytics & AI

Veloxity – Istanbul, Turkey 03-2017 –

Senior Big Data Engineer &

Team Lead

Ü Responsible for the design, management and maintenance of Big data platform and storage technologies. (including installation, troubleshooting, performance tuning, configuring, upgrading and linux bash scripts)

Ü Developed Real-time data pipelines with Kafka, Kafka Connect, Kafka & Spark Streaming and Sqoop Ü Developed Real-time & batch Spark jobs for data analytics (Scala/Java) Ü Developed various insights from raw data about customer behaviors Ü Designed and implemented dynamic customer profiling system for right customer targeting Ü Designed and implementing location base customer analytics (imported whosonfirst data to elasticsearch and implemented spark jobs for analytics & profiles) Ü Designed and implementing system for matching customers’ locations and Point of Interests matching (imported openstreetmap data to elasticsearch and implemented spark jobs for analytics

& profiles)

Ü Developed and dockerized Java/Spring backend API services for serving analytics to customers Ü Used Elastic Search and Hbase NoSql databases for high performance search applications Ü Used Ignite for scalable caching, Zepplin for notebook Ü Used Machine Learning techniques to extract customer behavioral segments Ü Provided technical leadership and mentoring to team members Big Data Platform Tech. Stack: Cloudera, Hadoop, Hdfs, Yarn, Hive, Impala, Spark, SparkSQL, Zepplin, Kafka, Kafka Connect, Sqoop, Flume, Elastisearch, HBase (NoSql), Ignite, Docker Canan Girgin

Senior Big Data Engineer

Eksioglu Mah. Sehitler cad. No : 2/G 34794 Cekmekoy, Istanbul, Turkiye, +90-555-*******

Canan Girgin / RESUME Page 2

Cloud Computing and Big Data Research Lab (B3lab)

TUBITAK – Gebze, Kocaeli, Turkey 12.2010 – 09.2016 Damla: National Search Engine Project

Big Data Engineer &

Technical Team Lead & Scrum Master

Ü Responsible for the research and development of Big Data processing and storage technologies. Ü Developed various MapReduce jobs. (MapReduce, Java, HDFS, HBase) Ü Prepared architectural design documents and detailed design documents. Ü Planned, implemented, and maintained 20 node high performance Hadoop Cluster and 8 node SolrCloud cluster (Centos 6, crawling performance: 40.000-60.000 url/sec. through-put.) Ü Crawled and stored 85 million webpages, 600 million seeds, 200 TB HDFS size (Nutch, HBase) Ü Indexed web pages for faster search with SolrCloud Ü Developed Language identifier (for enabling to focus only on Turkish contents) MapReduce job Ü Developed MapReduce Job for classification of crawled webPages. Some classification categories are: sport, news, adult content, shopping... Performed distributed classification on Hadoop. Ü Improved SolrCloud capabilities by integrating Turkish tokenizer (Zemberek) and Turkish synonyms. Ü Developed systems for improved search ranking performance Ü Designed and implemented image search system architecture Ü Used Git, Maven, Jira, Confluence, IntelliJ Idea, developed with Agile methodology (SCRUM) Ü Committed bug fix and improvements to Nutch open source Project SEN: Scheduler ENhanced Project Senior Software Engineer Scheduler ENhanced (SEN) is a general purpose scheduler for cloud environments. Media: SEN-Media, Presentation: SEN Presentation, Web Page : B3Lab - SEN Ü Experience in all phases of lifecycle such as coding, designing, developing unit tests Ü Developed algorithms about reallocation of virtual machines for energy efficiency. Ü Developed Python application modules and Rest services. Ü Integrated to OpenStack platform and tested on it. Ü Used Git-flow, Bitbucket, Jira, Confluence, PyCharm, developed with Agile methodology (SCRUM) Ü Presented at OpenStackDayIstanbul conference: Conference Schedule E-BELGEM: Archive Management System Senior Software Engineer Aim of the project was developing an Electronic Record Management System. Ü Management of eliciting & analyzing & managing requirements, design activities. Ü Prepared architectural design and requirements documents. Ü Developed document Capture& viewer components. Java-Seam-Primefaces -Hibernate –MSSQL Software Infrastructures & Business Applications

Asya Finans - İstanbul, Turkey 08.2006 – 12.2010

Core Banking Software Framework

Senior Software Engineer & Architect

Developed, architected, and maintained infrastructure and framework of core banking application, used by 5K+ employees to serve 3M+ customers at 200+ branches. (.Net Windows App, C#, Oracle DB). Ü Conducted application design, technical reviews, and application stability consulting. Ü Implemented core components and architectural changes by C# using OOP. (Data access layer, cache layer, log & exception management, .Net Remoting layer, windows services) Ü Coordinated code reviews and performance tuning tasks of 100+ developers, provided guidance. Ü Discovered and fixed stability/scalability issues by code optimization and restructuring framework. Ü Consultant on integration of applications and use of web services, WCF, and MSE. Canan Girgin / RESUME Page 3

Ü Researched, evaluated, and integrated third party software products, worked closely with vendors. Ü Architected web framework using Asp.Net MVC, WCF, and NHibernate. Implemented service layer, exception handling, log management, authorization, and WCF behaviors. Ü Designed, developed, and maintained Application Release Automation and Deployment Service. Ü Designed and developed a new document management system PUSULA - B.P.M . Project Software Engineer & Architect Pusula is a modular and extendable Business Process Management (BPM) infrastructure which supports more than 100 different business workflows successfully. Ü Designed and developed a generic, new workflow automation system (C#, WCF, Oracle) Ü Developed several workflows (Credit Card Application, Financial Analysis and Investigation etc.) Ü Used Technologies: C#, PL SQL, .Net Remoting, WCF, Oracle 10g - 12c Education

B. Sc. - Computer Engineering - Ege University 07.2006 M.Sc. - Computer Engineering - Yildiz Technical University 02.2014 Thesis :Semantic Relation Extraction by Conditional Random Fields from Turkish Wikipedia Pages PhD. - Computer Engineering – İstanbul University 01.2015- Courses completed, in thesis period


Ü Genre Classification of Web Pages in a Turkish Search Engine. BigR&I International Symposium on Big Data Research and Innovation Ü Question Identification on Turkish Tweets.

INISTA International Symposium on Innovations in Intelligent Systems and Applications. Ü Semantic Relation Extraction by Conditional Random Fields from Turkish Wikipedia Pages. IEEE 22. Signal Processing and Communications Applications Conference Ü Language Based Web Crawling On Big Data.

IEEE 22. Signal Processing and Communications Applications Conference Ü Business Model Canvas Perspective on Big Data Applications. IEEE International Conference on Big Data

Contact this candidate