* * * * * Certified Big Data Developer
* * ******** ********** ******* of Anirban Das
Career Précis :
Total 15.8 years exp.in a fast-paced and highly matrixed environment – On Enterprise IT Application & practice development, COE-open source, Big Data Engg. & Analytics delivery excellence I solution definition I revenue generation and account growth Tech Lead/ Architect
Individual Contributor
Consulting & Solution Delivery
0
1
2
3
4
5
6
Industrial
Automati
on, Data
Acquisitio
n, SCADA,
PLC prog.
SAP,BIW,
EDW,
Enterprise
App, BI
Business
Informati
on
Managem
ent,
Analytics
BI,Big
Data, Data
Analytics,
Advance
Analytics
Tech Lead/ Architect 2.5 6 4.3 4.3
Individual Contributor 3 5.2 2.4 5.6
Consulting & Solution Delivery 3.5 3.7 3 4.3
2.5
6
4.3 4.3
3
5.2
2.4
5.6
3.5
3.7
3
4.3
TECHNICAL ROLES/KPIS DEFINED
EXP IN NO. OF YEARS
TECHNOLOGY DOMAIN DIMENSIONS
A Glimpse : Role distribution across my career span Tech Lead/ Architect Individual Contributor Consulting & Solution Delivery 2 P a g e Certified big data developer I +91-990******* 2
Competence Profile: Anirban Das Certified Data Storage Specialist – Anirban Das
Present key working Skills on Digital &
Data transformation with Adv.Analytics :
Envision, lead and execute POT/POC/MVP
Using extended team.
BI, Big Data modelling with MapR Hadoop
distribution I Advance Analytics with Python
ML-library toolkits
Technology Domain :
EDW I Digital Tr., BI, Big Data Analytics I
Applied Statistical Data Modeling I Cloud
Compute, Network & Storage I Data
Storage & CDP
Industry Vertical :
Clinical Research & Healthcare I Retail
Store I Telecom & Networking I
Hospitality I Oil & Gas I IMS I Energy &
Utilities I Investment Banking I Infra,
buliding, Environment & Water Engineering
I Infrastructure Control Asessment I Cyber
Security & Tech Controls (CTC)
Pedagogical Performance
MBA, BEE (JU), M.Tech-CS, AMIE, C. Engg.
Pursuing PhD ( on BD & advance analytics)
Chartered. Engg.(IEI), Govt. of India
Certifications :
Prologue
MapR Big Data Developer-CDP
Data Science with
Python
COURSERA SAP 4.7
Spark 1.6
Cassandra
MapR
Certified
Spark
Developer
ANIRBAN, an astute professional brings with his
execution focused IT solution development experience, entrepreneurial outlook & energy, interdisciplinary educational & extensive senior management experience with a remarkable ability to combine visionary
leadership, business acumen, and strong execution in building and managing successful consulting
organizations.
He has spent significant time working with key
customers across the globe & brings in extensive
experience in area of Data engineering/EDW, Analytics through applied statistics, Business Intelligence (BI) and Information Management.
Key Performance Area/Forte:
Attract, influence, develop and lead a high performing and engaged team with
Camaraderie
Data Analytics Practice Objective:
-‘Torture the data, it will confess anything-‘
Clientele from: EMEA, MENA, GCC, APAC, US
USA I UK I Germany I Australia I Singapore Malaysia I India I KSA I Belgium
Employment Chronicle:
SIEMENS LTD. I Invensys plc. UK IElectronik Systeme LAUER GmbH IGermany I IKF TECHNOLOGIES LTD. Inc.USA I Elecom Software, USA I KPMG, SIDF, KSA I REBACA Analytics Inc., USA I KPMG Global,
Press Publication: Success Story by THE TELEGRAPH
The Indian Express, Mumbai
The Economic Times
Present Organization:
ARCADIS Consulting Inc., USA www.arcadis.com
In the capacity of:
Principal Architect- Big Data, BI & Analytics
Solution, Consulting & Delivery
Global Excellence Center
Location: Bangalore
M: +91-990******* I E : *******.*****@*****.***
Skype : anirban.das8
3 P a g e Certified big data developer I +91-990******* 3
Competence Profile: Anirban Das Certified Data Storage Specialist – To work with a companionable growth oriented Data Analytics organization having affable work ethnicity, coherent techno work environment while dedicating and committing to the entrusted responsibility through industrious and creative aptitude to propel my organization to newer frontier of augmentation and excellence Prime focus on getting excellence in Big Data, BI & Data Analytics arena in the capacity of Big Data solution architect
Contemporary Status
Rel. Exp. in Digital transformation & Dat Engg. with BI & Analytics : Out of Total 15.9 years in IT 5.7 yrs in BI, SCADA, RPA, Virtual Reality, Big Data Analytics, Machine learning, Data Storage, Protection/Security
Current Position : Principal Architect/Practice Head- Big Data Analytics (CSD) Lead Global Excellence Center, Digital & Data Analytics ARCADIS Consulting Inc., Atlanta, USA
Current location : Bangalore, India
Locational preference : Can relocate
As Visiting Faculty : Indian Institute of Management -Kolkata, (fr MDP); SMU Scholastic Performance: Academia Dossier
Career Annals (Working Company Silhouette) : A digital journey Job Location Organization Last Designation Timeline Germany, India, Nepal, Riyadh SIEMENS Technology Services
(IKF GmbH Payroll) ( MNC)
Sr. Application Engineer- Data
Acquisition, SCADA, PLC
2002, Aug- Sep 2006
Germany, USA, India, Singapore
IKF Technologies Ltd. ( NSE, BSE
Stock Listed), USA ( MNC)
Global Solution Head- Big Data,
EDW, BI
2002, July- October, 2015
Santa Clara, USA, Dubai, Singapore
Rebaca Analytics Inc., USA (MNC)
VP- Big Data, BI & Analytics
Cosulting & Solution Delivery
Nov., 2015- April 2017
Atlanta, USA, Bangalore Arcadis Consulting Inc., USA(MNC) Netherland based, Stock Listed
Sr. Technical Architect- Digital &
Data Science ( AD, Grade-4)
May, 2017- Nov., 2017
Singapore, India, Dubai
Hewlett & Packard Enterprise(HP)
And IKF Technologies Ltd. (MNC)
Consultant Architect (
Independent) – Big Data Engg &
Data Science
Dec., 2017- Present
SIEMENS, Schneider Electric & Beijer Electronics under the payroll of IKF TECHNOLOGIES GmbH (2002-2015) .IKF Technologies ltd., is a luxemberg & NSE listed organization. Worked with Infogemglobal Inc., USA ( a subsidiary of IKF USA) BEE - 2002
From Jadavpur
University,Five Star
Accredited Govt.
Univ.)
MTech- Computer
Science
2014
Karnataka University
MBA - Systems
2006
International Institute of
Management Sciences
Chartered Engineering-
Institute of Engineering,
2005
Govt. of India
Pursuing PhD
On
Critical Big Data App
framework with data
science
Milestone
4 P a g e Certified big data developer I +91-990******* 4
Competence Profile: Anirban Das Certified Data Storage Specialist – A _Glimpse: Skill sets
Programmable Logic Controller Allen Bradley, Siemens, Delta, Messung, Mitsubishi, Toshiba, Schneider, Beijer, ABB
PLC Programming Language Ladder Logic, STL, FBD
SCADA ( Data Acquisition) WinCC, Citect, RSView, Wonderware, Intouch, Cimplicity, GE iFix, LAUER- WOP, BEIJER GmbH
Operating System Windows 2000/XP/VISTA/7, Redhat Linux, CentOS, Ubuntu 14.0/15.0 HMI/MMI (Man Machine
Interface)/Touchpad
LAUER WOP, BeijeriMX, Siemens, Delta, Messung, Mitsubishi, Toshiba, Schneider, ABB
Drives ( VFD-AC/DC) Parker, Rockwell, Siemens, Delta, Messung, Mitsubishi, Toshiba, Schneider, Beijer, Baldor, ABB, PARKER (584SV, 620V, 650V, 605C, 690+ Vers BALDOR, LG (iG 5),YASKAWA (616G5), CUTES, DELTA (VFD-A) Drive Programming Software Windows based ConfigED Lite+, CELite 5& 6, LINK Tools GPIB/Data Communication RS232, 422, Ethernet, Modbus, Profibus-DP, Canbus, Interbus, AS31 Scripting & Programming Language JavaScript 1.8.5, Core Java 1.7/1.8, Python 3.5++ Database, SQL/No-SQL Oracle 8i/11i ;SQL SERVER 2008; My SQL, Hbase 5.8, MapR DB, Mongo Reporting tools Crystal Report, i-Report, Spago BI 2.3.0, Jaspersoft 6.3 IDE Eclipse; Dreamweaver; Pycharm (JetBrains), IntelliJ 2017.3.2 Application Server Tomcat Apache 2.2/2.4
Ingestion Tool Kafka 0.9.1, Sqoop 1.4.6 ++, Talend with Big Data 6.0 Orchestration & Scheduler Zookeeper 3.4.8, OOZIE 4.1.0 Data Storage Solution/Data
Analytics/Data Visualization/ETL
TSM (IBM Tivoli Storage Manager ) 5.3/5.4, CDP
(HCL Infosystems Ltd.), Pentaho 7.0, Open source BI- Spago 2.3.0, Jaspersoft(TIBCO) 6.3, Power BI ( 2016/17), Logstash 6.1, KNIME 3.1.2 Big Data Technologies/ Data
Analytics with Data Science
Hadoop Framework (2.1.7) with Map-Reduce, HBASE 5.8, HDFS, Pig 0.15, Hive 1.2.1, Python 3.5, and shell scripting, Spark SQL & Spark core ( 1.6.1 Spark Streaming (Dstreams) with scala (2.11.7); Machine Learning (SVN,ANN, K-Means), Time series, Regression, with Ensemble learning, ElasticlogstashKibana 5.0.0 & MapR CDP 5.0,6.0 CLOUDERA 5.3.3 with Cloudera Manager 5.3.3 & HUE 3.12, MCS 5.0 Abstract / Crux of experience on BIG DATA technologies with advanced analytics Hadoop/Spark/ELK Stack
• In-depth working knowledge of Hadoop-Spark
architecture and admin,support & Linux system
administrator
• Experience with terabyte or larger size
unstructured & poly-structured data sets in Json,
.txt,.Csv,ASCII, Avro, XML
• Proficient in Hadoop ecosystem with Map-Reduce,
HBASE 5.8, HDFS, Pig 0.15 or Hive 1.2.1, and shell scripting; loaded the dataset into PIG environment for ETL (Extract, Transform and Load) purpose using PIG Latin Script. The processed results from Map
Reduce programs are stored into HIVE in the form
of tables for further analysis.
• Expert understanding of ETL, ELT principles and how to apply them within Hadoop distribution
• Coding/scripting validation ability in Java 1.7, 1.8, python 3.5++, Scala 2.10.6 onwards (for SPARK Core, SparkSQL)
• Experienced with linux system monitoring &
analysis. Good understanding of distributed computing environments
• Exposure to Virtualization (VMware), VPC & Cloud - AWS - EC2, S3, EBS, EMR
• Experience with Map-R Converged Data Platform
5.1 & 6.0 with MCS (MapR control system)
Atlassian JIRA 7.2, Confluence 5.10, Agile
methdology with SCRUM
0 P a g e *******.*****@*****.*** I Skype : anirban.das8 I +91-990******* 0 A Succinct Competence Profile of Anirban Das
Project Snapshots - On IoT, BI, Big Data Analytics & Applied Stat modelling
# Project : Gas Consumption forecasting & descriptive analytics on raw ingested smart/analog & BLE sensors data through IoT framework ( device, network & application layer) & CDMA Client /Solution Owner :
Air Liquidie, Singapore
Duration: Nov.2015 - Present I
IoT Data ingestion & Analytics framework implementation with CDMA(Continuous data monitoring & Analysis) from derivative stage
Service Delivery Model : Hybrid - Onshore & Offshore
Role : Lead Solution & Tech Architect – Digital Transformation
Technology Domain Used :
IoT, Big Data Engg., ETL, BI, Advance Analytics with Statistical modelling (TS)
Consulting Orgn. : ARCADIS, USA
Industry Vertical : Oil & Gas
Project Location : Singapore
Technology Stack/Deployment
R 3.2.1 (GNU package), IDE-R-studio, D3.js, HTML5 (front end), Python 3.5, R-Forecast Package: ARIMA, Holtwinter, Archive Network – CRAN, Unit Root Tests – augmented Dickey–Fuller test (ADF), Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test, nDiff, Diff . R-Shiny Pro(on Premise) with Elastic Search 6.0 as TS datastore cluster,TSDB 2.2 & InfluxDB+Grafana 4.0, Rasberry Pi3B, Rasbian OS, ESP8266 MC Cloud : ThingsBoard 1.3 ; Message Protocol MQTT 5.0, REST on http Role Description
Complete understanding of raw sensor data, data exchange format, storing & processing while prepared expectation criteria from the client.
Clean and prepare stationary data to fit the TS model with unwavering accuracy from device layer to application layer.
Used necessary hypothesis to figure out best time series model in R.
Defined Lamda architechture with Tableau,Grafana,D3.js. R-shiny for data
Rendering through a distributed VPC computing platform. Project Details/ Project Overview
Forecasting of gas consumption for various industrial customers using last 2 years historical TS records of per hour gas consumption. ARIMA and Holt-Winter are two different time series forecasting methods have been used for forecasting of next 6 hours (with minute level TS data), 6 days &3 months consumption.
All data are in ASCII format of data which was pushed to lamda data processing framework/Architechture. CDMA & stat modelling for deriving impactful KPIs from hindsight to insight to foresight through a compelling UI/UX.
Project : Infrastructure Control Assessment(ICA), Analytics of security data from UTM (Unified Threat Mgmt) Consulting Organization : ARCADIS Inc, USA Duration : August 2016 Industry Vertical : Healthcare, Infrastructure Mgmt Service Solution Owner : GlaxoSmithkline, USA I Toga Network, USA Role : Solution & Delivery lead
Project Location : Atlanta, USA Service Del. Model : Offshore Project Overview: Infrastructure Control Assessment (ICA) Technology Domain: ETL, Big Data Engg. & BI SPAM analysis to ensure that emails always reach their target. By doing accurate spam analysis GSK can automate email authentication. Users can send emails from any domain and protect from spam sites. Spam anomalies analyze the following stats:
Email header: where the spam starts, spam domain, Return-path, Received tags, X-IP tag, X-UIDL of spam mail. Spam mail stats from penalized/banned sites/portals, Referrer spam stats, disavowing spam link stats Security threats/DDOS attacks analysis. Assess network vulnerabilities and risks Technology Stack : ELK –ElasticSearch-LogStash-Kibana 5.1.1, Java 1.8, D3.js,AngularJS/JQuery, HTML5, Bootstrap 1 P a g e certified big data developer I +91-990******* 1 Competence Profile: Anirban Das Certified Data Storage Specialist– Co-Product Development ( Big Data, BI, ML) :
Analytical tool: Root Cause Analysis (RCA) Tool (COE Initiative) RCA is a tool for making automated Root Cause Analysis against error conditions reported by application logs from the different components in an enterprise
application ecosystem. The tool uses the Elastic Stack
(ELK) to derive its intelligence by correlating and analyzing the different logs along with system health KCIs based on timestamp driven pattern recognition and introspection rules executed by a C# based
middleware component.
Rules against error and anomalies to provide
recommendations
• Use of packages across a segment of a network /
Seasonal behavior of a subscriber base and individual subscriber
• Usage pattern based on time of day / Production error due to incorrect system configuration or sizing
• Error caused due to faulty system hardware
components
• Error caused due to incorrect behavior of 3rd party components
• Remedial recommendations against error conditions Implementation:
The RCA server consumes data pushed into Elastic
Search by Log-Stash and executes a multithreaded
sequential workflow of correlating the data followed by execution of the configured rules and extracting the relevant preconfigured recommendations.
Case Study URL :
https://www.rebaca.com/whatwedo-domain-big-data/
CISCO Testimonial :
https://www.rebaca.com/client/alldigital/
Implementation for: CISCO, ERICSSON
Big Data Framework Development with Lamda Architechture: Analytics framework: Customized BIG data Framework for Streaming Data Analytics (COE Initiative) The objective is to create advanced analytics on data ingested using Kafka-Spark-Cassandra framework
prepared for handling un-structured streaming & batch data from disparate data-sources.
Implementation:
Input data is received either through REST call or streamed to Kafka topics and the Spark-Cassandra
Framework, through Kafka subscription, receives the data. Then different ETL procedures and data
processing are applied to the ingested data. Both the processed and the raw ingested data is saved to the Cassandra DB for further data modeling and
visualization. The raw ingested or semi-processed data is also used for training purposes using different machine learning algorithms (in-built Spark MLlib / Other algorithms).
Case Study URL: https://www.rebaca.com/whatwedo-
domain-big-data/
Implementation for: Empirix Inc., USA, ALEF
Mobitech, USA
Technology Stack
•Elastic Search 6.0, Log-Stash, Kibana, Java 1.8,
Angular-JS/JQuery, Bootstrap, MapR Hadoop
Distribution 5.2, OJAI, MapR DB (Json & Binary
Table with Habse API)
Technology Stack
•Java 1.8, Apache Spark 2.0, Apache Kafka 2.0,
Apache Cassandra 3.0and D3.js, and MapR
Hadoop Distribution 5.2
1 P a g e certified big data developer I +91-990******* 1 Competence Profile: Anirban Das Certified Data Storage Specialist–
# Project : Critical Big Data Application framework (CAF) for Cyber Security & Technology Controls(CTC) Client /Solution Owner :
RO, Cyber Security Control, Govt. of
India, Delhi, Head Quarter
Duration: Nov.2017 - Present I
Sensor Data ingestion with MapR Hadoop big data framework with EDA through BI & custom application interface
Service Delivery Model : On premise implementation
Role : Lead Data & Tech Architect
Technology Domain Used :
IoT, Big Data Engg., ETL, BI, Advance Analytics with Statistical modelling (TS)
Consulting Orgn. : HP Enterprise
Industry Vertical : Cyber Security
Project Location : Delhi, GOI
Technology Stack/Deployment
MapR Hadoop Distribution 6.0, MapR FS, MapR DB ( Hbase API), OJAI, Talend with Big data (using Spark) studio, D3.js, Elastic Search 6.0, HTML5 (front end), Tableau, SparkR 2.1, SparkMLlib, TIKA Library Spark 2.0, Kibana, NER, NLP with python, Time Series modelling with R Role Description
Complete data modelling with Hadoop, Hive, MapRDB, ES
Index pattern design, search query modelling, Document indexing, mapping & modelling. Storing image, pdf, doc using TIKA library.
With advance analytics data modeling
Project Objective The Risk & Control team supports and manages the oversight and delivery of all Risk, Control, Compliance, Audit and regulatory, Resilience and Cyber requirements and Technology Controls, and its partners
Project Details/ Project Overview
Exploratory Data analysis of the security sensor data, segmentation of the targets through sensor data, GIS data plotting, hotspotting, anomaly detection, recommendation, text analytics, search and complete visulation of KPI/metrics ( quantifiable measure)
Project on advance Analytics with R & BI:
Project Title: To predict customer satisfaction score based on key influencing process management parameters
Client: REBACA, USA
Employment Type: Full-Time Duration: Nov 2015 - Feb 2016 Project Location: Santa Clara, USA Site: Offsite
Role: Solution Architect Team Size: 3
Skill Used: Statistical tool: R with R-Studio, Excel, SPSS ; Sentiment Analytic tool using NLP : Stanford NLP, CoreNLP, Qdap, OpenNLP ( other NLP for accuracy validation) ; Text Mining : Wordcloud, tm ; R- Packages : • MICE Package for missing value imputation • Psych package for factor analysis • Random forest package for random forest machine learning techniques • Statistical Methods : Multivariate Linear regression ; Machine Learning Techniques : Random forest
Project Overview:
To predict customer satisfaction score based on key influencing process management parameters (KCI- Key Capability Indicator.
Project Scope :
To create predictive analytical model for any running project using multivariate linear regression & machine learning technique using Key Capability Indicators (KCI) of project. Used Sentiment analysis on client’s project feedback to generate satisfaction score.
2 P a g e certified big data developer I +91-990******* 2 Competence Profile: Anirban Das Certified Data Storage Specialist–
Project Title: To predict customer satisfaction score based on key influencing process management parameters
Client: REBACA, USA
Employment Type: Full-Time Duration: Nov 2015 - Feb 2016 Project Location: Santa Clara, USA Site: Offsite
Role: Solution Architect Team Size: 3
Skill Used: Statistical tool: R with R-Studio, Excel, SPSS ; Sentiment Analytic tool using NLP : Stanford NLP, CoreNLP, Qdap, OpenNLP ( other NLP for accuracy validation) ; Text Mining : Wordcloud, tm ; R- Packages : • MICE Package for missing value imputation • Psych package for factor analysis • Random forest package for random forest machine learning techniques • Statistical Methods : Multivariate Linear regression ; Machine Learning Techniques : Random forest
Project Overview:
To predict customer satisfaction score based on key influencing process management parameters (KCI- Key Capability Indicator.
Project Scope :
To create predictive analytical model for any running project using multivariate linear regression & machine learning technique using Key Capability Indicators (KCI) of project. Used Sentiment analysis on client’s project feedback to generate satisfaction score.
Project Title: Big Data Analytics solution for Mobile and Voice Data monitoring in physical and cloud networks
Client: Empirix Inc. USA
Employment Type: Full-Time Duration: Jun 2015 - Present Project Location: Atlanta, USA Site: Offsite
Role: Solution Architect Team Size: 5
Skill Used: Hadoop, HDFS, Vertica ( DB), Microstrategy ( BI) Project Overview :
Big Data Analytics solution for Mobile and Voice Data monitoring in physical and cloud networks Enhancement and maintenance of a monitoring and Analytics solution for mobile and voice data Features supported
Protocol data is captured from service provider networks using virtual and physical probes positioned at different points in the network. This data is transferred to Regional Operations Servers (ROS) for inter-probe correlation; after which it is exported to a central mediation server for business analytics. At central mediation server, the data is suitably enriched using reference data (dimension data) before getting loaded into the designed Vertica warehouse. From Vertica database, MicroStrategy picks up the data to perform different aggregations based on the clients reporting needs.
Tools and Technologies
Platform: Linux I Big Data technologies : Hadoop 2.7.1, HDFS, Yarn, Kafka 0.9 Enrichment/Correlation: Java application, 1.7 ; Analytics DB : Vertica 3 P a g e certified big data developer I +91-990******* 3 Competence Profile: Anirban Das Certified Data Storage Specialist–
Project Title: Monitoring, Big Data Analytics & Reporting of network service KPI through SELF CARE web app portal
Client: ALEF MT Inc., USA
Employment Type: Full-Time Duration: May 2015 - Present Project Location: Atlanta, USA Site: Offsite
Role: Sr. Project Leader Team Size: 9
Skills Used: Search Engine : Elastic Search ETL : Logstash ; BI Reporting : Kibana Web app Portal : Liferay Data Size : 1 billion bytes/day
Project Overview:
To create basic & advance analytics on big data for a RAN caching OEM vendor. Project Scope :
Periodic forwarding of service metrics & faults through SNMP traps
Dashboard to enable control and management of custom content & services
Interactive reports & advance analytics that describe usage, billing and services used
Project Title: To model predictive& prescriptive analytics from the unexplored & unused CDR datafrom CPS ( CISCO POLICY SUITE )
Client: CISCO, USA
Employment Type: Full-Time Duration: Jan 2016 - Present Project Location: Bangalore & Kolkata Site: Offsite Role: Solution Architect Team Size: 4
Skill Used: Big Data Tools & Technologies Apache Hadoop 2.7.1, Yarn, Map Reduce, Sqoop, Oozie, Zookeeper Database : Hbase/MongoDB Cloud Environment : AWS : Amazon Web Services EC2 ( Elastic Compute Cloud ) Amazon S3 EBS ( Elastic Block Storage ) EMR ( Elastic Map Reduce ) Statistical Modeling : with R ( Version 3.3.0) & R - Studio Linear Multivariate regression for prediction Times Series Analysis using ARIMA & Exponential smoothing K-Means, KNN, SVM ( machine learning
). Using Map-R 5.1 CONVERGED DATA PLATFORM in AWS. Project Details/Features supported:
To derive actionable insights from CDR data which are still unexplored, unused and are being archived by the customers
To increase superlative end user experience & engagements
To open immense possibilities to new policy introductions/maintain
To take best informed & improved decision from predictive analysis Advance predictive analytics on CDR :
To predict the subscriber's data usage Cyclic & Seasonal trend/pattern Additive/Multiplicative nature on existing trend
To predict bandwidth usage of the organization for a month /day - Cyclic & Seasonal trend/pattern Additive /Multiplicative nature on existing trend
To predict Termination error from respective Nas IP/Frame IP with respect to time and more.. Actionable insights from Predictive analysis:
To promote plan/offers to the subscriber depending on their login time /session timespan, data usage ( up-selling) To recommend subscriber for better new plan/policy to be enhanced from their existing plan. 4 P a g e certified big data developer I +91-990******* 4 Competence Profile: Anirban Das Certified Data Storage Specialist–
Project Title : RESTORI, Analytics product services for Retail & Hospitality vertical Project Location : KU, Malaysia Site : Hybrid
Role : Sr. Technology Leader & Tech Product Architect Team Size : 5 Technology Stack : Hadoop, HDFS, EC2 AWS, S3, EMR, SOLR, D3.Js ( Data Visualization), Java & python
Product link : https://play.google.com/store/apps
Press Release : https://www.telegraphindia.com/1150824/jsp/business/story_38728.jsp#.VdtqISWqqkq Role Description:
• Suggested & implemented in AWS, different payment gateway viz. paypal, CC Avenue using their API. Implemented different VAS (Value added Services) viz, Make my trip, food panda, Bookmytable, Bookmyshow, Groupons using their web services.
Project Overview:
Conceptualized & designed the analytics product on Retail & Hospitality domain from the nascent stage. Restori is a product made using Big data & Hadoop clusters with SOLR as backend. Designed search algorithm in the product based on retail, hotel & Restaurant Analytics. Comprehended the challenges while advising on how to transform the application with improved UI/UX and technologies. Managing the project from creating project charter to WBS, activity mapping and more. Configured & administered the hosting of the entire application in Amazon Web Services (AWS) using Elastic Cloud Environment (EC2). Configured EC2 instances, RDS, Elastic cache clusters, elastic IP, workspaces. Created a test environment at Godaddy managed VPS servers. Designed the demographic analytics, word cloud, competitive benchmark, popularity trending, rating distribution, sentiment analytics, #Hashtag with social media ROI Analytics of the product. Prepared use cases and test cases for the first beta release of the product. Converted Restori to android & IOS APP. Achievements/ Attainments:
Released first beta of “Restori” as a product implemented on Big Data technologies using Hadoop ecosystem with SOLR as backend. Identified by press “The Telegraph” as one of the promising Big Data products invented & implemented by REBACA Analytics from Kolkata. Clentele: Future Group, Maxmart, Citimart, Symphony Hotels, some famous QSR Cloud native (AWS) Big Data Analytics Product implementation Overview: 5 P a g e certified big data developer I +91-990******* 5 Competence Profile: Anirban Das Certified Data Storage Specialist– Career Contour
Last Job Profile
Present Organization : ARCADIS Consulting Inc., Atlanta, NA-USA (NV based MNC)
Present scope of work / Role : Practice Head/ Pr. Architect - Big Data & Analytics Consulting,
Category/Period : Since April 2017 – till date
Immediate last Organization : REBACA Analytics Inc., USA
Designation : Practice Head- Big Data, BI, Analytics Consulting & Delivery
Period : Nov. 2015- March 2017
Prime Roles:
KPIs: Responsible for building a Digital Center of Competence to drive thought leadership, strategic direction, analytical insights, predictability, intelligent automation, and digital IoT innovation across GEC (Global Excellence Center, India), ARCADIS, USA.
Thought leadership – Deep breadth and depth of domain expertise in Big Data, BI, Analytics and Intelligent Automation (Machine Learning). Deep analytical skills including - statistical modeling, stochastic modeling, and neural networks to drive real benefit outcomes.
Digital roadmap – Establish, define, design, and deliver a digital center of competence that will demonstrate best in class Big Data Platforms, Data Management, Data Mining, Data Analytics, Data Solutions, Data Visualization, and Machine Learning.
Innovation - Collaborate with Delivery practices to create IP and Innovation roadmap
IoT Practice in ARCADIS : https://www.linkedin.com/in/anirbandas2015/detail/recent-activity/shares/
Key digital technology domain areas :
Hadoop (2.7.1), MapR CDP ( 5.5), Power BI, Spark( 1.6.0) with scala 2.11.7, Cassandra (2.1.11), Amazon Web Services (EC2, EBS,EMR,S3), Microsoft Azure, Python 3.5++, R, SQL, NOSQL( Hbase,MongoDB), Robotics Process Automation ( Work fusion), Augmented reality with Unity, MS Hololens ( COE initiative), AZURE Data Factory, Blob. Retail Store: Maxmart (online & Offline):
Project Title: Data processing, analysis, aggregation/hierarchy oriented scripts for processing of approx 1 M Client : Maxmart India Vertical : Retail/ e-Commerce Employment Type : Full-Time Duration : Mar 2015 - Aug 2015 Project Location : India Site : Offsite
Role : Tech Lead – Data Solutions Team Size : 4
Skill Used : Spark 1.5, MongoDB, HDFS 2.2.1
Project Details:
Data processing, analysis, aggregation/hierarchy oriented scripts for processing of approx 1 million rcrds.
create product oriented hierarchies
interactive product sales aggregations by buckets of selected products
create percentiles for comparing sales stats for products in newly aggregated product hierarchies 6 P a g e certified big data developer I +91-990******* 6 Competence Profile: Anirban Das Certified Data Storage Specialist–
# Organization : IKF TECHNOLOGIES LTD., Stock Listed Limited Co. (NSE, BSE, Luxemberg) NASDAQ, NSE, BSE listed IT company