Shalaka Thombare
Address: **** **** ****** #****, ***** Clara California, 95051
Email: *******.*******@*****.***, Mobile: 980-***-****, Linkedin: https://www.linkedin.com/in/shalakathombare EDUCATION
• University of North Carolina Charlotte, NC, USA January 2017 – May 2018 MS in Computer Science (Concentration: Natural Language Processing/ Machine Learning) GPA: 3.8
• BE in Electronics & Telecom Engineering, University of Pune, India June 2009 – June 2013 PROFESSIONAL EXPERIENCE
Software Engineer (NLP & Dialog Research), DMAI Inc (June 2018 - Present) I am working on developing algorithms to develop a complete and generic dialogue system, using C++ & Python.
• Led the development of predictive model for sequential/time series data using HMM (generative model), logistic regression and recurrent neural networks (LSTM).
• Significant contribution to the prototype development of a utility driven planning language like PDDL for the design of conversational & tutoring agents. My contributions include research of academic tools like cognitive tutor authoring tools by CMU, design of algorithm to convert the CTAT’s “behaviour graph”, research & development using PullString Converse.
• Developed & integrated utility update model for the dialogue system using Bayesian network models for predictive tasks.
• Led development of event management system using ActiveMQ broker, developed python-based GUI component, & integrated dialogue system component with the GUI, visualizer components.
• Developed a phonetic spelling corrector (phonetic word vectors) and contributed to the development of spelling correction system for a natural language understanding (NLU) server. Natural Language Processing Intern, Vector Analytics (February 2018 – March 2018)
• Applied topic modelling, document classification using Mallet to unstructured text data. (https://vector- analytics.com/the-company)
Graduate Research Assistant: University of North Carolina Charlotte (April 2017 – August 2017)
• Applied a Wikipedia articles concept embeddings model in predicting prior art for patent documents.
• Contributed to the Prior Art extraction using PTAB data mainly using JAVA. Senior System Engineer: Infosys, Pune, India (February 2014 – October 2016) Contributed to development of background web services in .NET using Sql with Azure database ACADEMIC PROJECTS
• Thesis on Prediction of Emerging Technologies:
o I have worked with Dr Zadrozny in applying unsupervised learning techniques to predict the emerging technologies using US Patent data.
• Classification of 20NewsGroup data using Word Embeddings using Tensorflow with AWS, Windows Azure & Google cloud. ( https://github.com/ShalakaGit/WordEmbeddings)
• Machine Learning & Visualisation
o Wine recommendation system on the wine reviews dataset using Python & D3. o Text extraction in Python using Naïve Bayes & Decision Tree approach. TECHNICAL SKILLS
• Programming Languages: Python, C++, Java, C#, C.
• Machine Learning Libraries: Tensorflow, Scikit-learn, Gensim, Pandas, Numpy, Scipy, CRFsuite.
• NLP & Information Retrieval: Nltk, Lucene, Solr.
• Databases: Mongo DB, Sql.
• Version Control: GitHub, BitBucket.
• Other Tools: Bazel, gRPC, Protocol Buffers, Mallet, Apache Thrift. INDEPENDENT PROJECT : Topic Modelling of the Speak-Up Magazine data as a project at Hackathon: Finalist of the Queen City Hackathon