VENKATA SAI KRISHNA AVINASH NUKALA adhbgj@r.postjobfree.com
linkedin.com/in/avinash-nvsk 513-***-****
github.com/nvskavinash Snowflake Certified
EDUCATION
Master of Science in Computer Science August 2018 - Present
University of Cincinnati, Cincinnati, OH GPA: 3.875/4.0
Courses: Big Data Analytics, Cloud Computing, Database Theory, Intelligent Data Analysis, Deep Learning, Advanced
Methods in Data Security and Privacy, Data Structures and Algorithms, Computer Networks
Bachelor's in Electrical and Electronics Engineering August 2012 - May 2016
Amrita Vishwa Vidyapeetham, Coimbatore, India GPA: 3.9/4.0
Courses: Embedded System Design, Data Structures and Algorithms, Microcontrollers, IT Essentials
SKILLS
Languages: Python, PySpark, C, C++, Java, Embedded C, MATLAB, SQL
DBMS: MySQL, MS SQL Server, Oracle
Web Technologies: HTML5, CSS3, AJAX, XML, JavaScript, REST, JSON, ODBC
Frameworks: Flask, Apache Spark, Apache Hive, Hadoop
Platforms & Tools: Anaconda, Jupyter, ETL, Informatica, Snowflake, Canopy, PyCharm, Cloudera, Oracle VM, MS
SQL Server, MySQL, Eclipse, AWS, Docker, PuTTY, Google Colab, Git, Linux, Goorm IDE
EXPERIENCE
Graduate Research Assistant, University of Cincinnati, OH Aug 2019 - Present
Technologies & Modules used: Python, Linux, WSGI, AWS EC2, psutil, perf, PyShark, NumPy, pandas, Matplotlib.
Website Cryptojacking Detection using Machine Learning: Detects whether a website is cryptojacked and, if so, at
what CPU throttle percentage, using CPU power, network trace, and cache data collected from a sample website
built with HTML5, CSS3, and malicious JavaScript and hosted on an Apache Tomcat server on AWS EC2. The data is
fed to a machine learning model that performs multi-class classification. Part of this research was
accepted at the 8th IEEE CNS 2020 conference. Paper ID: 157*******.
Data Engineer, Larsen & Toubro Pvt. Ltd., India Jul 2016 - April 2018
Tools & Technologies used: SQL, ETL, Informatica, MS SQL Server, Unix, Linux, IBM NDM, Tivoli, AutoSys.
Developed an application for the Finance and Decision Management teams to track project progress.
Data from various LOBs such as P&M, planning, and store departments is pushed to the Linux server using IBM
NDM. The data is then pulled from various relational DBs and processed at site level using the ETL tool Informatica
PowerCenter. Used different transformations to load data from interim tables to fact tables. Developed DQ
checks to ensure the integrity and quality of the data in fact tables.
Developed schedules and job definitions using scheduling tools such as Tivoli and AutoSys.
Developed many pre-processing and post-processing Unix scripts for checking and processing files.
PROJECTS
Decision Tree in Spark: Developed a decision tree algorithm from scratch using key-value pairs in PySpark.
Movie Recommendation System: Recommends movies to users based on their previous movie ratings, using the ALS
(alternating least squares) matrix factorization algorithm.
Hotel Management System: Developed a web application using the Python Flask framework and MS SQL Server. The SQL
schema consists of 5 tables interconnected by 4 foreign keys. The hotel manager can register a new food item or
modify an existing one in the database using INSERT/UPDATE/DELETE commands, subject to the schema constraints.
The application is hosted on an Apache server with mod_wsgi on AWS EC2. It provides basic operations to the
hotel manager through GET/POST/DELETE methods that return JSON responses. In parallel, developed a user interface
using HTML5, CSS3, and JavaScript, following UX standards.
TF-IDF for a set of books: Calculated TF-IDF values for each word across a set of books using three MapReduce phases
to find the signature of each book.
EXTRACURRICULAR ACTIVITIES AND AWARDS
Appointed Treasurer of the Graduate Student Association at the University of Cincinnati.
Awarded two Academic Excellence Awards and one Certificate of Merit by Amrita University, India.
Awarded the University Graduate Scholarship of $13,582 per semester by the University of Cincinnati.