Data Engineer Azure

Location:

Cleveland, OH

Posted:

November 15, 2023

Contact this candidate

Resume:

VISHNU VARDHANA BOLLOJU

Azure Data Engineer

MOBILE NO: +1-216-***-**** LINKED IN: https://www.linkedin.com/in/vishnu-vardhana-bolloju-a68627103/ EMAIL: *********************@*****.***

Data Engineer Big Data Data Warehousing ETL SQL Server Azure Data Factory Azure Databricks Power BI Cloud Services Python AI ML Java PySpark Azure Synapse Azure Logic Apps Azure Functions SQL Data extraction Data Modeling Data analysis Big Data (Hadoop) Spark Data Science Analytic Tools

PROFESSIONAL SUMMARY

Experienced Azure Data Engineer with 5+ years of expertise in Microsoft Azure, including ETL, data extraction, and secure containerization. Proficient in various Azure services, adept at designing Spark applications, SQL, and database structures for Business Intelligence. Recent Master's graduate with the latest industry knowledge. A proactive team player known for strong communication skills and a positive attitude, committed to driving data-driven success.

EDUCATION DETAILS

Master of Science: Computer Science – 3.66/4.00 CGPA December-2023 Cleveland State University – CLEVELAND, OHIO, UNITED STATES

Bachelor of Technology: Computer Science and Engineering – 7.75/10.00 CGPA May-2018 Vardhaman College of Engineering – HYDERABAD, TELANGANA, INDIA

TECHNICAL SKILLS

Programming Languages : T-SQL, Python, Java, Spark, C, PowerShell, R.

Web Technologies : HTML5, CSS3, Bootstrap 4, JavaScript.

Databases : Microsoft SQL Server, Oracle, MySQL, H2.

Tools & Utilities : Power BI, Weka 3.5, Rational Rose, Microsoft Office.

Cloud Services : Azure, AWS, S3, Azure Active Directory, MFA.

Operating Systems : Windows, Ubuntu.

Development Tools (IDE) : Microsoft Visual Studio, Eclipse, SQL Server Management Studio (SSMS).

Methodology : Agile, Scrum.

Version Control : GitHub, Azure Git.

Data Management Skills : Data Analytics, Deep Learning Sentiment analysis, Machine learning, Deep learning, Model selection and evaluation, Text data cleaning and tokenization, Feature extraction, Problem-solving VADER sentiment analysis.

Interpersonal Skills : Team Management, Adaptability, Critical Thinking.

Machine Learning and Statistics : Regression, Random Forest, Clustering, Time-Series Forecasting.

PROFESSIONAL EXPERIENCE

Organization: IFinite Solution Internship (September 2023–Present)

Role: Data Engineer Intern

Responsibilities:

Azure Data Service Mastery: Become proficient in Azure's data services, including Azure SQL Database, Azure Data Factory, Azure Databricks, Azure HDInsight, and Azure Data Lake Storage, by actively participating in their configuration, management, and usage.

Data Pipeline Design and Maintenance: Collaborate on the design and maintenance of data pipelines using Azure Data Factory, ensuring that data is efficiently extracted, transformed, and loaded into Azure data repositories.

Data Integration Excellence: Work on integrating disparate data sources and destinations within Azure, honing your skills in connecting and orchestrating data flow between Azure services.

Azure Security and Compliance: Gain expertise in Azure's data security features and practices, with a focus on protecting sensitive data and ensuring compliance with industry regulations.

Optimization and Efficiency: Strive to optimize data pipelines and Azure services for both performance and cost-effectiveness, exploring techniques like scaling resources and using serverless compute options to maximize efficiency.

University: Cleveland State University Internal (May 2022– August 2023)

Responsibilities: I have served as a Teaching Assistant (TA), supporting professors in lectures, labs, grading, and course material preparation, while also taking on the responsibilities of a Research Assistant (RA) by collaborating on research projects, data analysis, and research report writing. Additionally, you will provide academic support through tutoring and leading study sessions. As a Student Mentor, you will guide undergraduate and newer graduate students in coursework, research, career development, and contribute to orientation and student activities, fostering a supportive and engaging academic community.

Role: Research Assistantship (Work on AI based model)

Project Title: Sentiment Analysis on various Dataset with NLP

Project Description: Led a sentiment analysis project on Twitter dataset, Amazon review dataset, categorizing tweets or reviews as positive, negative, or neutral. The project aimed to uncover public sentiment on amazon reviews and Twitter related to diverse topics, brands, and events.

Responsibilities: Managed data collection, preprocessing, model selection, training, and evaluation.

Skills and Tools: Utilized Python, NLP libraries (NLTK), Scikit-learn, VADER sentiment analysis, Twitter and Amazon API’s, and Jupyter Notebook.

Approach: Gathered tweets using the API’s, cleaned and tokenized text data, used VADER sentiment analysis for initial sentiment scoring, and combined it with machine learning (e.g., logistic regression, Naive Bayes) and deep learning models (e.g., LSTM, BERT) for sentiment analysis.

Achievements: Achieved 85% accuracy in sentiment classification, identified sentiment trends, and developed real-time sentiment monitoring.

Challenges: Overcame issues with noisy Twitter data and addressed imbalanced datasets.

Collaboration: Worked with a cross-functional team, including data engineers and NLP experts.

Impact: Offers practical insights for social media and digital marketing, brand reputation management, and event sentiment analysis for opinion-based individuals.

Project Title: Real-time Data Processing and Analysis with Hive, Hadoop, and Azure Auto-Scaling (Worked on Big Data)

Project Description: This project aims to build a real-time data processing and analysis system that leverages the power of Hive, Hadoop, and Microsoft Azure's auto-scaling capabilities. The project will focus on processing and analyzing large volumes of data efficiently and automatically scaling resources to meet varying demands.

Data Management: Ingest and preprocess extensive datasets into Azure Data Lake Storage, ensuring data integrity.

Cluster Management: Configure and maintain the Azure-based Hadoop cluster, ensuring scalability and reliability.

Efficient Data Flow: Develop data pipelines for streamlined data movement and transformation.

Data Security: Implement robust data security measures for controlled access and storage.

Performance Optimization: Optimize data processing jobs for peak performance and efficiency.

Collaborative Data Availability: Collaborate closely with Hive developers to ensure data availability and accessibility.

Schema Design: Design Hive schemas and tables for effective data storage and analysis.

Query Optimization: Write and fine-tune HiveQL queries for insightful data analysis.

Monitoring and Scaling: Monitor query performance and collaborate with data engineers to enhance data structures. Collaborate with auto-scaling engineers to define scaling strategies.

Analysis and Documentation: Perform data analysis, visualize results, provide valuable insights, validate findings, and prepare comprehensive documentation, ensuring project transparency and adherence to best practices.

Organization: Tata Consultancy Services LTD Client: Microsoft (Jan 2020– Apr 2022)

Role: Data Engineer

Responsibilities:

Data Transformation: Led the acquisition and standardization of diverse master data sets, ensuring alignment across enterprise phases. Spearheaded the transformation of various data types into a standardized format for enterprise-wide utilization.

Data Solution Design & Development: Designed and developed complex applications, focusing on data solutions, business intelligence reporting, ETL development, testing, and documentation.

Azure Data Management Expertise: Managed end-to-end data processes over Azure cloud, including data ingestion, analysis, cleansing, and governance across Azure services like Azure Blob Storage, Azure Data Lake, Azure Data Factory, and more.

Project Management & Coordination: Successfully planned, managed, and coordinated project activities, encompassing sprint/iteration planning, approval, sign-off, and release management.

Power BI & Analytics Mastery: Leveraged Power BI for dimensional modeling, hierarchies, measures, and aggregations. Proficient in data integration from multiple sources and report design for actionable insights.

Cloud Computing & DevOps Integration: Architected scalable cloud solutions on Azure, focusing on efficient infrastructure and services. Integrated Azure DevOps for CI/CD, ensuring secure code management and application deployment.

Organization: Saavy Tech Private Limited Client: Internal (July 2018 – Nov 2019)

Role: Java Full Stack Developer

Responsibilities:

Agile Development & Methodology: Proficiently applied Agile methodology, employing Test-Driven Development (TDD) practices to ensure early and reliable software delivery.

Object-Oriented Design & Scalability: Leveraged Object-Oriented Analysis and Design (OOAD) principles and J2EE design patterns to architect scalable systems, emphasizing modularity and flexibility.

Java Expertise & Design Patterns: Demonstrated advanced Java skills, including Multithreading, Collections Framework, File I/O, and concurrency. Utilized design patterns like Singleton, Data Access Objects (DAO), Factory, and MVC for efficient development.

Spring & Hibernate Integration: Designed and implemented Model-View-Controller (MVC) architecture using the Spring framework. Developed Hibernate classes, DAOs, and configured Hibernate files for data retrieval and storage.

JSTL & Communication: Effectively employed JSTL tags to facilitate communication between controllers and Java Server Pages (JSP), ensuring efficient data flow.

Development Tools & Deployment: Utilized Eclipse IDE for application development and streamlined deployment processes with Maven build scripts, enhancing project efficiency.

Contact this candidate