Post Job Free

Resume

Sign in

Data Engineer

Location:
Dayton, OH
Posted:
February 27, 2024

Contact this candidate

Resume:

Poojitha Moganti

Email 937-***-**** LinkedIn Portfolio GitHub

SUMMARY

Seeking Software, BigData Engineer roles. I’m having 5+ years of industry experience transforming data into insights. Proven ETL expertise, full stack software engineering skills. Passionate about innovation and data-driven solutions. Proficient in Python, Java, SQL, Angular, JavaScript, Cloud and Big Data technologies. I’m willing to relocate to any part of the USA and work on site.

SKILLS

Java, Python, Linux, SQL, HTML, CSS, XML, JavaScript, Angular, React, SpringBoot,Restful API, Data warehouse, Data Lakes, Hadoop, Hive, Spark, PySpark, ETL, Data pipelines, Data modeling, Data Processing, Data Integration, Data transformation, Data Analytics, Docker, Kubernetes, Git, CI/CD, Splunk, JWT, Dynatrace, Terraform, AWS(Lambda, S3, EMR, Glue, API Gateway, Redshift), GCP(Big Table, Big Query, Cloud SQL, Pub Sub), Azure(Azure Data Factory, Azure SQL, Data Warehouse, Azure Synapse), Power BI. PROFESSIONAL EXPERIENCE

Tiaa, Charlotte, NC Aug 2023 – Feb 2024

Software Engineer

● Successfully Identified and led the migration of end-of-life web applications to the cloud, employing Angular, Rest API, Java, Terraform and AWS services like S3, Lambda, API Gateway, CloudFront, CloudWatch, Application Load Balancer, EC2, SQS, SNS, Quick Sight.

● Ensured a secure and scalable cloud architecture, preserving application integrity throughout the migration process.

● Crafted a web application utilizing Angular framework and RESTful API integration.

● Implemented features encompassing dynamic dropdowns, interactive buttons, and robust search functionality.

● Implemented Java and Lambda functions to enable authorized user access from Azure Active Directory utilizing JSON Web Tokens (JWT) for authentication.

● Orchestrated the deployment of highly efficient CI/CD pipelines using Electric Flow Cloud Bees, resulting in a remarkable 25% acceleration of development and release processes across multiple environments.

● Implemented robust logging solutions using Log4j and SLF4J for multiple web applications, significantly enhancing monitoring and troubleshooting capabilities.

● Proficiently utilized Infrastructure as Code - Terraform to seamlessly integrate with multiple AWS services.

● Automated infrastructure provisioning, achieving unparalleled efficiency and scalability.

● Utilized technologies such as Spark, Hive, and Hadoop to handle large-scale data processing tasks.

● Seamlessly integrated programs with AWS services (S3, EMR, Glue, Redshift) for efficient data management and storage.

● Orchestrated the migration of data from on-premises databases to the AWS cloud environment.

● Proficient in Advanced Data Analysis using Power BI, transforming complex data into visually appealing and actionable insights.

HS Technologies, Dayton, OH Aug 2022 – Aug 2023

Software Engineer (Data)

● Employed Git and GitHub Desktop for streamlined code management, ensuring efficient collaboration and version control.

● Collaborated on data processing and analytics using Google Cloud Platform like Big Query, Data Proc, Cloud Storage.

● Manifested a strong commitment to data privacy and security, contributing to GDPR compliance initiatives.

● Validated all databases and tables for regulatory requirements, ensuring full compliance.

● Applied secure data handling practices, configuring hashers, and incorporated masked columns in multiple database tables using Java and Spark.

● Conducted thorough reviews of ETL code, pinpointing and securing sensitive columns to enhance data security and compliance.

● Enhanced data quality through address transformations and standardization for new onboarding tables using Spark SQL, RDD and Python.

● Executed data mining and analysis, extracting valuable insights, and generating meaningful reports.

● Offered valuable recommendations, contributing to data-driven decision-making through rigorous data examination.

Third Eye Software, Hyderabad, IND Nov 2018 – Jul 2021 Software Engineer (Data)

● Facilitated the migration of code from Java to Python, resulting in a 30% increase in code flexibility and adaptability.

● Pioneered the implementation of a robust Full Stack software solution for the company's product line, utilizing cutting-edge technologies such as HTML, CSS, Angular, XML, Java, MySQL and AWS.

● Integrated the program seamlessly with diverse AWS services including S3, EMR, Glue, and Redshift.

● Executed secure data transfer from on-premise databases to the AWS cloud environment.

● Engineered a full-stack application for the company's premier product, adeptly utilizing technologies like React.js, Node.js, and MySQL to ensure seamless functionality and user experience.

● Orchestrated the integration of processed data with Azure Data Factory, establishing fluid data flow between cloud platforms.

● Conducted a comprehensive comparative analysis, pinpointing areas for enhancement in both platforms to optimize system efficiency.

● Employed the Beautiful Soup library and Python scripts to develop web crawlers, facilitating the extraction of data from diverse web pages, subsequently storing it in an S3 bucket. The SmartBridge, Hyderabad, IND Nov 2017 – Aug 2018 IoT Engineer

● Developed a retail webpage utilizing Java and full-stack technologies such as React.js for the front end and Spring Boot for the back end, seamlessly integrated with a chatbot to enhance user engagement and provide an intuitive shopping experience.

● Led the successful design and implementation of ChatBots using IBM Watson, showcasing expertise in natural language processing (NLP) and AI-driven conversational interfaces.

● Applied advanced natural language processing (NLP) techniques tailored for e-commerce, ensuring the ChatBot's ability to understand and respond effectively to customer queries related to product specifications, availability, and order-related inquiries.

● Established mechanisms to collect and analyze sentiment-related data from user interactions.

● To provide valuable insights into customer satisfaction and identifying areas for improvement.

● Utilized these insights to refine the ChatBot's responses and enhance its emotional intelligence over time.

● Successfully leveraged Books datasets to implement cutting-edge Optical Character Recognition capabilities, showcasing proficiency in transforming visual data into actionable text.

● Develop interactive IoT sensor data dashboards using Java and full-stack technologies such as Angular for the front end and Spring Boot for the back end, ensuring real-time data visualization and user interaction.

● Utilize IBM Cloud services including IBM Watson IoT Platform and IBM Cloudant for data management, deploy the application on IBM Cloud for scalability, security, and seamless integration, adhering to industry standards for authentication, authorization, and continuous deployment.

● Demonstrated leadership and organizational skills by assuming the role of Team Coordinator, overseeing and leading various events, including Artificial Intelligence, Machine Learning, and IoT hackathons, bootcamps, summer internships, and faculty development programs. PROJECTS

YouTube Trending Video Data Analysis using AWS:

● Developed a personal project leveraging Kaggle's YouTube Dataset, showcasing self-driven initiative and analytical process.

● Utilized AWS CLI, IAM, S3, and Glue Catalog, Glue crawler services to orchestrate end-to-end data processes, ensuring data integrity and privacy.

● Implemented ETL jobs with Pyspark, harmonizing data transformation for insightful analysis.

● Employed Athena and SQL to query the Glue Catalog, producing visualizations in Power BI for impactful data communication.

● Innovated by incorporating Lambda functions, automating routine tasks, and reflecting a commitment to scalable solutions.

● Personally cleaned and preprocessed data, resulting in high-quality datasets for advanced analytics. Data Pipeline for real time streaming data using GCP:

● Independently conceptualized and implemented a real-time streaming data pipeline, integrating third-party APIs for continuous data ingestion.

● Employed Apache Kafka, Kafka topics, and Zookeeper to establish a resilient and scalable messaging system for seamless data flow.

● Leveraged GCP Services for Robust Data Storage and Analysis:

● Utilized Google Cloud Storage for efficient and secure cloud-based storage, ensuring accessibility and reliability of streaming data.

● Integrated Big Query for real-time analytics, applying advanced querying techniques to derive actionable insights from streaming datasets.

● Harmonized Data Visualization with Power BI for Impactful Insights:

● Integrated Power BI with real-time streaming data, delivering compelling and actionable visualizations to facilitate data-driven decision-making.

● Demonstrated expertise in GCP's data and analytics ecosystem, showcasing the ability to leverage cloud-native tools for comprehensive data processing and visualization. Tokyo Olympics Data Analysis using Azure:

● Conceptualized and executed an independent data analysis project using the Kaggle Olympics dataset, showcasing a self-driven approach to extracting meaningful insights.

● Utilized Azure Data Factory for orchestrating ETL processes, ensuring seamless data extraction, transformation, and loading.

● Implemented Azure Data Lake Gen 2 as a central repository for storing diverse Olympic datasets, showcasing proficiency in cloud-based storage solutions.

● Employed Databricks within the Azure ecosystem for scalable data processing and analytics, demonstrating advanced capabilities in big data management.

● Leveraged Azure Synapse Analytics to synthesize and analyze complex Olympic datasets, showcasing adeptness in handling large-scale analytical workloads.

● Applied Power BI for data visualization, creating impactful dashboards that conveyed key trends and performance metrics related to the Tokyo Olympics. EDUCATION

University of Dayton, Dayton, Ohio Graduated: Dec 2022 Masters, Computer Science

Certification in Autonomous Systems and Data Science Relevant Coursework: Advanced programing & Data Structures, Deep Learning, Artificial intelligence, Advanced Computer vision, Image processing, Virtual & Mixed Reality, Web Semantics, Algorithm Design, Database Management Systems.

LEADERSHIP & CERTIFICATIONS

● Microsoft Azure Fundamentals - AZ900

● Google Data Analytics Certification (Coursera)

● AWS Solution Architect Virtual Experience.

● JPMorgan Software Engineering Virtual Experience Program.

● DataBricks Data Engineer.

● Snowflake Data Engineering and Machine Learning using Snowpark for Python

● Certification in ORACLE OCJP SE6 JAVA PROFESSIONAL. ACHIEVEMENTS

● Runner up in CTF at CyberAuto Challenge 2023 - Michigan

● Finalist in TECHGIG GEEK GODDESS - 2018, THEME IOT.

● Won first prize in IEEE-WIE ENGINEERING PROJECT EXPO NNRG 2017



Contact this candidate