JERSEY CITY, US • ************@*****.*** • 201-***-****
SAI DINESH TALAPANENI
Software Engineer
PROFESSIONAL SUMMARY
Software Engineer with over four years of expertise in crafting scalable data solutions using cloud technologies like AWS and Databricks. Demonstrates strong proficiency in ETL processes, Python, and data integration patterns, focusing on innovative problem-solving and future-oriented strategies. Committed to leveraging advanced skills in OpenMP, OpenShift, and Pytorch to drive data-driven decision-making and enhance organizational success. EMPLOYMENT HISTORY
SOFTWARE ENGINEER Jan 2024 - Present
Tricubic INC United States
• Involved in migrating existing data warehouses to AWS ecosystem services.
• Implemented data integration patterns using Pyspark for incremental data sync.
• Developed Event driven ETL pipeline to extract data from AWS S3 Data Lake and further processed it using PySpark in Databricks and Lambda.
• Utilized Python Boto3 to access various AWS services.
• Provided support to S3, IAM, Security Groups and Cloud Watch creation and update.
• Worked on modifying terraform templates for instantiation of AWS services.
• Created CloudWatch alerts for failures over the application. TEACHING ASSISTANT Aug 2023 - Dec 2023
Valparaiso University United States
• Assisted in developing projects, assignments, and grading for Software quality and Software verification courses.
• Mentored 60+ graduate students in coding and debugging, providing problem-solving strategies. DIGITAL SPECIALIST ENGINEER May 2021 - Aug 2022
Infosys India
• Developed and implemented scalable and efficient data pipelines using AWS services such as S3, Glue and Lambda.
• Migrated all the applications to AWS EC2 which are previously on-premise.
• Built and managed data streaming pipelines using Apache Kafka.
• Developed and maintained ETL workflows using AWS Glue with PySpark.
• Created user data PowerShell scripts, terraform scripts to spin EC2 instances in AWS.
• Created Glue jobs to ingest data from on-premise to S3 buckets and Lambda to process further.
• Utilized SQS in few other pipelines to decouple the lambda triggers.
• Setting up DR for all applications to support Prod failures.
• Validating migrated applications end-to-end and testing DR by making Active-passive approach.
• Removing sysadmin account from applications and implementing Impersonation on AD account to do Data operations.
• Migrated all SQL account in data pipelines to Cred API managers to get the password rotated automatically without manual intervention.
• Migrated all services to use gMSA account instead of password protected account. BIG DATA INTERN Jan 2021 - May 2021
Revature India
• Gathered business requirements with source/BA team and helped source team on pattern decision.
• Managed the input data files using Hadoop file system commands in landing server by using SFTP or C:D setup for Linux pattern.
• Authenticated and validated different GCP SA using g-cloud commands for proper data movements using GCP client libraries.
• Created GCP storage buckets to store converted AVRO/JSON files of source data.
• Created Big Query Datasets, tables, and views to load data from GCS.
• Involved in Nifi Template development to consume source files, transforming into AVRO and stores at GCS.
• Created Unix wrapper scripts to load AVRO data from GCS to Big Query partitioned tables using Cloud SDK.
• Involved in DZDO setup and file transfer between GCS to other LINUX environments.
• Created and enhanced Unix scripts to re-run the data ingestion from failure point.
• Involved in End-to-End data ingestion to GCP using application.
• Implemented notification alerts upon failures of data ingestion.
• Analyzed if any failures with files and Data base ingestion and suggested source team for further actions.
• Written logic for filtering source data over source data files before loading into BQ tables.
• Written re-try logic for data movement between GCS to BQ tables using BQ cli.
• Notified end users upon data loads using batch control table implementation under Big Query. EDUCATION
MASTER'S IN INFORMATION TECHNOLOGY Aug 2022 - Dec 2023 Valparaiso University Valparaiso, IN
BACHELOR OF TECHNOLOGY IN INFORMATION TECHNOLOGY Aug 2017 - May 2021 Panimalar Engineering College Chennai, India
COURSES
AWS CERTIFIED SOLUTIONS ARCHITECT – ASSOCIATE
AWS
SKILLS
Python (Experienced), Flask, R (Skillful), C/C++ (Skillful), SQL (Experienced), Django, OpenCV, TensorFlow, Keras, Pytorch, CUDA, OpenMP, React.js, Langchain, Celery, Pandas (Experienced), NumPy
(Experienced), SciPy, Sklearn, MongoDB, MySQL (Experienced), PostgreSQL, Pinecone, RDS (Experienced), ETL, Athena, Kinesis, S3 (Experienced), Spark (Experienced), Machine Learning (Skillful), Deep Learning, GANs, Computer Vision, NLP, LLMs, Git, GitHub (Experienced), Kubernetes (Experienced), Docker, OpenShift, AWS (Experienced), Azure, PySpark (Experienced), Boto3, Terraform, CloudWatch, Apache Kafka (Experienced), PowerShell, Hadoop (Experienced), Big Query (Experienced), GCP (Experienced).