SRIKANTH REDDY BOJJA
*************.****@*****.*** Buffalo Grove, IL 279-***-**** LinkedIn
SUMMARY
Results-driven Data Engineer with 5+ years of experience and a master’s in computer science, skilled in Cloud, data modeling, Python, and CI/CD pipelines. Expertise in optimizing data workflows and enabling data-driven decisions in Agile environments.
EXPERIENCE
CVS Health Buffalo Grove, Illinois
Data Engineer June 2023 - Present
● Developed a GCP-based solution using Composer, Python scripts, BigQuery, and Dataproc to fetch real-time COVID and flu data from multiple sources, integrate feedback into SFMC for patient surveys, and process response dispositions for drug feedback and side-effect analysis. The solution sends surveys to nearly 2,000 patients daily and generated $1M in revenue for CVS Health.
● Established and configured an Airflow environment for workflow orchestration and Jenkins CI/CD, resulting in efficient deployment processes and the hosting of four critical repositories.
● Led the end-to-end design of a pharmacy product data model, leveraging BigQuery for table creation and maintenance, RStudio for Python coding, Airflow for orchestration, and Jenkins for Continuous Integration and Continuous Deployment (CI/CD). This product served as the primary resource supporting multiple business domains, managing data for nearly 80,000 340B pharmacies.
SIKKA.AI San Jose, California
Data Engineer Intern February 2023 – May 2023
● Implemented logger changes in the PMS and deployed them to client machines after QA and unit testing, as part of the NuGet package manager upgrade to v2.1.0.
● Using the MVC pattern, updated the real-time APIs to meet client needs and optimized the JSON serializer to improve execution time and memory usage, impacting the day-to-day activities of 500+ active clients.
AMAZON Hyderabad, India
Data Engineer August 2018 - January 2022
● Developed a machine learning framework in Python and R, achieving 80% accuracy in determining employee seat positions based on Wi-Fi data during COVID-19.
● Leveraged AWS and big data technologies to perform ETL operations, creating 29 rules using Scala and PySpark to detect defects in the invoice lifecycle.
● Built and maintained a 16-node Redshift data warehouse and migrated Oracle ETL pipelines, saving 15-20 TB per cluster by optimizing unused jobs.
● Designed and implemented a scalable architecture for global shipping support, hosting 55-60 Tableau and Amazon QuickSight reports in the cloud.
● Proactively engaged with business teams to gather requirements and build tailored data pipelines, ensuring their needs were met while fostering strong, collaborative relationships.
● Applied advanced SQL skills in complex queries, joins, and aggregations for robust data analysis.
● Used ETL tools such as Informatica and Apache Spark and BI tools such as Tableau and Power BI for data integration and reporting.
TECHNICAL SKILLS
Programming/Scripting Languages: Python, Java, R, Shell, Spark, PySpark, Scala
Frameworks/Libraries: Django, Flask, Pandas, Bootstrap, Keras
Databases: MySQL, PostgreSQL, Redshift, Oracle DB, DynamoDB, MongoDB, Hive, and HDFS
Tools: QuickSight, Tableau, Power BI, MySQL Workbench, Postgres, Git, Active Directory
Cloud: Amazon AWS, GCP, Google App Engine, Google Cloud SQL, Google Cloud Storage
AWS Technologies: Redshift, DynamoDB, Lambda, SNS, SQS, SageMaker, EC2, S3, CloudWatch, Athena
Big Data Tools: Spark, Hadoop, Hive, Metastore, Flink, Airflow, Scala, MapReduce, Kafka
EDUCATION
California State University, Long Beach, CA, US
Master of Science (M.S.) in Computer Science January 2022 – May 2023
Vardhaman College of Engineering Hyderabad, India
Bachelor of Technology (B.Tech.), Concentration: Computer Science May 2015 - May 2019