NEERAJ BORRA
DATA ENGINEER
Kansas City, MO +1-816-***-**** ******.*****@*****.*** LinkedIn
PROFESSIONAL SUMMARY
Results-driven Data Engineer with 3+ years of experience designing, automating, and optimizing data pipelines, ETL workflows, and cloud- native architectures across AWS, Azure, and GCP. Proficient in Python, SQL, Spark, and modern data platforms, with expertise in data modeling, warehousing, and big data processing. Adept at improving data quality, scalability, and performance to enable advanced analytics and business intelligence. Strong collaborator with proven success delivering end-to-end data engineering solutions in both enterprise and agile environments.
TECHNICAL SKILLS
Programming & Scripting: Python, SQL, Scala, Java, R, Shell Scripting Data Engineering & Big Data: Apache Spark, Hadoop, Kafka, Airflow, Flink, Hive, Pig Databases & Warehousing: Snowflake, Redshift, BigQuery, PostgreSQL, MySQL, Oracle, MongoDB Cloud Platforms: AWS (S3, EMR, Glue, Lambda, Redshift), Azure (Data Factory, Synapse, Databricks), GCP (BigQuery, Dataflow, Pub/Sub) ETL, BI & Analytics: Informatica, Talend, DBT, Tableau, Power BI, Looker, Excel DevOps & Tools: Git, Jenkins, Docker, Kubernetes, Jira, Confluence, Trello, Slack Soft Skills: Problem-Solving, Communication, Teamwork, Critical Thinking, Time Management, Adaptability, Leadership PROFESSIONAL EXPERIENCE
INTEL
DATA ENGINEER Austin, TX Feb 2025 – Present
Designed and deployed real-time streaming pipelines with Apache Kafka and Spark Structured Streaming, reducing IoT sensor data latency by 35% across 20+ production lines.
Implemented data quality and validation frameworks using Python and AWS Glue, ensuring 99% accuracy in analytics consumed by 100+ stakeholders.
Optimized Snowflake data models and queries, boosting reporting performance by 25% for finance and operations teams.
Partnered with cross-functional teams using Jira, Slack, and Trello to align data solutions with business goals, increasing collaboration efficiency.
TRIGENT SOFTWARE
ASSOCIATE DATA ENGINEER Jan 2020 – Dec 2022
Built and automated 20+ ETL workflows using Python, SQL, and Apache Airflow, reducing data integration time by 40% across multiple client projects.
Designed and maintained data warehouse models in AWS Redshift and Snowflake, supporting analytics for 50+ business users.
Developed interactive Power BI dashboards to provide real-time insights into KPIs, enabling faster data-driven decisions.
Enhanced data ingestion pipelines from APIs, FTPs, and on-prem databases, ensuring reliable data availability with 99% uptime.
Implemented data partitioning and clustering strategies in BigQuery, improving query efficiency by 30% for large datasets.
Documented technical processes and conducted knowledge-sharing sessions with peers, improving overall team productivity.
Partnered with cross-functional teams to resolve data quality issues, showcasing problem-solving and communication skills. WIPRO
DATA ENGINEER (INTERN) May 2019 – Dec 2019
Assisted in building ETL workflows with SQL and Informatica, automating data movement from on-prem databases to AWS S3.
Conducted data cleaning and preprocessing in Python, improving dataset usability for downstream analytics by 25%.
Created SQL queries and reports for internal teams, ensuring timely delivery of actionable data insights.
Collaborated with senior engineers on data model design for a client’s HR analytics project, improving report accuracy.
Supported agile team ceremonies and project tracking using Jira and Confluence, demonstrating adaptability and teamwork. EDUCATION
University of Missouri – Kansas City – Kansas City, MO Master of Science in Computer Science – Specialization: Data Science
Khammam Institute of Technology and Science – India Bachelor of Science in Computer Science
PROJECTS
E-Commerce Customer Experience Enhancement – Built personalized recommendation models on GCP using BigQuery, Cloud ML, and Cloud Storage, improving conversion rates and recommendation speed.
Uber Data Analytics with GCP – Developed ETL pipelines using Mage and BigQuery, enabling analysis of ride request and GPS data; built Looker dashboards for operational insights.
Healthcare Analytics – Diabetes Detection – Applied ML algorithms (Logistic Regression, Random Forests) with Pandas and NumPy to predict diabetes risk, enabling early interventions.
Cloud Deployment Project – Designed and deployed a web application across AWS, Azure, and GCP, setting up VMs, networking, and PaaS/IaaS deployments.
CERTIFICATIONS
AWS Certified Data Engineer – Associate
Microsoft Certified – Azure Data Fundamentals
Databricks Certified Data Engineer Professional
Tableau Desktop Specialist
Agile Foundations – LinkedIn Learning