Ranjith Kumar Kakarapu
*************@*****.***
EXECUTIVE SUMMARY
With over 15 years of experience as a technology leader, I have a proven track record in driving data integration initiatives for large enterprises.
Expertise in big data technologies like Apache Spark, and Kafka, designing scalable data pipelines, batch processing, and real-time streaming solutions.
Designed and optimized ETL workflows in Databricks, leveraging Apache Spark for high-performance data transformations and analytics.
Proficient in ETL tools (IBM DataStage Azure Data Factory) and cloud platforms (Databricks, Azure Data Lake), optimizing data ingestion and storage.
Extensive expertise in data warehousing, data engineering, and SQL-based analytics within Oracle, Teradata enabling scalable and efficient data processing.
Expertise in Python, SQL and PL/SQL for data manipulation, analysis, and model development.
Built and optimized machine learning models (regression, classification, clustering, time-series forecasting), improving forecasting accuracy and operational insights.
Skilled in SQL and PL/SQL, query optimization, stored procedures, and source-to-target mapping (STM) to ensure data accuracy and integrity.
Experience in cloud-based data solutions using Azure Data Lake, improving data storage optimization and query performance by up to 30%.
Strong expertise in data cleansing, transformation, and migration using SQL, Python, and ETL tools, ensuring data accuracy, consistency, and seamless integration into target systems.
Experienced in CI/CD, DevOps, and UAT testing with JIRA, Rally, and ServiceNow, ensuring seamless data deployment.
Developed precise technical documentation and impactful presentations using MS Word and PowerPoint, translating complex concepts into clear insights for stakeholder alignment and decision-making.
Technical Skills
Languages
Python, SQL, Java Script, Shell Scripting
Bigdata Technologies
Apache Spark, Kafka
Cloud Technologies
Azure Data Factory, Azure Databricks, Azure Blob Storage, Azure Synapse analytics,
Databases
Teradata, Oracle, DB2, PostgreSQL, SQL Server
ETL Tools
IBM DataStage, Azure Data Factory, Azure Databricks
Scheduling Tools
IBM Tivoli Workload scheduler
PROFESSIONAL EXPERIENCE
Cognizant Technology Solutions ( Client: American Airlines)
Senior Data Engineer
January 2021 – Present
Project Details:
Designed and implemented data pipeline architecture using Azure Data Factory (ADF), Databricks, and Spark, ensuring scalable and efficient data ingestion, transformation, and storage.
Built a data quality framework to validate datasets across multiple pipeline stages, facilitating anomaly detection, consistency checks, and improved data accuracy.
Created and managed catalogs and schemas in Unity Catalog, enabling centralized metadata management and data governance in Databricks.
Developed and optimized Spark applications in Databricks for large-scale data processing, reducing processing time by 20% through performance tuning and workflow optimization.
Orchestrated complex data ingestion and transformation workflows using Azure Data Factory (ADF), integrating structured, semi-structured, and unstructured data seamlessly.
Led end-to-end testing, including component testing and integration testing, to validate data pipelines for production readiness.
Collaborated with cross-functional teams to design and document data architecture, aligning with business requirements and ensuring future scalability.
Optimized existing Databricks workflows by identifying bottlenecks, tuning configurations, and improving overall pipeline performance.
Applied data security best practices, ensuring compliance with enterprise standards during encryption and decryption processes.
Provided technical guidance to team members by conducting design reviews and enforcing coding and architectural best practices.
Migrated ETL jobs to Databricks
Cognizant Technology Solutions ( Client: American Airlines)
Technical Lead
November 2017– January 2021
Project Details:
Developed and optimized Spark jobs with Scala APIs, benchmarking against traditional SQL for Azure-based projects.
Orchestrated ETL jobs with Azure Data Factory, enhancing data integration and workflow efficiency.
Implemented secure Snowflake data warehousing strategies, leveraging Azure Synapse Analytics to enhance reporting and compliance.
Worked with APIs for data interchange with Snowflake, focusing on Azure integration.
Developed data pipelines for Snowflake, achieving a 30% reduction in processing times with Azure tools.
Converted SQL queries to Apache PySpark in Azure Databricks, utilizing Delta Live Tables (DLT) to enhance data pipelines and improve transformation efficiency.
Converted SQL queries and procedures to Apache PySpark, Spark SQL and Spark Scala within Azure Databricks environments.
Designed and Developed ETL jobs using IBM DataStage
Scheduled ETL jobs using IBM Tivoli Workload Scheduler.
Performance tuning for long running Sql Queries.
Cognizant Technology Solutions (Client: American Airlines)
Senior Consultant
September 2011 – Nov 2017
Project Details:
Developed and optimized ETL jobs and Scheduled ETL jobs
Orchestrated ETL jobs with IBM DataStage, enhancing data integration and workflow efficiency..
Developed Data Marts adhering to Star Schema and Snowflake Schema methodologies using various data modeling tools.
Applied Agile (SCRUM) methodologies to streamline the software development lifecycle.
Designed and Developed ETL jobs using IBM DataStage
Scheduled ETL jobs using IBM Tivoli Workload Scheduler.
Performance tuning for long running Sql Queries.
Cognizant Technology Solutions (Client: American Airlines)
Programmer Analyst
December 2007 – September 2011
Project Details:
Gathered architectural requirements to ensure alignment with desired outcomes.
Developed and optimized ETL jobs and Scheduled ETL jobs
Orchestrated ETL jobs with IBM DataStage, enhancing data integration and workflow efficiency.
Developed Data Marts adhering to Star Schema and Snowflake Schema methodologies using various data modeling tools.
Designed and Developed ETL jobs using IBM DataStage
Scheduled ETL jobs using IBM Tivoli Workload Scheduler.
Performance tuning for long running Sql Queries.
Adopted an Agile 2-week sprint model, ensuring continuous improvement through regular meetings and deployment cycles.
EDUCATION
•Bachelor of Science – 2006, Kakatiya University, India.
CERTIFICATIONS
AZ-900: Microsoft Certified Azure Fundamentals
Salesforce Certified Platform Developer I
Salesforce Certified AI Associate