Post Job Free
Sign in

Azure Data Machine Learning

Location:
Irving, TX
Posted:
April 23, 2025

Contact this candidate

Resume:

Ranjith Kumar Kakarapu

*************@*****.***

+1-940-***-****

EXECUTIVE SUMMARY

With over 15 years of experience as a technology leader, I have a proven track record in driving data integration initiatives for large enterprises.

Expertise in big data technologies like Apache Spark, and Kafka, designing scalable data pipelines, batch processing, and real-time streaming solutions.

Designed and optimized ETL workflows in Databricks, leveraging Apache Spark for high-performance data transformations and analytics.

Proficient in ETL tools (IBM DataStage Azure Data Factory) and cloud platforms (Databricks, Azure Data Lake), optimizing data ingestion and storage.

Extensive expertise in data warehousing, data engineering, and SQL-based analytics within Oracle, Teradata enabling scalable and efficient data processing.

Expertise in Python, SQL and PL/SQL for data manipulation, analysis, and model development.

Built and optimized machine learning models (regression, classification, clustering, time-series forecasting), improving forecasting accuracy and operational insights.

Skilled in SQL and PL/SQL, query optimization, stored procedures, and source-to-target mapping (STM) to ensure data accuracy and integrity.

Experience in cloud-based data solutions using Azure Data Lake, improving data storage optimization and query performance by up to 30%.

Strong expertise in data cleansing, transformation, and migration using SQL, Python, and ETL tools, ensuring data accuracy, consistency, and seamless integration into target systems.

Experienced in CI/CD, DevOps, and UAT testing with JIRA, Rally, and ServiceNow, ensuring seamless data deployment.

Developed precise technical documentation and impactful presentations using MS Word and PowerPoint, translating complex concepts into clear insights for stakeholder alignment and decision-making.

Technical Skills

Languages

Python, SQL, Java Script, Shell Scripting

Bigdata Technologies

Apache Spark, Kafka

Cloud Technologies

Azure Data Factory, Azure Databricks, Azure Blob Storage, Azure Synapse analytics,

Databases

Teradata, Oracle, DB2, PostgreSQL, SQL Server

ETL Tools

IBM DataStage, Azure Data Factory, Azure Databricks

Scheduling Tools

IBM Tivoli Workload scheduler

PROFESSIONAL EXPERIENCE

Cognizant Technology Solutions ( Client: American Airlines)

Senior Data Engineer

January 2021 – Present

Project Details:

Designed and implemented data pipeline architecture using Azure Data Factory (ADF), Databricks, and Spark, ensuring scalable and efficient data ingestion, transformation, and storage.

Built a data quality framework to validate datasets across multiple pipeline stages, facilitating anomaly detection, consistency checks, and improved data accuracy.

Created and managed catalogs and schemas in Unity Catalog, enabling centralized metadata management and data governance in Databricks.

Developed and optimized Spark applications in Databricks for large-scale data processing, reducing processing time by 20% through performance tuning and workflow optimization.

Orchestrated complex data ingestion and transformation workflows using Azure Data Factory (ADF), integrating structured, semi-structured, and unstructured data seamlessly.

Led end-to-end testing, including component testing and integration testing, to validate data pipelines for production readiness.

Collaborated with cross-functional teams to design and document data architecture, aligning with business requirements and ensuring future scalability.

Optimized existing Databricks workflows by identifying bottlenecks, tuning configurations, and improving overall pipeline performance.

Applied data security best practices, ensuring compliance with enterprise standards during encryption and decryption processes.

Provided technical guidance to team members by conducting design reviews and enforcing coding and architectural best practices.

Migrated ETL jobs to Databricks

Cognizant Technology Solutions ( Client: American Airlines)

Technical Lead

November 2017– January 2021

Project Details:

Developed and optimized Spark jobs with Scala APIs, benchmarking against traditional SQL for Azure-based projects.

Orchestrated ETL jobs with Azure Data Factory, enhancing data integration and workflow efficiency.

Implemented secure Snowflake data warehousing strategies, leveraging Azure Synapse Analytics to enhance reporting and compliance.

Worked with APIs for data interchange with Snowflake, focusing on Azure integration.

Developed data pipelines for Snowflake, achieving a 30% reduction in processing times with Azure tools.

Converted SQL queries to Apache PySpark in Azure Databricks, utilizing Delta Live Tables (DLT) to enhance data pipelines and improve transformation efficiency.

Converted SQL queries and procedures to Apache PySpark, Spark SQL and Spark Scala within Azure Databricks environments.

Designed and Developed ETL jobs using IBM DataStage

Scheduled ETL jobs using IBM Tivoli Workload Scheduler.

Performance tuning for long running Sql Queries.

Cognizant Technology Solutions (Client: American Airlines)

Senior Consultant

September 2011 – Nov 2017

Project Details:

Developed and optimized ETL jobs and Scheduled ETL jobs

Orchestrated ETL jobs with IBM DataStage, enhancing data integration and workflow efficiency..

Developed Data Marts adhering to Star Schema and Snowflake Schema methodologies using various data modeling tools.

Applied Agile (SCRUM) methodologies to streamline the software development lifecycle.

Designed and Developed ETL jobs using IBM DataStage

Scheduled ETL jobs using IBM Tivoli Workload Scheduler.

Performance tuning for long running Sql Queries.

Cognizant Technology Solutions (Client: American Airlines)

Programmer Analyst

December 2007 – September 2011

Project Details:

Gathered architectural requirements to ensure alignment with desired outcomes.

Developed and optimized ETL jobs and Scheduled ETL jobs

Orchestrated ETL jobs with IBM DataStage, enhancing data integration and workflow efficiency.

Developed Data Marts adhering to Star Schema and Snowflake Schema methodologies using various data modeling tools.

Designed and Developed ETL jobs using IBM DataStage

Scheduled ETL jobs using IBM Tivoli Workload Scheduler.

Performance tuning for long running Sql Queries.

Adopted an Agile 2-week sprint model, ensuring continuous improvement through regular meetings and deployment cycles.

EDUCATION

•Bachelor of Science – 2006, Kakatiya University, India.

CERTIFICATIONS

AZ-900: Microsoft Certified Azure Fundamentals

Salesforce Certified Platform Developer I

Salesforce Certified AI Associate



Contact this candidate