
Data Engineer | Python Developer | Cloud & AI Specialist

Location:
Jersey City, NJ
Salary:
$75/hr
Posted:
May 08, 2026

Resume:

KRISHNA TEJA

Data Engineer | Python Developer | Cloud & AI Specialist

**.*******@*****.*** | +1-732-***-**** | Jersey City, NJ

PROFESSIONAL SUMMARY

Results-driven Python Developer and Data Engineer with 10 years of experience architecting end-to-end data pipelines, building scalable cloud infrastructure, and delivering AI-powered solutions across the banking, healthcare, and retail domains. Proven expertise in ETL/ELT design, real-time data streaming, data warehousing, and generative AI integration. Adept at leveraging Azure, AWS, and modern data stack tools to process large-scale datasets and drive business intelligence.

10 years of expertise as a Data Engineer and Python Developer specializing in building enterprise-scale ETL/ELT pipelines, distributed data processing systems, cloud data lake/warehouse architectures, and real-time streaming platforms across banking, healthcare, and retail domains.

Certified Azure Data Engineer Associate (DP-203) — hands-on expert in Azure Data Factory (ADF) pipeline orchestration, Azure Databricks, Azure Synapse Analytics, Delta Lake, ADLS Gen2, and Azure Stream Analytics for end-to-end cloud data engineering.

Mastered RESTful API development with Flask and Django, adopting industry best practices for endpoint security, data serialization, and API versioning to facilitate robust backend services.

Architected and delivered end-to-end data pipelines processing 800GB+ daily using PySpark on Azure Databricks and AWS EMR — orchestrating complex DAGs with Apache Airflow and Azure Data Factory across 60+ batch jobs for front-office risk analytics.

Deep expertise in Apache Spark and PySpark — designed and tuned distributed batch and streaming jobs, managed Spark cluster configurations (executor tuning, memory management, broadcast joins, partition optimization), and implemented custom UDFs and aggregations on Azure Databricks and AWS EMR.
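
A minimal PySpark sketch of the tuning patterns above (broadcast join, key-based repartitioning, and a custom UDF); paths, table names, and sizing values are hypothetical:

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F
    from pyspark.sql.types import StringType

    spark = (
        SparkSession.builder
        .appName("broadcast-join-demo")
        # executor sizing shown for illustration; real values depend on the cluster
        .config("spark.executor.memory", "8g")
        .config("spark.sql.shuffle.partitions", "200")
        .getOrCreate()
    )

    trades = spark.read.parquet("/data/trades")         # large fact table (hypothetical)
    desks = spark.read.parquet("/data/desk_reference")  # small dimension table

    # Broadcast the small table so the join avoids shuffling the large one
    enriched = trades.join(F.broadcast(desks), on="desk_id", how="left")

    # Repartition on the aggregation key to reduce skew before wide operations
    enriched = enriched.repartition(64, "counterparty_id")

    # UDF example; built-in functions are preferred where they exist (faster)
    @F.udf(returnType=StringType())
    def risk_bucket(exposure):
        return "HIGH" if exposure and exposure > 1_000_000 else "LOW"

    enriched.withColumn("bucket", risk_bucket(F.col("exposure"))) \
        .write.mode("overwrite").parquet("/data/enriched_trades")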

Built and governed multi-zone cloud data lakes on Azure Data Lake Storage Gen2 and AWS S3 using Bronze/Silver/Gold medallion architecture, Delta Lake for ACID-compliant transactions, schema evolution, Z-ordering, and data compaction for optimized read performance.
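
A condensed sketch of that medallion flow, assuming Delta Lake on Databricks; all paths and column names are illustrative:

    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.appName("medallion-demo").getOrCreate()

    # Bronze: land raw data as-is so loads are replayable
    raw = spark.read.json("/mnt/landing/events/")
    raw.write.format("delta").mode("append").save("/mnt/lake/bronze/events")

    # Silver: deduplicate and conform; Delta provides ACID semantics on the lake
    bronze = spark.read.format("delta").load("/mnt/lake/bronze/events")
    silver = (
        bronze.dropDuplicates(["event_id"])
        .filter(F.col("event_ts").isNotNull())
        .withColumn("event_date", F.to_date("event_ts"))
    )
    (silver.write.format("delta")
        .mode("overwrite")
        .partitionBy("event_date")
        .save("/mnt/lake/silver/events"))

    # Gold: business-level aggregate served to BI consumers
    gold = silver.groupBy("event_date", "product").agg(F.count("*").alias("events"))
    gold.write.format("delta").mode("overwrite").save("/mnt/lake/gold/daily_events")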

Experience working with Azure Functions, Redis Cache, orchestrators, and BigQuery for parallel data processing and streaming pipelines.

Designed high-throughput real-time streaming architectures using Apache Kafka (producers, consumers, Kafka Streams, topic partitioning, consumer group rebalancing), AWS Kinesis Data Streams, and Azure Event Hubs processing millions of financial events per hour with guaranteed delivery.

Provisioned and version-controlled all cloud data infrastructure using Terraform (IaC) across Azure and AWS, managing Databricks workspaces, ADF pipelines, Redshift clusters, S3 data lakes, VNets, IAM roles, and security groups with full environment parity.

Implemented robust data quality frameworks using Great Expectations and custom PySpark validation rules: automated schema checks, null detection, referential integrity validation, statistical drift monitoring, and SLA alerting across all critical data pipelines.
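
A minimal sketch of the custom PySpark validation layer (the Great Expectations integration is not shown); table paths, columns, and thresholds are hypothetical:

    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.appName("dq-checks").getOrCreate()
    df = spark.read.format("delta").load("/mnt/lake/silver/trades")

    failures = []

    # Null checks on required columns
    for col in ["trade_id", "counterparty_id", "notional"]:
        nulls = df.filter(F.col(col).isNull()).count()
        if nulls > 0:
            failures.append(f"{col}: {nulls} null values")

    # Row-count reconciliation against the upstream bronze zone
    expected = spark.read.format("delta").load("/mnt/lake/bronze/trades").count()
    actual = df.count()
    if actual < expected * 0.99:  # tolerate <1% filtered rows; threshold illustrative
        failures.append(f"row count {actual} below 99% of source count {expected}")

    # Simple referential-integrity check against a reference table
    orphans = df.join(
        spark.read.format("delta").load("/mnt/lake/silver/counterparties"),
        on="counterparty_id", how="left_anti",
    ).count()
    if orphans:
        failures.append(f"{orphans} trades reference unknown counterparties")

    if failures:
        # In production this would page on-call via the alerting stack
        raise ValueError("Data quality checks failed: " + "; ".join(failures))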

Experience with pipeline orchestration using Apache Airflow, including building DAGs with retry logic, branching, and task dependencies, and working with Airflow deployments on Kubernetes and MWAA.
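
For illustration, a small DAG showing the retry, branching, and dependency patterns described above, assuming Airflow 2.4+; the task logic and schedule are placeholders:

    from datetime import datetime, timedelta

    from airflow import DAG
    from airflow.operators.empty import EmptyOperator
    from airflow.operators.python import BranchPythonOperator, PythonOperator

    def extract():
        print("pull data from source")  # placeholder extract step

    def choose_branch(**context):
        # Branch on the run date: weekends take the lightweight path
        return "full_load" if context["logical_date"].weekday() < 5 else "skip_load"

    with DAG(
        dag_id="example_etl",
        start_date=datetime(2024, 1, 1),
        schedule="0 6 * * *",
        catchup=False,
        default_args={
            "retries": 3,                        # retry transient failures
            "retry_delay": timedelta(minutes=5),
        },
    ) as dag:
        pull = PythonOperator(task_id="extract", python_callable=extract)
        branch = BranchPythonOperator(task_id="branch", python_callable=choose_branch)
        full_load = PythonOperator(task_id="full_load",
                                   python_callable=lambda: print("full load"))
        skip_load = EmptyOperator(task_id="skip_load")

        pull >> branch >> [full_load, skip_load]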

Built and maintained comprehensive data observability and monitoring stacks using Azure Monitor, AWS CloudWatch, ELK Stack, Prometheus, and Grafana, tracking pipeline SLAs, data freshness, row-count anomalies, schema drift, and infrastructure health KPIs.

Containerized data engineering workloads using Docker and orchestrated them on Kubernetes (AKS/EKS); managed Helm chart deployments, resource quotas, auto-scaling, and persistent volume claims for stateful data processing services.

Established CI/CD pipelines for data engineering using Azure DevOps and GitHub Actions: automated DAG deployment, Databricks notebook promotion, dbt model runs, data contract testing, and environment promotion gates across dev/staging/prod.

Mentored junior and offshore data engineers, led design reviews, and drove Agile delivery — translating complex financial and healthcare data requirements from quant analysts, risk managers, and business stakeholders into production-grade pipeline architectures.

EDUCATION & CERTIFICATIONS

Bachelor of Computer Science — PESIT, Bangalore, India

Microsoft Certified: Azure Data Engineer Associate (DP-203)

TECHNICAL SKILLS

Programming Languages: Python, SQL, JavaScript, Scala, Bash/Shell, R

Web Frameworks: Flask, Django, FastAPI, React.js, Node.js

Data Engineering & ETL: Apache Spark, PySpark, Apache Kafka, Apache Airflow, Apache NiFi, AWS Glue, Azure Data Factory, dbt, Delta Lake

Data Warehousing: Snowflake, Amazon Redshift, Azure Synapse Analytics, Google BigQuery, Apache Hive

Data Processing & Analytics: Pandas, NumPy, PySpark, Databricks, Scikit-learn, Dask

Generative AI & LLMs: LangChain, LlamaIndex, OpenAI API, Azure OpenAI, Hugging Face Transformers, Prompt Engineering, RAG (Retrieval-Augmented Generation), Vector Databases (Pinecone, Weaviate, ChromaDB)

Machine Learning & AI: TensorFlow, Keras, Scikit-learn, XGBoost, NLP, Sentiment Analysis, Predictive Modeling, TensorFlow Serving, Azure ML

Cloud Platforms: AWS (S3, EC2, RDS, Lambda, Glue, EMR, Kinesis, Redshift, Athena, SageMaker, IAM, CloudWatch, Step Functions, DynamoDB), Azure (Databricks, Data Factory, Synapse, Blob Storage, Data Lake, Cosmos DB, Function Apps, DevOps), GCP (BigQuery, Dataflow, Pub/Sub)

Databases: PostgreSQL, MySQL, Amazon RDS, DynamoDB, Azure Cosmos DB, MongoDB, Redis, Elasticsearch

DevOps & CI/CD: Docker, Kubernetes, Jenkins, GitHub Actions, Azure DevOps, Terraform, Bitbucket

Data Visualization & BI: Tableau, AWS QuickSight, ThoughtSpot, Plotly, D3.js, Power BI

Security & Compliance: HIPAA, GDPR, PCI-DSS, SOX, AWS IAM, Azure Active Directory, RBAC, OAuth 2.0, JWT

Testing & Monitoring: PyTest, Apache JMeter, AWS CloudWatch, Azure Monitor, ELK Stack, Prometheus, Grafana, New Relic

Version Control & Collaboration: Git, GitHub, Bitbucket, JIRA, Confluence, Agile/Scrum

PROFESSIONAL EXPERIENCE

Python Data Engineer

SMBC Sumitomo Mitsui Banking Corporation — Jersey City, NJ | Aug 2024 – Present

Enterprise-wide Counterparty Credit Risk (CCR) platform processing large-scale financial data using Python-based data pipelines, Markit RaaS analytics, and Azure cloud services to generate PFE, EPE, and exposure sensitivity metrics for front-office trading desks.

Built credit simulation pipelines to model what-if scenarios on trade exposures and market data — sourced market data from Niwa APIs, implemented CUSIP/redcode/ticker waterfall mapping logic, applied enrichment and normalization transformations, and generated Markit RaaS-compatible structured input datasets.
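
The waterfall mapping reduces to a simple precedence loop; a pandas sketch of the idea (the Niwa and Markit schemas are proprietary, so all column names and reference data here are hypothetical):

    import pandas as pd

    def waterfall_map(trades: pd.DataFrame, ref: pd.DataFrame) -> pd.DataFrame:
        """Resolve each trade to a reference entity, trying identifiers in
        priority order: CUSIP first, then redcode, then ticker."""
        resolved = trades.copy()
        resolved["entity_id"] = pd.NA
        for id_col in ["cusip", "redcode", "ticker"]:   # waterfall priority
            unresolved = resolved["entity_id"].isna()
            lookup = ref.dropna(subset=[id_col]).set_index(id_col)["entity_id"]
            resolved.loc[unresolved, "entity_id"] = (
                resolved.loc[unresolved, id_col].map(lookup)
            )
        return resolved

    trades = pd.DataFrame({
        "trade_id": [1, 2, 3],
        "cusip":   ["037833100", None, None],
        "redcode": [None, "2H6677", None],
        "ticker":  [None, None, "AAPL"],
    })
    ref = pd.DataFrame({
        "entity_id": ["E1", "E2", "E3"],
        "cusip":   ["037833100", None, None],
        "redcode": [None, "2H6677", None],
        "ticker":  [None, None, "AAPL"],
    })
    print(waterfall_map(trades, ref)[["trade_id", "entity_id"]])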

Implemented Python runtime upgrade to 3.13 across CI/CD, containers, and environments, resolving dependency issues and ensuring long-term maintainability and security compliance.

Developed a monitoring and observability stack using Egmonitoring and Azure Monitor — configured custom metric alerts on App Services, Function Apps, and Storage Accounts (~1.5TB), tracking HTTP error rates, memory utilization, TCP availability, SSL certificate expiry, API latency P95, and storage capacity thresholds.

Designed Azure Disaster Recovery (DR) strategies for CRISTAL workloads — implemented geo-redundant backups, automated failover runbooks, data consistency validation across Azure Storage and Databricks, and quarterly DR drills ensuring 99.9% SLA for critical risk analytics systems.

Built and maintained CI/CD pipelines in Azure DevOps (migrating to GitHub Actions): authored multi-stage YAML pipelines with unit testing (PyTest), integration tests, Docker image builds to ACR, Databricks notebook promotion, and branch-based environment deployments.

Led end-to-end design and delivery of the Archive Framework and Archive Statistics platform — standardized archival workflows, automated storage metric tracking across 60+ batch jobs, and built a REST API layer exposing snapshot-based reporting, eliminating manual validation processes entirely.

Designed and implemented Python-based ETL data pipelines (Pandas, PySpark, Django) ingesting and processing 800GB+ of financial datasets daily — built parameterized, reusable pipeline components with configurable source/target mappings supporting 60+ batch jobs across multiple risk analytics workloads.
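
A toy sketch of the parameterized-component idea: one generic step driven entirely by a source/target mapping config (the config shape shown is illustrative, not the production schema):

    import json
    import pandas as pd

    # A pipeline step is described entirely by configuration, so the same
    # code serves many batch jobs with different sources, targets, and rules
    config = json.loads("""
    {
      "source":  {"path": "/data/in/positions.csv", "format": "csv"},
      "target":  {"path": "/data/out/positions.parquet", "format": "parquet"},
      "rename":  {"cpty": "counterparty_id", "notl": "notional"},
      "filters": [{"column": "notional", "op": "gt", "value": 0}]
    }
    """)

    def run_step(cfg: dict) -> None:
        readers = {"csv": pd.read_csv, "parquet": pd.read_parquet}
        df = readers[cfg["source"]["format"]](cfg["source"]["path"])

        df = df.rename(columns=cfg.get("rename", {}))

        # Apply declarative filters via pandas comparison methods
        for f in cfg.get("filters", []):
            df = df[getattr(df[f["column"]], f["op"])(f["value"])]

        writers = {"parquet": df.to_parquet, "csv": df.to_csv}
        writers[cfg["target"]["format"]](cfg["target"]["path"], index=False)

    run_step(config)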

Built archive statistics collection system with a REST API layer (Django + Azure App Services) exposing storage metrics, snapshot-based reporting, and diff comparisons across release branches — reducing operational support tickets by 35% through automated visibility.

Led multiple production releases and PRs end-to-end, driving QA stabilization, fixing environment inconsistencies, and ensuring dev–release parity across branches.

Built and optimized data transformation pipelines to produce structured simulation outputs and risk metrics, improving data accuracy, traceability, and alignment with downstream Markit analytics systems.

Engineered distributed PySpark jobs on Azure Databricks to parallelize heavy financial data transformations; refactored iterrows-based Pandas code into vectorized Spark DataFrames using broadcast joins, partition pruning, and cache strategies, cutting batch runtimes by 40% on high-volume datasets.

Implemented Apache Kafka producers and consumers for real-time ingestion of market data feeds and trade lifecycle events — configured topic partitioning, consumer group offsets, dead-letter queues, and exactly-once semantics to ensure no data loss in credit exposure update pipelines.
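
A minimal kafka-python sketch of the consumer side: manual offset commits plus a dead-letter topic for poison messages (true exactly-once additionally requires transactional producers or idempotent sinks; broker address and topic names are hypothetical):

    import json
    from kafka import KafkaConsumer, KafkaProducer

    consumer = KafkaConsumer(
        "trade-events",
        bootstrap_servers="localhost:9092",
        group_id="exposure-updater",
        enable_auto_commit=False,   # commit only after successful processing
        value_deserializer=lambda b: json.loads(b.decode("utf-8")),
    )
    dlq = KafkaProducer(
        bootstrap_servers="localhost:9092",
        value_serializer=lambda v: json.dumps(v).encode("utf-8"),
    )

    for msg in consumer:
        try:
            event = msg.value
            # ... apply exposure update logic here ...
            consumer.commit()       # at-least-once: commit after success
        except Exception as exc:
            # Park poison messages on a dead-letter topic instead of blocking
            dlq.send("trade-events-dlq", {"error": str(exc), "payload": msg.value})
            dlq.flush()
            consumer.commit()       # skip the bad record once it is parked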

Implemented Generative AI-powered document analysis using LangChain and Azure OpenAI to automatically extract and summarize counterparty risk reports, reducing manual review time by 60%.

Built a RAG (Retrieval-Augmented Generation) pipeline integrated with internal knowledge bases and Markit documentation, enabling trading desk analysts to query risk policies and model outputs using natural language.
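
The production pipeline used LangChain against Azure OpenAI; the standalone sketch below shows the same retrieve-then-generate pattern with the plain OpenAI v1 SDK and an in-memory vector store (documents, model names, and the question are illustrative):

    import numpy as np
    from openai import OpenAI  # assumes the v1 OpenAI Python SDK

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    docs = [
        "PFE limits for single-name CDS are reviewed quarterly.",
        "Collateral thresholds are defined per CSA agreement.",
        "Wrong-way risk exposures require desk-head sign-off.",
    ]

    def embed(texts):
        resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
        return np.array([d.embedding for d in resp.data])

    doc_vecs = embed(docs)

    def answer(question: str) -> str:
        # Retrieve: rank documents by cosine similarity to the question embedding
        q = embed([question])[0]
        sims = doc_vecs @ q / (np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q))
        context = "\n".join(docs[i] for i in sims.argsort()[::-1][:2])

        # Generate: ground the model's answer in the retrieved context
        resp = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[
                {"role": "system", "content": "Answer using only the provided context."},
                {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
            ],
        )
        return resp.choices[0].message.content

    print(answer("Who must approve wrong-way risk exposures?"))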

Developed an LLM-powered anomaly explanation system that detects statistical outliers in credit exposure data and generates human-readable risk narratives for quant and front-office teams.

Integrated vector search (Pinecone) with Azure OpenAI embeddings to build a semantic search layer over 5+ years of archived risk reports, enabling instant regulatory and audit-ready document retrieval.

Engineered fault-tolerant data processing frameworks with multi-layer exception handling, checkpoint-based restartability, validation layers, row-count reconciliation, and data quality assertions — ensuring zero data loss and full accuracy for downstream CCR risk model inputs.

Optimized large-scale SQL queries across Azure Synapse and PostgreSQL — rewrote complex joins using CTEs and window functions, applied partition elimination, rebuilt statistics, and analyzed execution plans to reduce query runtimes from minutes to seconds on 100M+ row tables.
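
A representative rewrite: replacing a greatest-n-per-group self-join with a single ROW_NUMBER() window scan, executed here via psycopg2 (table, columns, and connection string are hypothetical):

    import psycopg2

    # "Latest snapshot per counterparty" as one window scan; the date
    # predicate enables partition elimination on a date-partitioned table
    LATEST_EXPOSURES = """
    WITH ranked AS (
        SELECT counterparty_id,
               exposure,
               as_of_date,
               ROW_NUMBER() OVER (
                   PARTITION BY counterparty_id
                   ORDER BY as_of_date DESC
               ) AS rn
        FROM exposures
        WHERE as_of_date >= %(start)s
    )
    SELECT counterparty_id, exposure, as_of_date
    FROM ranked
    WHERE rn = 1;
    """

    with psycopg2.connect("dbname=risk") as conn, conn.cursor() as cur:
        cur.execute(LATEST_EXPOSURES, {"start": "2025-01-01"})
        for row in cur.fetchmany(10):
            print(row)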

Integrated Azure services (Function Apps, Blob Storage, Redis Cache, Key Vault, Service Bus) to build a scalable event-driven processing layer — implemented async data processing patterns, distributed caching for hot market data, and secure secrets management via Key Vault references.

Applied data visualization using Plotly dashboards and ThoughtSpot search-driven analytics — built custom risk metric dashboards, exposure trend charts, and simulation output reports enabling self-service analytics for front-office and risk management stakeholders.

Mentored offshore developers, led sprint planning, conducted code reviews, and collaborated with quant analysts, risk managers, and operations teams — translating complex financial risk requirements into scalable, maintainable Python and Spark data engineering solutions.

Environment: Python, PySpark, Apache Spark, Pandas, Django, Azure Databricks, Azure Data Lake Gen2 (ADLS Gen2), Delta Lake, Azure Data Factory (ADF), Apache Kafka, Azure Function Apps, Azure App Services, Azure Blob Storage, Azure Redis Cache, Azure Key Vault, Azure Service Bus, AKS, Terraform, Azure DevOps, GitHub Actions, Docker, ACR, PostgreSQL, PyTest, Markit RaaS, Niwa APIs, Egmonitoring, Azure Monitor, Log Analytics

Python Full Stack Developer

Orion Health — Boston, MA | Jun 2023 – May 2024

Predictive Analytics for Early Detection of Chronic Diseases — web-based platform leveraging Azure cloud and Python ML tools to identify individuals at risk of diabetes, heart disease, and hypertension from healthcare data.

Initiated the project by collaborating with healthcare professionals to understand the requirements for early detection of chronic diseases through predictive analytics.

Built Flask RESTful APIs serving 10,000+ concurrent users — implemented connection pooling with SQLAlchemy, query result caching via Azure Redis Cache, JWT/OAuth 2.0/OpenID Connect authentication with Azure AD, and Gunicorn multi-worker deployment on AKS for high availability.
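
A pared-down Flask sketch of the pooling, caching, and JWT pieces (Azure AD/OIDC wiring and the Gunicorn deployment are omitted; connection strings, table, and route names are hypothetical):

    import json

    import redis
    from flask import Flask, jsonify
    from flask_jwt_extended import JWTManager, jwt_required
    from flask_sqlalchemy import SQLAlchemy

    app = Flask(__name__)
    app.config["SQLALCHEMY_DATABASE_URI"] = "postgresql://user:pass@db/health"
    app.config["SQLALCHEMY_ENGINE_OPTIONS"] = {"pool_size": 20, "max_overflow": 10}
    app.config["JWT_SECRET_KEY"] = "change-me"  # from a secret store in production

    db = SQLAlchemy(app)
    jwt = JWTManager(app)
    cache = redis.Redis(host="localhost", port=6379)

    @app.get("/api/patients/<int:pid>/risk")
    @jwt_required()
    def risk_score(pid: int):
        # Serve from Redis when possible; fall back to the database on a miss
        cached = cache.get(f"risk:{pid}")
        if cached:
            return jsonify(json.loads(cached))
        row = db.session.execute(
            db.text("SELECT score FROM risk_scores WHERE patient_id = :pid"),
            {"pid": pid},
        ).first()
        payload = {"patient_id": pid, "score": row[0] if row else None}
        cache.setex(f"risk:{pid}", 300, json.dumps(payload))  # 5-minute TTL
        return jsonify(payload)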

Implemented JWT-based authentication and authorization in a Flask application, ensuring secure data access and transfer across multiple client systems.

Implemented advanced React features such as Suspense and Lazy for code-splitting and lazy loading, optimizing application performance and user experience.

Integrated Material-UI for React to leverage pre-built UI components, ensuring a consistent, modern design language across the application.

Developed a comprehensive back-end using Flask, structuring RESTful APIs to manage healthcare data inputs, model predictions, and user interactions.

Utilized Flask-Migrate for database schema migrations, ensuring smooth evolution of the database structure alongside application updates.

Implemented secure authentication and authorization mechanisms using OAuth 2.0 and OpenID Connect, safeguarding access to the web application.

Implemented ThoughtSpot to enable self-service analytics across the organization, reducing dependency on the IT department for report generation and data insights.

Implemented data observability using Azure Monitor, Application Insights, and ELK Stack — built custom dashboards tracking pipeline job durations, row throughput, data freshness SLAs, API latency P99, Databricks cluster utilization, and Kafka consumer lag with automated PagerDuty alerting.

Ensured full HIPAA compliance — implemented PHI field-level encryption, PII masking in all logs, audit trail logging to Log Analytics, Azure Policy guardrails, Defender for Cloud recommendations, and conducted quarterly vulnerability assessments and penetration tests.

Designed and deployed machine learning models using TensorFlow, focusing on predictive analytics for early detection of chronic diseases.

Designed and implemented end-to-end ETL pipelines using Azure Data Factory (ADF) to ingest EHR data from HL7 FHIR feeds, wearable device sensors, lab result APIs, and hospital EMR exports into Azure Data Lake Storage Gen2 — built parameterized ADF pipelines with mapping data flows, self-hosted integration runtimes, and automated schema drift detection.

Managed Azure Data Lake Storage Gen2 as the central data repository — structured data zones using Bronze/Silver/Gold medallion architecture, applied Delta Lake for ACID transactions and time-travel, configured lifecycle management policies, ACL-based access controls, and AES-256 encryption-at-rest for HIPAA compliance.

Leveraged Azure Blob Storage for storing healthcare data, including electronic health records (EHRs), medical images, and lab results, ensuring scalability and security.

Designed and implemented robust data processing pipelines using PySpark, optimizing data transformation and aggregation tasks to enhance performance and scalability in big data environments.

Contributed to the migration of legacy data processing scripts to PySpark, resulting in a 50% improvement in processing times and a significant reduction in computational costs.

Integrated Python and PowerShell scripts to leverage the strengths of both languages for comprehensive automation solutions.

Implemented Apache Airflow DAGs for orchestrating multi-step healthcare data workflows — built custom operators for ADF pipeline triggers, Databricks job submissions, and data quality checks; configured SLA callbacks, retry logic, task dependencies, and XCom-based data passing across pipeline stages.

Deployed predictive ML models as containerized REST services on Azure Kubernetes Service (AKS) — authored Helm charts for versioned deployments, configured horizontal pod autoscaling based on request latency, implemented blue-green deployments for zero-downtime model updates serving real-time disease risk predictions.

Configured Azure Cosmos DB (NoSQL, multi-region write) for real-time ingestion of high-frequency wearable device streams and unstructured health records — designed partition key strategies for even distribution, set TTL policies for data expiry, and integrated Change Feed for downstream stream processing.
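
A short azure-cosmos SDK sketch of the container setup described above: a device-id partition key plus a container-level default TTL (account URL, names, and the 30-day TTL are illustrative):

    from azure.cosmos import CosmosClient, PartitionKey

    client = CosmosClient(
        url="https://myaccount.documents.azure.com:443/",
        credential="<key>",  # use managed identity / Key Vault in production
    )
    db = client.create_database_if_not_exists("telemetry")

    # Partition on device id for even write distribution; expire raw
    # readings automatically after 30 days via the container default TTL
    container = db.create_container_if_not_exists(
        id="wearable_readings",
        partition_key=PartitionKey(path="/deviceId"),
        default_ttl=30 * 24 * 3600,
    )

    container.upsert_item({
        "id": "reading-001",
        "deviceId": "dev-42",
        "heartRate": 72,
        "ts": "2024-01-15T10:30:00Z",
    })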

Environment: Python, PySpark, Apache Spark, Pandas, Azure Databricks, Delta Live Tables, Azure Data Factory (ADF), Azure Data Lake Gen2 (ADLS Gen2), Delta Lake, Azure Synapse Analytics, Apache Kafka, Azure Event Hubs, Apache Airflow, Azure ML, MLflow, AKS, Helm, Terraform, Azure Cosmos DB, Azure SQL, Azure Redis Cache, Azure Purview, Azure Monitor, Azure DevOps, GitHub Actions, Docker, ACR, Flask, TensorFlow, XGBoost, Scikit-learn, PyTest, ELK Stack

Sr. Python Full Stack Developer

GM Financial — Fort Worth, TX | Jan 2022 – Jan 2023

AI-powered customer service chatbot for banking, integrating Flask/Django backend, React frontend, AWS deployment, and TensorFlow/Keras NLP models for intelligent, context-sensitive banking dialogues.

Built and trained NLP models using TensorFlow and Keras with sequence-to-sequence architecture and Word2Vec embeddings, enabling the chatbot to process natural language banking queries with high accuracy.

Deployed TensorFlow Serving for scalable, high-performance model inference; established continuous training pipeline incorporating new user interactions for ongoing accuracy improvement.

Orchestrated ETL pipelines using AWS Glue for processing transaction data and user interactions, and leveraged AWS Lambda for serverless back-end logic to optimize resource usage.

Implemented Flask-SocketIO and WebSocket technology for bi-directional real-time chat communication; designed microservices architecture with Docker and Kubernetes for scalability.

Secured the application with SSL/TLS encryption, Flask-JWT-Extended session management, and OAuth 2.0, adhering to banking compliance standards (PCI-DSS, SOX).

Utilized Terraform for infrastructure-as-code across AWS, reducing deployment time by 30% and established CI/CD pipelines with Jenkins for automated build/test/deploy workflows.

Environment: Python, Flask, Django, React.js, TensorFlow, Keras, AWS (S3, EC2, RDS, DynamoDB, Glue, Lambda, API Gateway), Snowflake, Apache Airflow, Jenkins, Bitbucket, Terraform, Docker, Kubernetes, PowerShell

Sr. Python Developer

The Kroger Co. — Cincinnati, OH | Mar 2020 – Feb 2021

Customer Segmentation platform using ML models for segmentation, credit risk assessment, and fraud detection — improving marketing ROI, reducing loan defaults, and enhancing user engagement.

Designed and implemented ML pipelines using Pandas and Scikit-learn for customer segmentation with K-means clustering, driving measurable improvements in marketing ROI.
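
A compact scikit-learn sketch of the segmentation approach, using illustrative RFM-style features in place of the real transaction-derived feature set:

    import pandas as pd
    from sklearn.cluster import KMeans
    from sklearn.preprocessing import StandardScaler

    # Illustrative recency/frequency/monetary features
    customers = pd.DataFrame({
        "recency_days": [5, 40, 3, 120, 60, 7],
        "frequency":    [30, 4, 45, 1, 3, 25],
        "monetary":     [900.0, 120.0, 1500.0, 30.0, 80.0, 700.0],
    })

    # Standardize so no single feature dominates the distance metric
    X = StandardScaler().fit_transform(customers)

    kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)
    customers["segment"] = kmeans.fit_predict(X)
    print(customers.groupby("segment").mean())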

Deployed the web application on AWS Elastic Beanstalk; configured AWS S3 for large dataset storage, AWS Lambda for serverless ML inference, and AWS RDS for relational data management.

Managed container orchestration with Kubernetes on AWS EKS, enabling automated scaling, self-healing, and load balancing; containerized all components with Docker.

Built React.js front-end with Redux state management and custom D3.js visualization components for interactive customer segmentation dashboards.

Implemented CI/CD pipelines with Jenkins and AWS CodePipeline; set up monitoring with AWS CloudWatch and ELK Stack for real-time application performance insights.

Environment: Python, Flask, Django, AWS (S3, EC2, RDS, DynamoDB, Glue, Lambda, Elastic Beanstalk, EKS, CloudWatch), React.js, D3.js, Scikit-learn, Apache Airflow, Jenkins, Terraform, Docker, Kubernetes, ETL

Sr. Python Full Stack Developer

Cerner Corporation — Kansas City, MO | Dec 2016 – Feb 2020

Electronic Health Records Integration Platform — seamlessly connecting disparate healthcare systems via HL7 FHIR standards, enabling efficient data exchange, improved patient care, and HIPAA-compliant workflows.

Designed scalable microservices architecture for EHR integration, built React.js front-end dashboards for patient record management, and developed Flask RESTful APIs with HL7 FHIR compliance.

Designed and implemented data pipelines using Apache Kafka and Apache NiFi for ingesting and processing EHR data from multiple disparate healthcare systems.

Integrated OAuth 2.0 and JWT authentication, conducted HIPAA security audits and vulnerability assessments; deployed on AWS EC2, RDS, and S3.

Implemented real-time data synchronization with WebSocket technology; built comprehensive logging and monitoring with ELK Stack and AWS CloudWatch.

Environment: Python, Flask, React.js, Apache Kafka, Apache NiFi, AWS (EC2, RDS, S3), Docker, Kubernetes, Jenkins, ELK Stack, PostgreSQL, HL7 FHIR, OAuth 2.0, PyTest

Python Full Stack Developer

BlackRock — New York City, NY | Sep 2013 – Nov 2016

Stock Market Data Analysis Pipeline — ingesting, processing, and analyzing vast stock market datasets using data warehousing, real-time streaming, and ML for traders, investment firms, and financial analysts.

Designed and developed data processing pipelines using Apache Kafka for real-time data streaming and Apache Spark for large-scale stock market data analysis.
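
A minimal Structured Streaming sketch of the Kafka-to-Spark path: parse JSON ticks and maintain windowed per-symbol aggregates (broker, topic, schema, and paths are hypothetical):

    from pyspark.sql import SparkSession, functions as F
    from pyspark.sql.types import (DoubleType, StringType, StructField,
                                   StructType, TimestampType)

    spark = SparkSession.builder.appName("ticks").getOrCreate()

    schema = StructType([
        StructField("symbol", StringType()),
        StructField("price", DoubleType()),
        StructField("ts", TimestampType()),
    ])

    # Read the tick stream from Kafka and parse the JSON payload
    ticks = (
        spark.readStream.format("kafka")
        .option("kafka.bootstrap.servers", "localhost:9092")
        .option("subscribe", "market-ticks")
        .load()
        .select(F.from_json(F.col("value").cast("string"), schema).alias("t"))
        .select("t.*")
    )

    # 1-minute per-symbol aggregates with a watermark to bound state
    agg = (
        ticks.withWatermark("ts", "2 minutes")
        .groupBy(F.window("ts", "1 minute"), "symbol")
        .agg(F.avg("price").alias("avg_price"), F.count("*").alias("tick_count"))
    )

    query = (
        agg.writeStream.outputMode("append")
        .format("parquet")
        .option("path", "/data/tick_aggregates")
        .option("checkpointLocation", "/data/checkpoints/ticks")
        .start()
    )
    query.awaitTermination()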

Integrated Python ML libraries (Pandas, NumPy, Scikit-learn, TensorFlow) for predictive modeling; built React.js dashboards with D3.js and Highcharts for real-time market visualization.

Deployed on AWS (EC2, RDS, S3), implemented SQL/NoSQL data storage (PostgreSQL, MongoDB), and configured Prometheus, ELK Stack, and Grafana for monitoring.

Implemented CI/CD with GitLab, containerization with Docker, and Kubernetes orchestration for reliable deployment under variable market data loads.

Environment: Python, Flask, React.js, Apache Kafka, Apache Spark, AWS (EC2, S3, RDS, Lambda, SageMaker, IAM), TensorFlow, Scikit-learn, PostgreSQL, MongoDB, Docker, Kubernetes, Grafana, Prometheus, ELK Stack


