Data Engineer Engineering

Location:
San Francisco, CA, 94107
Salary:
$200,000
Posted:
August 04, 2025

Resume:

BALASUBRAMANYAM VR

San Ramon, CA ***** 612-***-**** ****.**@*****.***

PROFESSIONAL SUMMARY

Highly skilled Software Engineer with 16+ years of experience in designing, building, and optimizing scalable data platforms. Expertise in data architecture, ETL/ELT pipelines, data modeling, cloud computing (AWS, Azure), and big data technologies. Adept at leading cross-functional teams, defining data governance strategies, and implementing robust data engineering solutions for real-time and batch processing. Strong experience in AWS-based data platforms, CI/CD, streaming (Kafka, Spark Streaming), APIs, and regulatory compliance.

TECHNICAL SKILLS

• Cloud & Big Data: AWS (S3, Lambda, Redshift, EventBridge, ECS), Apache Spark, Hadoop, HDFS, ADLS

• Data Engineering & ETL: Python, Java, Scala (Spark shell), Apache Airflow, DBT, Informatica, Talend, Azure Data Factory, Tableau, MDM, Shell scripting

• Data Modeling & Databases: SQL (PostgreSQL, Redshift, Hive, Oracle, MySQL, DB2), NoSQL (HBase, DynamoDB)

• Streaming & APIs: Spark Streaming, Apache Kafka, REST/SOAP APIs, OAuth 2.0

• DevOps & CI/CD: Azure Git, Jenkins, Docker, GitHub

• Data Governance & Quality: TDD methodology, data security (GDPR, HIPAA compliance)

• Other Tech Stack: Mainframes, COBOL, Korn Shell

WORK HISTORY

SR. DATA ENGINEER 12/2021 to CURRENT

Intapp Palo Alto, CA

• Architected and developed data pipelines using Python, AWS, Airflow, and DBT, improving time to market through reusability by 80%, reducing processing time by 70%, and cutting application downtime and job failures by 30%.

• Designed real-time and batch ETL solutions integrating SaaS and cloud applications (Azure Cost Management, HubSpot, Workday, SharePoint, etc.) with the Redshift data warehouse.

• Implemented Apache Airflow orchestration on AWS ECS, architected and automated transformations with DBT, and created dashboards in Tableau.

• Built an integration between Salesforce and NetSuite using AWS EventBridge for event-driven data sync.

• Defined a data governance framework and automated quality checks to improve data accuracy and integrity.

• Spearheaded the CI/CD automation of data pipelines with Azure Git and Jenkins, reducing deployment time by 60%.

• Conduct code reviews and technical discussions, and lead project planning and roadmap development.

• Lead a team of Data Engineers across multiple time zones, driving project strategy and development using agile methodology.

BIG DATA/DATA ENGINEERING TECH LEAD 05/2017 to 11/2021

Kaiser Permanente Pleasanton, CA

• Built and optimized a Hadoop-based data solution for Kaiser Permanente, enabling data integration for the mobile application (OWL) used by 80% of operating staff at Kaiser.

• Architected scalable data flows and crafted comprehensive technical design documents, presenting solutions to executive stakeholders.

• Developed real-time streaming data ingestion using Kafka and Spark Streaming, reducing data latency from hours to minutes.

• Operated within the Scaled Agile methodology, collaborating with Product Managers, Business Analysts, and the QA Team to deliver successful solutions.

TECH LEAD / ARCHITECT 05/2015 to 04/2017

DentaQuest Charlestown, MA

• Designed and developed an enterprise Data Warehouse from scratch, consolidating 10+ data sources and reducing report generation time from hours to minutes.

• Led a team of 40+ developers to implement ETL pipelines on SQL Server, integrating Python, Informatica, and Airflow for scalable data processing.

• Architected and implemented end-to-end data ingestion and archival solutions.

• Designed the process for high availability, scalability, and security compliance (HIPAA, SOC2).

ETL TECH LEAD 08/2014 to 04/2015

Wellmark BCBS Des Moines, IA

• Developed and optimized high-performance ETL pipelines using Informatica, Teradata, and Shell scripting, improving query execution by 50%.

• Implemented pushdown optimization, FastLoad, and Teradata query tuning to enhance overall performance.

• Led a team of Data Engineers across multiple time zones and managed end-to-end delivery of the project.

MAINFRAMES AND ETL DEVELOPER, TEAM LEAD 08/2007 to 07/2014

Target Corporation Minneapolis, MN

• Led a 15+ member team in developing mission-critical supply chain applications on Mainframes and Informatica ETL pipelines for data migration.

• Built and enhanced order management, replenishment, and supply distribution systems, improving inventory accuracy by 20%.

• Owned and supported critical retail applications as a Subject Matter Expert supporting business teams.

EDUCATION

M.Tech CAD/CAM 2007

Vellore Institute of Technology, Tamil Nadu, India

CERTIFICATIONS

• AWS Certified Solutions Architect - Associate

• Spark and Hadoop Developer

• AWS Certified Developer - Associate


