Nagendra Kumar
Data Engineer
Irving, Texas +1-469-***-**** *************.******@*****.*** LinkedIn
SUMMARY
Data Engineer with 4 years of experience designing, building, and optimizing scalable data pipelines and infrastructure. Proficient in leveraging cloud platforms like AWS and Azure, and skilled in Python, SQL, and Spark for big data processing. Adept at implementing ETL processes, ensuring data quality, and collaborating with cross-functional teams to deliver actionable insights. Proven track record of driving performance improvements and enabling data-driven decision-making in fast-paced environments. Passionate about solving complex data challenges to support business growth and innovation.
EXPERIENCE
Data Engineer, Morgan Stanley Jun 2023 – Present Remote, USA
• Developed and maintained scalable data pipelines using Azure Data Factory and Databricks to process and analyze financial data in real time.
• Designed optimized data models in Azure Synapse Analytics, improving query efficiency and reducing reporting latency by 35%.
• Automated data pipeline orchestration with Apache Airflow, ensuring seamless workflow execution and proactive error detection.
• Engineered real-time data ingestion frameworks with Azure Event Hubs and Kafka, enabling near-instantaneous updates for trading and risk management systems.
• Enforced strict data governance policies and regulatory compliance (SOX, SEC) by implementing robust encryption, role-based access controls, and audit logging via Azure Key Vault and Azure Monitor.
• Conducted performance tuning of ETL workflows and storage optimization in Azure Data Lake, reducing cloud costs by 25%.
• Partnered with quantitative analysts to deliver accurate, clean datasets for algorithmic trading and market risk models.
• Built reusable frameworks for data validation, lineage tracking, and reporting consistency, ensuring accuracy across critical financial pipelines.
Data Engineer, Accenture Jul 2020 – Jul 2022 India
• Implemented ETL pipelines using Azure Data Factory and Databricks to process large-scale datasets for retail clients, improving data integration efficiency by 30%.
• Migrated on-premises data systems to Azure Synapse Analytics, enhancing query performance and reducing operational costs by 40%.
• Built real-time data ingestion workflows for a retail project using Kafka and Azure Event Hubs, enabling low-latency analytics for inventory tracking.
• Optimized NoSQL database performance for an e-commerce client using AWS DynamoDB, ensuring high availability for real-time customer behavior tracking.
• Automated cloud infrastructure provisioning with Terraform for AWS-based projects and Azure Resource Manager (ARM) templates for Azure-specific workloads, reducing deployment errors by 25%.
• Improved data storage efficiency for a logistics client by optimizing AWS S3 for archival and analytical workloads.
• Developed Python-based data validation frameworks to ensure compliance and accuracy in reporting pipelines.
• Delivered interactive dashboards for multi-cloud clients using Power BI and Tableau, supporting executive-level decision-making.
• Ensured data security by implementing encryption, access controls, and compliance audits across cloud environments, tailored to each client’s requirements.
EDUCATION
UNIVERSITY OF NORTH TEXAS, DENTON, TX
Master of Science, Information Systems and Technology December 2023
GANDHI INSTITUTE OF TECHNOLOGY AND MANAGEMENT, VISAKHAPATNAM, INDIA
Bachelor of Technology, Electronics and Communication Engineering June 2021
SKILLS
Programming: Python, Java, Scala, R, SQL
Big Data Technologies: Apache Hadoop, Airflow, Apache Spark, Hive, Pig, Presto
Data Warehousing: Snowflake, Redshift, BigQuery, Teradata
Databases: MySQL, PostgreSQL, NoSQL (MongoDB, Cassandra, DynamoDB)
ETL Tools: Informatica, Talend, Apache NiFi, AWS Glue
Cloud Platforms: Azure (Azure Data Factory, Databricks), AWS (S3, EC2, Redshift, Lambda)
Version Control: Git, GitHub
Project Management: Jira, Agile (Scrum/Kanban)
Containerization and Orchestration: Docker, Kubernetes
Data Formats: JSON, Parquet, Avro, CSV
DevOps and CI/CD: Jenkins, Terraform, Ansible
Soft Skills: Problem-Solving, Collaboration, Verbal and Written Communication, Best Practices, Attention to Detail, Stakeholder Management
CERTIFICATIONS
Microsoft Power BI Data Analyst Professional Certificate: Coursera
Work Smarter with Microsoft Excel
Programming for Everybody (Getting Started with Python): Coursera
Introduction to Programming with MATLAB: Coursera