Shivakumar **************@*****.*** Mobile: +1-339-***-****
PROFESSIONAL SUMMARY
Results-oriented Data Engineer with over 5 years of experience designing, building, and optimizing scalable data pipelines and data architectures. Expert in Python, SQL, Spark, and cloud platforms including AWS and GCP. Skilled in ETL development, big data processing, and AI/ML integration to deliver high-impact analytics and business intelligence solutions. Experienced in deploying automated data workflows, ensuring data quality, and collaborating with cross-functional teams to meet business goals. Strong background in data warehousing, streaming data, and visualization using Tableau and Power BI.
TECHNICAL SKILLS
Databases: Oracle, MySQL, PostgreSQL, MongoDB, Hive, Hadoop, Snowflake
Programming Languages: Python, SQL, Java, R, MATLAB
Big Data & ETL Tools: Apache Spark, Talend, SSIS, Apache Kafka, Apache Airflow
AI/ML Frameworks: TensorFlow, Keras, Scikit-learn, OpenCV, NLP (Transformers, GPT APIs)
Cloud Platforms: AWS (S3, Lambda, EC2, SageMaker), Google Cloud Platform (GCP)
BI Tools: Tableau, Power BI, Amazon QuickSight, Google Data Studio
DevOps & Containerization: Git, GitLab, Docker, Kubernetes
Operating Systems: Linux, Windows
PROFESSIONAL EXPERIENCE
Data Engineer- Verizon, New York, NY 2023 – Present
Designed and optimized high-throughput ETL pipelines using Python and Apache Airflow, enhancing financial data ingestion reliability by 40%.
Automated data validation workflows with Python and Ansible, increasing system uptime and data accuracy by 25%.
Architected scalable AWS-based data solutions using S3, Lambda, and EC2 to support AI-driven data processing and analytics.
Integrated generative AI and ML models into data pipelines for real-time anomaly detection and predictive analytics in financial datasets.
Developed AI-augmented Tableau dashboards, enabling self-service analytics with actionable business insights.
Implemented comprehensive pipeline monitoring and alerting, reducing failures and downtime by 30%
IT- Data Engineer– JPMorgan Chase, Jersey City, NJ. 2022-2023
Designed and deployed a Generative AI-powered analytics assistant leveraging OpenAI GPT APIs for natural language financial data queries, reducing reporting time by 70%.
Integrated AI conversational analytics with Tableau dashboards to enhance interactive data exploration.
Developed secure backend services with Python and AWS Lambda to process user queries while ensuring compliance.
Created Talend ETL workflows to load structured and semi-structured data into PostgreSQL and Snowflake warehouses.
Containerized applications using Docker and Kubernetes and implemented CI/CD pipelines to streamline deployments.
Data Engineer– Cotiviti, Hyderabad, India 2018-2021
Consolidated business intelligence metadata across systems, enhancing data governance and quality.
Developed complex T-SQL queries and SSIS packages to optimize ETL workflows.
Designed and executed Talend ETL jobs, reducing data load times by 60%.
Applied machine learning and NLP algorithms to improve dynamic reporting and decision support.
Created Power BI dashboards presenting actionable healthcare claims insights.
Managed version control and collaborative development via Git.
Participated in migration of legacy data pipelines to cloud infrastructure to improve scalability.
EDUCATION:
Master of Science in Applied data analytics New England College. 2022
Coursework: Database Design and Implementation, Business Intelligence and Analytics, Data Mining for Data science and Data Analytics, and Machine learning data science.
CERTIFICATIONS:
Snow Pro Core Certification (S0023792) Google Associate Cloud Engineer (123722)
Databricks Get Started Days (Data Engineering + SQL Analytics and BI)
AI & Data Science Certification