
BI Data Engineer

Location: Jersey City, NJ
Posted: July 30, 2025


Resume:

Niveditha Channapatna Raju

Phone: 201-***-**** Email: *********.**.*@*****.*** LinkedIn: linkedin.com/in/niveditharaju/ GitHub: github.com/Niv-Raj

Results-driven Data Professional with 4 years of experience specializing in data architecture, machine learning, and designing scalable data pipelines. Skilled in statistical analysis, data modelling, and using BI tools like Tableau, Power BI, and Looker to create impactful dashboards. Proficient in Python, R, SQL, Snowflake, AWS, and Azure, with strong expertise in API integration for seamless data extraction and automation. Experienced in managing both structured and unstructured data across cloud storage solutions such as AWS S3, Azure Blob Storage, and Snowflake Data Lake. Expertise in ETL pipelines and in using frameworks like Azure Databricks and PySpark to handle large-scale datasets. Adept at applying machine learning techniques like predictive modelling, classification, and clustering to deliver data-driven business solutions. Experienced in database schema design, indexing, and performance tuning for MySQL, PostgreSQL, and Snowflake. Strong understanding of data governance, ensuring compliance and data integrity, with a proven track record of collaborating with cross-functional teams and working in client-facing roles to support strategic decision-making.

TECHNICAL SKILLS

Programming & Data Tools: Python, R, SQL, PySpark, C, MATLAB, Apache Airflow, Integrate.io, Git, GitHub, ETL pipelines

Cloud Data Engineering: AWS (S3, Redshift, Glue, Lambda), Azure (Databricks, Data Lake, Synapse, Data Factory, DevOps), Snowflake

BI & Databases: Tableau, Power BI, Looker, MySQL, SQL Server, PostgreSQL, Data Modelling, API Integration, A/B Testing

ML & Analytics: Scikit-learn, XGBoost, LightGBM, TensorFlow, Keras, Pandas, NumPy, Matplotlib, Forecasting, Statistical Modelling

WORK EXPERIENCE

BI Data Engineer Feb 2024 – Present

JerseySTEM (Remote, US)

• Designed ERD and optimized MySQL database schema, enhancing data architecture for scalable, real-time analytics, improving data integration and system performance by 30%.

• Automated ETL workflows, extracting data from Google Cloud to MySQL using Python, increasing pipeline efficiency by 15%.

• Collaborated with business and IT teams to design and implement data models and structures, ensuring data is efficiently stored and accessible in the data warehouse.

• Built ETL pipelines using Integrate.io, optimizing data ingestion, reducing manual interventions, and ensuring smooth data integration from multiple sources.

• Integrated data into Looker Studio, enabling real-time reporting and ETL monitoring, improving business intelligence insights.

• Developed interactive Looker dashboards, enhancing data visualization and reducing refresh time by 75% (from 40s to 10s).

• Optimized SQL queries using indexing and partitioning, improving query performance by 20% and accelerating data retrieval.

• Collaborated cross-functionally using Agile and JIRA, boosting project completion by 15% and enhancing team efficiency.

• Documented data architecture, processes, and solutions, ensuring clear reporting of design decisions and data flows.

• Ensured the architecture supports advanced analytics, reporting, and dashboarding requirements, enabling timely, accurate insights.

Data Scientist Sep 2023 - Dec 2023

Bayer (Remote, NJ US)

• Refined marketing personas for 13 products using Azure Databricks, improving efficiency by 31% with top 3 best-sellers.

• Developed KPIs in Power BI for 26 major & 120 subcategories, optimizing ad strategies and providing audience insights.

• Improved ad targeting by 15% using data mining and refining segmentation with metrics like Click Ratio & Engagement Index.

• Designed A/B testing strategies, boosting campaign performance by 12% through statistical analysis and refining ad strategies.

Data Science Mentor Jan 2023 - Dec 2023

NJIT Career Development Services (Newark, NJ)

• Mentored 50+ students on resumes, job search, and interviews, enhancing career readiness and equipping them with technical skills.

• Led 13 workshops on Python, SQL, ML, and BI tools, helping students develop strong technical skills for data roles.

• Collaborated on 2 career fairs and a reverse fair, expanding student industry connections and boosting hiring potential.

• Boosted CDS Instagram engagement by 12% and reach by 20% through targeted career content and student success stories.

Data Engineer Jan 2021 - Aug 2022

MathCo (Remote, IN)

• Implemented automation through Python scripts for data ingestion, significantly reducing manual task duration by 87.5%.

• Optimized PySpark ETL in Azure Databricks, cutting runtime by 75%, and integrated CI/CD in Azure DevOps for automation.

• Designed and implemented data models and optimized data architecture for Azure Data Lake, improving scalability and performance.

• Reduced storage usage by 8-10GB per pipeline run in Azure Data Lake by eliminating unnecessary intermediate data files.

• Streamlined data warehouse management on Azure Data Factory, cutting pipeline runs by 60-70% and enhancing efficiency.

• Developed and deployed Power BI dashboards for business analytics, improving decision-making and supporting key performance indicators (KPIs).

• Automated customer data pipeline integrating Web API and PostgreSQL into AWS S3 and RDS, reducing manual tasks by 80%.

• Optimized data transformation with AWS Glue, improving processing speed by 60% and making queries in Redshift 5x faster.

• Documented data architecture, including data dictionaries, policies, and procedures, ensuring transparency and knowledge sharing.

• Improved marketing ROI by 15% using a multivariate regression model to optimize budget allocation across marketing channels.

Data Visualization Analyst Oct 2021 - Nov 2021

Saint Louis University (Remote)

• Conducted EDA on 14 marketing channels using Python and Excel, identifying trends that improved campaign performance.

• Created Tableau dashboards to track campaign ROI, identifying campaigns below 80% ROI and improving budget efficiency.

• Automated weekly marketing reports in Excel and Tableau, reducing manual effort by 25% and improving report accessibility.

EDUCATION

Master of Data Science, New Jersey Institute of Technology Sep 2022 - Dec 2023 GPA: 3.8/4

BTech in Electronics and Communication Engineering, PES University Aug 2017 - May 2021 GPA: 7.3/10

PROJECTS

• BCG Data Science Job Simulation (Feb 2025): Built a Random Forest model achieving 85% accuracy using Python to analyze churn and provided strategic insights.

• Employee Attrition Prediction and Analysis (Jun 2024): Evaluated Logistic Regression, Random Forest, and Decision Tree, achieving 98.98% accuracy with Random Forest to deliver retention insights.

• Credit Card Customer Attrition Prediction (Dec 2022): Applied Logistic Regression, Random Forest, and XGBoost for churn prediction, achieving a recall of 0.91 and ROC-AUC of 0.98, with Random Forest.

CERTIFICATIONS

Data Science Specialization, Johns Hopkins University

Generative AI Fundamentals, Azure Databricks


