Akram Pathan
Jersey City, NJ - *****, USA +1-516-***-**** ***************@*****.*** LinkedIn GitHub SUMMARY
Data Analyst with a strong computer science background and 3+ years of hands-on experience, holding a Master’s degree in Data Science from Stevens Institute of Technology. Proficient in advanced analytical techniques, forecasting, data visualization, and cloud technologies. Managed large-scale financial and sales data, developed interactive dashboards, and optimized data processes across industries. Experienced in Python, SQL, R, and database systems, with expertise in AWS and Azure cloud platforms. Communicates complex data clearly for strategic decision-making and collaborates with teams to meet project milestones. TECHNICAL SKILLS
Programming Languages: Python, SQL, R
Databases/ Libraries: MySQL, Oracle, PostgreSQL, MongoDB, NumPy, Pandas, Matplotlib, Seaborn, Plotly Cloud Technologies: AWS (EC2, S3, RDS, Lambda, Redshift, CloudWatch), Azure (Blob Storage, Data Lake, Data Factory), Google Big Query, Snowflake
Machine Learning & AI Frameworks: TensorFlow, PyTorch, Scikit-learn, NLTK, LangChain Data Visualization: Tableau, Power BI, MS Excel, Amazon Quicksight, Looker Data Analysis & ETL Expertise: Data Mining, Data Cleansing, Statistical Analysis, Data Wrangling, Data Warehousing, Alteryx Method & Version Control: Agile, Waterfall, Git, GitHub PROFESSIONAL EXPERIENCE
Data Analyst, Dow Jones Princeton, NJ Jan 2025 – Present
● Engineered and optimized 8M+ records in Google BigQuery, transforming unstructured text data with advanced SQL processes to deliver clean, structured datasets for clients.
● Developed Python scripts to generate optimized SQL queries for data insertion, streamlining process of structuring and integrating cleaned data into databases, enhancing data processing efficiency.
● Leveraged Amazon S3 to verify original PDF files, resolving extraction discrepancies and ensuring data accuracy and integrity.
● Designed and deployed Looker dashboards to monitor and classify large-scale content data, streamlining approval workflows and improving content relevancy for client reports.
● Built dynamic Google Sheets reports to track project progress, approvals, and content issues, streamlining communication across teams and accelerating decision-making.
● Partnered with product leadership and an ML engineer to refine OCR-based text extraction models, boosting classification accuracy and aligning results with strategic goals.
AI Product Designer/Analyst Intern, Radical AI New York, NY Sep 2024 – Dec 2024
● Performed data analysis using Python and SQL to extract, clean, and transform user behavior and chatbot data, improving accuracy and quality of product insights.
● Crafted 3+ interactive Tableau dashboards visualizing user behavior and patterns, enabling product teams to identify friction points and improve AI chatbot’s user interface.
● Partnered with designers and engineers to deliver data-driven insights, driving interface improvements that enhanced usability and aligned features with user needs.
Data Analyst, KPMG India Sep 2020 – Jul 2022
● Consolidated financial data from transactional databases, accounting systems, and financial reporting platforms into a unified repository, managing over 1 million records to ensure comprehensive data coverage.
● Engineered and implemented complex SQL queries to extract, merge, and integrate financial data into Oracle databases, ensuring 100% data completeness and accuracy across all datasets.
● Harnessed pandas for data manipulation and merging, creating 20 new financial metrics that enhanced performance analysis and supported more accurate decision-making.
● Developed interactive Power BI dashboards utilizing DAX to create custom KPIs, time intelligence measures, and financial insights on revenue growth, expense variance, and profitability, enhancing stakeholder decision-making by 30%.
● Built dynamic Excel reports using pivot tables, VLOOKUP/XLOOKUP, conditional formatting, and advanced formulas to summarize trends and deliver 15+ analytical reports monthly with accurate, actionable financial insights.
● Orchestrated AWS services (EC2, S3, Lambda, RDS, Redshift) for scalable computing, secure storage, serverless processing, data warehousing, optimizing data handling, and complex queries.
● Led agile project life cycles and utilized GitHub for version control, ensuring timely delivery of financial analysis objectives and maintaining data integrity across 5+ projects through effective collaboration with cross-functional teams. Data Analyst Intern, Trigent Software India Mar 2020 – Aug 2020
● Constructed automated Python scripts to extract, transform, and analyze sales data from diverse sources; achieved a significant reduction in processing time by 50 hours monthly while enhancing overall workflow efficiency.
● Designed and optimized MySQL databases to store and manage large volumes of sales data, implementing indexing strategies and query optimizations that maximized data retrieval speed.
● Created comprehensive, interactive dashboards and reports using Power BI and MS Excel, translating intricate sales data patterns into visually compelling presentations that elevate data-driven decision-making across sales departments.
● Managed code repositories for sales data analysis project using Git, tracking changes, and maintaining code quality throughout development lifecycle of 12 analytical models and data pipelines. EDUCATION
Master of Science in Data Science, Stevens Institute of Technology Hoboken, NJ Sep 2022 – May 2024 Bachelor of Technology in Computer Science, Rajarambapu Institute of Technology India Aug 2017 – May 2021 PROJECTS
Time Series Modeling for Financial and Meteorological Forecasting
● Forecasted stock prices and managed risk by applying ARIMA and GARCH models to 9 years of TCS stock price data, enabling data-driven insights for strategic decision-making using Python.
● Achieved 95% forecast accuracy by applying SARIMA to analyze Seattle's daily weather patterns, supporting data-driven strategic planning in weather-dependent scenarios.
Customer Sales and Pricing Strategy Analysis
● Analyzed 300,000+ entry dataset in Tableau to calculate revenue across dimensions, evaluating sales volume against discounts for 200+ products, which informed targeted marketing strategies and boosted profit margins.