Post Job Free
Sign in

Data Analyst

Location:
Denton, TX, 76205
Salary:
40/hr
Posted:
November 13, 2025

Contact this candidate

Resume:

RAJEEV KRISHNA YALAMARTHY

940-***-**** **************@*****.*** LinkedIn GitHub

SUMMARY

Results-driven Data & Business Analyst with a Master’s in Information Systems and nearly 5 years of combined academic and industry experience in data analytics, automation, AI product development, and cloud platforms. Skilled in SQL, Python, Power BI, Azure, AWS, and Databricks, with proven success delivering AI-enhanced data solutions across enterprise and startup environments. Led development of intelligent data platforms, GPT-based assistants, and real-time automation tools supporting telecom, legal, HR, and customer operations use cases. Adept in LLM prompt engineering, Agile product ownership, and stakeholder collaboration, with hands-on experience in OpenAI GPT-4, LangChain, and MLflow. Passionate about building scalable AI-driven products that transform data into strategic insights and measurable business impact.

EDUCATION

MASTER’S IN INFORMATION SYSTEMS AND TECHNOLOGIES CGPA:3.6/4 UNIVERSITY OF NORTH TEXAS

BACHELOR OF TECHNOLOGY IN COMPUTER SCIENCE & ENGINEERING CGPA:7.9/10 VELLORE INSTITUTE OF TECHNOLOGY, INDIA.

SKILLS

Data Visualizations: Tableau, Power BI.

Programming and Scripting: Python, Perl, Shell Scripting, Bash, HTML, CSS, PHP, R Operating Systems: Linux, Unix

Database: MySQL, PostgreSQL, MongoDB, Snowflake

Data Transformation tools (ETL tools): Google Cloud Dataflow, Informatica, IBM DataStage, Bash Cloud & Big Data: AWS (Certified), Azure (Certified), Big Query, Google Cloud Storage (GCS), Dataproc, Hadoop

(basics), Kafka (basics), Google Cloud (Vertex AI, BigQuery), Redshift AI Tools & Frameworks: OpenAI GPT-4, Prompt Engineering, LangChain, Hugging Face Transformers, Dialogflow, Azure AI Studio, Databricks, scikit-learn, MLflow

Data Analysis & Management: MS Excel (Microsoft Power Query, VLOOKUP, XLOOKUP, Extract Transform Load

(ETL), Pivot Tables). Data Management, Data Visualization, Statistical Analysis. RPA/Automation tools: Microsoft Power Automate Desktop. Other Tools: Lucidchart, Git, JIRA, Agile/Scrum, Clickup WORK EXPERIENCE

Business-Data Analyst, Cognizant Technology Solutions Corporation MAR 2021 – AUG 2023 Customer Interaction Data Platform – Azure & Databricks Implementation

• Designed and implemented an automated data ingestion framework using Azure Data Factory to integrate sources including SQL Server, REST APIs, and FTP, processing over 500K daily customer interaction records.

• Developed incremental load pipelines using watermark logic and parameterized ADF controls, improving pipeline efficiency and reducing redundant data movement by 70%.

• Built schema-evolving Delta Lake pipelines, enabling dynamic onboarding of CRM and BPO datasets with zero downtime or manual intervention.

• Architected and maintained a bronze-silver-gold transformation layer using Databricks (PySpark) to support analytics for churn prediction, SLA compliance, and agent performance.

• Applied data validation and business rules using the DataFrame API to ensure >98% data accuracy in gold- layer datasets for executive-level dashboards.

• Collaborated cross-functionally with BI and operations teams to define standardized data contracts, decreasing source onboarding time by 50% and improving ingestion stability.

• Generated machine learning-ready datasets by blending structured and semi-structured data (e.g., call logs, billing, tickets) for churn and upsell modeling.

• Enabled near real-time ingestion via file triggers and control flows, cutting latency windows for critical reports by 40%.

• Optimized Spark jobs using adaptive query execution and resource-level tuning, reducing cluster overhead and runtime costs by 30–35%.

• Authored reusable notebook templates and internal playbooks, increasing project ramp-up speed by 60% and ensuring codebase consistency.

• Created masked test datasets from staging zones to support QA/UAT efforts, enhancing test coverage and validation accuracy.

• Contributed to Agile ceremonies, maintaining 92% sprint delivery consistency while mentoring junior engineers on pipeline design, debugging, and best practices

Software Engineer Intern, CYIENT Limited MAY 2024-AUG 2024 Telecom-Data Intelligence & Geo-Analytics Project

[Power BI Python Shell Scripting Perl (support scripts) sss SQL Excel Power Automate]

• Automated extraction of over 10,000 geospatial cable records from AT&T legacy systems using Power Automate and Python, improving data extraction speed by 60%.

• Built interactive Power BI dashboards that visualized cable health, service coverage, and g3e_fid-based telecom datasets, significantly improving executive decision-making.

• Developed Google Earth-based KMZ visualizations to map and compare network lines, improving service audit clarity.

• Used Python to implement error logging and cleansing techniques for geospatial pipeline datasets, increasing data accuracy by 35%.

• Implemented initial AI-based approaches to recommend cable mapping overlays using pattern detection in KMZ files, improving spatial accuracy for remote site planning.

• Created reusable EXE tools to segregate and validate telecom records by region, reducing manual errors by 15% and improving process efficiency by 35%.

Associate Product Analyst- AI Interfaces, Digital Agents.io (Startup Internship) NOV2024- Jan2025 Development & Testing of AI-Powered Digital Labor Agents for Business Automation

• Contributed to building AI-powered digital labor agents designed for HR, legal, real estate, and sales

(“DemoMonkey”, “RapidHire”, “AI Sidekick”), executing routine business tasks 24/7.

• Authored and prioritized 100+ use cases, defining persona-based workflows such as appointment scheduling, document summarization, onboarding chatbots, and lead handling, driving a 30% boost in deployment readiness.

• Developed prompt-engineered prototypes using OpenAI GPT, including a real-world assistant for legal document handling, lead qualification, and scheduling automation.

• Led acceptance testing & UX evaluation across web and chat interfaces, reducing UI friction by ~25% and ensuring agents responded accurately from a centralized knowledge base.

• Gathered performance data with integrated analytics dashboards, demonstrating agents delivered a 5 productivity increase, constant availability, and visible ROI within deployment hours. PROJECTS

AI-Powered Resume Screening Assistant MAR-2025

• Designed and deployed an AI-driven screening tool using OpenAI’s GPT-4 to parse and evaluate resumes against job descriptions in real time, automating first-round candidate selection.

• Implemented custom prompt engineering logic for skill extraction, scoring alignment, and ranking output, optimizing the model to prioritize top applicants based on core job criteria.

• Integrated feedback loops for refining scoring accuracy and used Python to simulate dataset processing and mock candidate-job pairing at scale.

• Achieved ~88% match precision across 100+ resume/job combinations during internal testing, reducing manual HR screening time by over 50%.

• Tool adaptable for integration into ATS platforms and scalable for use in high-volume recruitment pipelines.

Smart Contract Summarizer – GPTBased Legal Risk Detection Tool AUG-2024

• Developed a GPT-powered AI summarizer to extract key clauses and identify legal risk terms from unstructured text in NDAs and commercial lease agreements.

• Achieved ~90% clause detection accuracy across 75+ contracts by iteratively refining prompt structures and incorporating control logic for semantic consistency.

• Proposed and prototyped a real-time AI compliance assistant to support legal teams, reducing contract review turnaround by up to 40%.

• Applied legal-specific NLP techniques and prompt tuning to enhance clause relevance scoring and risk flagging for high-stakes terms

AI-Based Vehicle Breakdown Assistance Chatbot SEP-2023

• Engineered a real-time mechanic locator web application integrated with Google Maps, enabling dispatchers to promptly identify and alert the three nearest mechanics to breakdown locations, dramatically improving response times.

• Developed a real-time assistant to help stranded users locate nearby services using location-based APIs and OpenAI logic sequences.

• Integrated escalation flows, automated task triggers, and conversational feedback, improving emergency resolution response by 40% in simulated testing.

• Deployed location-based service with Google API, facilitating direct connections between stranded drivers and nearby mechanics, and ensuring 85% of breakdowns received immediate assistance from a local professional. RAINFALL-Prediction System MAR-2023

• Developed a rainfall prediction model using Decision Tree (classification), Linear Regression, SVM, and ARIMA

(regression) algorithms. Integrated real-time data from Open Weather API and utilized a 20-year Kaggle dataset (4377 columns, 19 rows) including variables such as wind speed, rainfall, humidity, temperature, and wind direction. Achieved 99.32% accuracy in predictions. Automated API ingestion (OpenWeather) and visualization through Python dashboards for environmental forecasting.

• Deployed the processed data on an open-ended web portal, providing free access to users for predictive insights on current and future rainfall patterns.

Forest-Fire-Detection via Satellite Image Classification Dec-2022

• Leveraged image processing techniques and machine learning algorithms to develop a proof-of-concept forest fire detection system to detect smoke from satellite imagery that achieved 88% accuracy in identifying smoke plumes from a 100-mile range.

• Designed alert thresholds to integrate with emergency response systems. Analysing AtliQ Hardware's Data: Unveiling Sales and Financial Insights SEP-2022 Excel(Microsoft Power Query,VLOOKUP, XLOOKUP,Extract Transform Load (ETL), Pivot Tables)

• Analyzed AtliQ Hardware's data from 2019-2021, examining customer details, product info, market dynamics, and sales figures to reveal operational insights.

• Constructed customized Profit & Loss statements revealing previously unseen trends in regional market performance, driving a 15% improvement in operational efficiency within six months. POSITIONS OF RESPONSIBILITY

President of See Their Smiles project in National Service Scheme – VIT Chennai SEP 2020 – MAY 2023

• Pivot motive of this project is to help and support orphanages and old-age homes Secretary of PALS Event Management Authority – VIT Chennai AUG 2021 – MAY 2023

• PAN IIT Alumni Leadership Series (PALS) is an educational initiative for alumni of all IITS. I was part of the PALS yearlong program, which involved various activities for the students and management of engineering institutions. Social Media and Content head of IEEE Computer Society – VIT Chennai SEP 2022 – MAY 2023

• Managed social media and content creation for technical events. Boosted online engagement and participation, through strategic content dissemination. Hosted various events including ML, Ops Workshop, Hackathons, and Cryptic Hunts. CERTIFICATIONS

• Microsoft Azure AI Fundamentals (AI-900) Jun 2025

• ChatGPT Prompt Engineering for Developers – DeepLearning.AI May 2025

• OpenAI API Integration (Independent Study) Apr 2025

• Data Analytics Essentials – Cisco Mar 2025

• Academy Accreditation - Generative AI Fundamentals – Databricks Feb 2025- Feb 2027 Credential ID: 134242970

• Career Essentials in Project Management – Microsoft & LinkedIn Feb 2025

• Microsoft Azure Data Engineer Associate (DP-203) Cert Prep – LinkedIn Feb 2025

• Prepare for the Microsoft Azure AI Fundamentals (AI-900) Certification – LinkedIn Feb2025

• AWS Certified Cloud Practitioner – AWS Apr-2023

• Microsoft Certified: Azure Fundamentals – Microsoft Aug 2022 Credential ID: I395-1176



Contact this candidate