Hey

I’m Atharva
An Aspiring Data Scientist & Analyst, exploring all things data in Toronto.

About

An Aspiring Data Scientist with Master's from University of Guelph. Built R Shiny applications for agricultural research and optimized ETL pipelines with 50% speed improvements. My passion lies in creating interactive dashboards and predictive models that help organizations make smarter decisions. Skilled in Python, R, SQL, Tableau, Looker Studio and Machine Learning. Outside data I enjoy playing Football Manager, Fantasy Premier League, Assassins Creed, supporting Manchester City FC, and watching Chinese & Turkish dramas.
Seeking Data Analyst, Business Analyst, or Data Scientist roles where I can apply my technical skills to solve real-world problems and drive data informed decision making.

Resume

Skills

Python SQL R programming HTML CSS JS
PostgreSQL ETL (Databricks) MYSQL MongoDB NLP
Tableau Looker Studio Excel RShiny Machine Learning

Experience

University of Guelph

Data Analyst (Graduate Research Assistant)

May 2025 – Aug. 2025

Optimized fertilizer recommendations by rebuilding Ontario’s Corn Nitrogen Rate Calculator as a Shiny app, enabling faster data-driven decisions; built an interactive Historical Ontario Corn Nitrogen Database Dashboard on 60+ years of trial data to automate fertilizer risk assessments and monitoring; and improved nitrogen response modeling using nls(), visualizing results via ggplot2/plotly to help farmers and stakeholders make evidence-based decisions.

United We Care

Data Science and Analytics Intern

Dec 2023 – Apr 2024

Identified retention risks for 5,000+ users by segmenting and building KPI dashboards to enable targeted product interventions; reduced data processing time by 50% by optimizing SQL ETL pipelines across 200+ datasets, improving analytics readiness; and enabled faster executive decisions by developing dashboards in Looker Studio and Excel.

Education

University of Guelph

Master of Data Science

Graduated with a GPA of 3.86/4, specializing in data science, machine learning, predictive modeling, and interactive dashboards. Completed projects using Python, R, SQL, and BI tools, including R Shiny apps gaining hands-on experience transforming complex data into actionable insights.

Rajiv Gandhi Institute of Technology

B.E Computer Engineering

Graduated with a CGPA of 8.6/10, focusing on software development, databases, and programming fundamentals. Built projects in Python and SQL while developing strong problem-solving and analytical skills applicable to data-driven roles.

Additional projects showcasing my expertise in data science, machine learning, R programming, dashboard development, and large language models.

Acdemic Project

Car Co-operators Insurance Prediction

This project builds a machine learning model to predict car insurance premiums using customer demographics, vehicle characteristics, and claim history. The goal is to help insurance companies price policies more accurately, manage risk effectively, and streamline the underwriting process, turning raw data into actionable insights for business decisions.

  • Python
  • Machine Learning
  • Regression, Data Preprocessing
  • Feature Engineering
Acdemic Project

London Crime Map

Worked in a team to build an interactive dashboard mapping London’s crime risk. I focused on integrating multi-source data and developing the R Shiny dashboard for hotspots and risk scores.

  • R
  • RShiny
  • leaflet, ggplot2, plotly
  • Dashboards
Personal Project

LexiAI — QA Assistant

LexiAI is an AI-powered QA assistant that helps users query dense legal documents and get clear, context-aware answers instantly. Built with NLP and LLMs, it goes beyond keyword search to summarize, interpret, and retrieve information—making legal knowledge more accessible and practical.

  • Python
  • NLP
  • Large Language Models (LLMs)
  • Text Summarization
Academic Project

House Price Prediction

Using the Ames Housing Dataset, I built a Linear Regression model to predict sale prices while minimizing RMSE. The project involved data cleaning, handling outliers, and encoding categorical features, showing how careful preprocessing and feature engineering can significantly improve model performance.

  • Python
  • Linear Regression
  • Feature Engineering
  • Data Preprocessing
Academic Project

BC Wildfire Analysis

Analyzing historical wildfire data in British Columbia, I explored spatial and temporal trends, identified key risk factors, and visualized high-risk zones. The project demonstrates how data-driven insights can inform wildfire management and mitigation efforts.

  • R
  • Data Analysis
  • GIS / Spatial Analysis
  • Data Visualization
Academic Project

What’s Cooking?

As part of a team, we built a model to predict a recipe’s cuisine from its ingredients. My contributions included text preprocessing, feature engineering, and classification modeling. The project highlights how we transformed raw ingredient lists into accurate cuisine predictions, combining culinary intuition with data science.

  • Python
  • NLP
  • Classification Modeling
  • Machine Learning

Get In Touch

I’d love to hear from you! Whether you have a question about data, analytics, dashboards, or just want to chat about tech and data-driven insights drop me a message.