Name: Pruthviraj Rathod

Job Role: Data scientist

Experience: Fresher

Address: sect 12 Kharghar Navi Mumbai , India

Skills

SQL 95%
PYTHON 85%
Data Visualization 90%
Statistical Analysis 85%
Machine Learning 80%

About

About Me

Enthusiastic Data Scientist with a passion for uncovering actionable insights through data. Proficient in SQL, Python programming, and machine learning, with hands-on experience in data mining, visualization, and predictive analytics. Skilled in translating complex data into meaningful insights to drive informed decision-making. A task-driven individual eager to contribute to an esteemed organization, leveraging a diverse skill set and a keen analytical mindset to drive impactful solutions.

  • Profile: Data Science & Analytics
  • Domain: Retail, Ecommerce, BFSI & Digital Marketing
  • Education: MBA in (statistics) Aspiring 2025
  • Language: English, Hindi, Marathi
  • Programming Languages: SQL, Python, SAS
  • Visualisation Tools: Microsoft Power BI & Tableau
  • Other Skills: Exploratory Data Analysis, Statistical Modeling, Machine Learning, Data Manage ment, Data Modelling, NLP, Advanced Excel & Analytics on SAS
  • Interest: Traveling, Travel Photography

0 +   Projects completed

LinkedIn

Resume

Resume

Recent Data Science graduate skilled in Python, SQL, and machine learning. Experienced in data analysis, visualization, and predictive modeling. Passionate about turning data into actionable insights to drive business decisions. Eager to contribute innovative solutions..

Experience


Sep 2023 - Oct 2023

Home Loan Prediction

Academic Project

Developed a sophisticated machine learning model aimed at predicting home loan approvals, which streamlined approval processes and mitigated risks for financial institutions.

  • Skills Utilized: Exploratory Data Analysis (EDA), Data Visualization, Feature Engineering, Synthetic Minority Over-sampling Technique (SMOTE), Data Collection, Data Wrangling, Machine Learning, Descriptive and Inferential Statistics, Hypothesis Testing, Confidence Intervals, and Distribution Analysis.
  • Tools Used: Python, Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn.

Jul 2023 - Sep 2023

Machine Learning based Clustering and PCA Problem

Academic Project

Developed clusters based on browsing behavior to determine revenue potential for online shopping websites using unsupervised learning.

  • Skills utilized: EDA, Machine Learning, Unsupervised Learning, K-means Clustering, Hierarchical Clustering, Principal Component Analysis.
  • Tools Used: Python, Pandas, NumPy, Matplotlib, Seaborn, Yellowbrick, Sklearn

Jun 2023 - Jul 2023

Online Shopping Process Analytics

Academic Project

Analyzed online customer purchasing behavior over a decade using 'Sales and Delivery' data to understand seasonality and business patterns.

  • Skills utilized: SQL Query, Joins, Sub-Query, Windows Function.
  • Tools Used: MySQL Workbench



Education


2020-2023

Bachelor of Science BSc (Chemistry)

BAMU University

Completed with First Class.

2023

Post Graduate Program in Data Science Engineering

Great Lakes Executive Learning

Completed in 2023.

2024 - Aspiring 2026

Master of Business Administration (MBA)

Aditya Engineering College

Specialization: Statistics

Projects

Projects

Below are the sample Data scientist projects on SQL, Python, Power BI & ML.

Machine Learning based Disease Prediction Model

Course: Supervised Learning - Classification

Built a binomial machine learning model to predict Parkinson's disease. The objective of this model is to classify patients with Parkinson's disease. Performed the required pre-processing steps prior to model building and evaluated the model performance through a proper tuning process.

Skills & Tools Covered:

  • Machine Learning
  • Supervised Learning
  • Linear and Non-Linear Classification Models
  • Hyperparameter Tuning
  • Model Metrics

Machine Learning Model Building for Continuous Variable Prediction

Course: Supervised Learning - Regression

Built a Linear Machine Learning model to understand the relationship between the population of US cities in the years 1920 and 1930. Evaluated the model performance with appropriate measures. Performed all required graphical and quantitative exploratory data analysis prior to model building. The dataset comprises 49 rows and 2 columns, representing the population (in thousands) of 49 U.S. cities in 1920 and 1930. These cities are a random sample from the 196 largest cities in the U.S.

Skills & Tools Covered:

  • Machine Learning
  • Supervised Learning
  • Linear Regression

Descriptive Statistics on Real World Problem

Course: Statistics for Machine Learning

This project involves tackling three real-world problem statements:

  • The first problem focuses on exploratory data analysis of a La Liga Cup (a football tournament). The goal is to analyze the data for insights into top teams, winning distributions, and more.
  • The second problem helps in understanding and applying confidence intervals to a healthcare-related issue.

Skills & Tools Covered:

  • Descriptive and Inferential Statistics
  • Hypothesis Testing
  • Confidence Interval
  • Distribution

ITP/NPV

Course: Introduction to Python Programming

Performed data analysis to derive various insights from data collected from IPL matches. The analysis utilized two datasets:

  • deliveries.csv: Contains information about each delivery bowled in all the IPL matches including runs scored, batsman, bowler, non-striker, and more.
  • matches.csv: Includes details related to the match such as location, contesting teams, umpires, results, etc.

Skills & Tools Covered:

  • Python
  • Numpy
  • Pandas

Machine Learning based Clustering and PCA Problem

Course: Unsupervised Learning

Based on given data of visitors browsing for online shopping, built different clusters to determine if the person is only browsing and visiting multiple pages or also generating revenue for the shopping websites. Analyzed and compared the clusters formed with the existing Revenue Column.

Skills & Tools Covered:

  • Machine Learning
  • Unsupervised Learning
  • K-means clustering
  • Hierarchical Clustering
  • Principal Component Analysis
0 Achievements
0 Projects
0 Certifications
0 Hackathons

More projects on Github

I love to solve business problems & uncover hidden data stories


GitHub

Contact

Contact Me

Below are the details to reach out to me!

Address

Sect 12, Kharghar, Navi Mumbai, 410210, India

Contact Number

+91 9359904202

Download Resume

resumelink



Have a Question? Click Here