Personal details

Raghav P. - Remote data scientist

Raghav P.

Based in: 🇮🇳 India
Timezone: Mumbai (UTC+5.5)

About

Highly experienced Full-Stack Data Scientist / Machine Learning Engineer and Software Developer, delivering production ready ML models and customised softwares to automate business processes and decision making. With an added advantage of attaining MBA from India's finest B-school, I build tech powered solutions to ensure maximum profitability for any business venture.

Skills :
● Data Science & Machine Learning : Supervised ML, Linear Regression, Decision Tree, Prompt Engineering, Hypothesis Testing, Statistics, Exploratory Data Analysis, Data Wrangling, Cleaning & Tidying, Feature Engineering, Data Visualization, SQL, Scikit-learn, NumPy, Pandas, Matplotlib, Plotly, Dash, Microsoft Excel, Metabase, AWS Athena & Sagemaker, Google BigQuery, Unsupervised ML, Neural Nets, NLP, Large Language Models (LLM), GPT, Text Embedding, Vector Databases, PyTorch, fastai, TensorFlow, OpenAI API, Generative AI (GenAI)
● Programming : Python, MySQL, PostgreSQL, HTML, CSS, Flask, Heroku, APIs, Git, AWS S3 & EC2, JavaScript, jQuery, Java, Werkzeug WSGI, Leaflet.js, Folium, GeoPandas, Google Maps API & Sheets API, Bootstrap

Professional Work :

  1. Engineered a suite of cutting-edge ensemble ML models, revolutionizing underwriting processes for personal loan products a major FinTech Lender in India.
  2. Developed and deployed 6 ML models for credit-risk scorecard, using 5 data sources (Bureau, Bank Statement, SMS, Smartphone App Info and Demography), improving risk spread by 1200%.
  3. Devised NLP based models (Passive Aggressive Classifier and XGBoost Classifier with TFiDF Vectorizer) for classification of SMS, improving recall by 35% and accuracy by 60%, and saved 36 man-hours daily, processing over 200 Mn SMS/day.
  4. Created LLM based classifier models using Prompt Engineering on ChatGPT (OpenAI), Mistral AI (Open Source) and other open source models. Deployed using AWS Sagemaker.
  5. Crafted an LLM based Law ChatBot that answers to Lawyer's questions pertaining to any specified PDF file of a Legal Document.
  6. Built an in-house MLOps infrastructure with AWS Sagemaker, enhancing model integration.
  7. Engineered 330+ features from SMS data to emulate metrics including cash flow, lifestyle and financial literacy indicators. Included steps like Exploratory Data Analysis, Data Wrangling, Cleaning & Tidying, Feature Engineering.
  8. Designed comprehensive daily and monthly model monitoring dashboards using Flask-Python, Metabase and SQL tables, improved response time to anomalies by 75%, enhancing overall model reliability and performance.
  9. Designed a Customer Loyalty Program and delivered via a web and Android app, over 2500 DAU, rated 4.6 on play store for a manufacturer and distributor of PVC Plumbing Systems in Central India.
  10. Increased territorial penetration by over 25% by building automated interactive GIS sales dashboard for a wholesale business in India.
  11. Achieved 100% monthly revenue target completion by creating Business Intelligence Dashboard – Google Apps Script.
  12. Reduced total credit by 20% in five months by building a Python-bot that sends regular WhatsApp messages to debtors.
  13. Saved annual recurring cost of ₹25 lakh+ by automating data collection for a hedge fund using Yahoo Finance API & Python.
  14. Cut time from over 4 hours to 2 clicks by developing a GIS-enabled web application for new-license territory discovery for a developer and licensor of Escape Room Game Designs in USA.

Personal Projects :

  1. Developed a Fake News Classifier using TfIdf Vectorizer and Passive Aggressive Classifier - ML, pandas, scikit-learn.
  2. Created a Spam Email Classifier using Naïve Bayes and XGBoost Classifiers - ML, NumPy, seaborn.
  3. Designed a Cat Image Classifier using multi-layer neural network - DL, NumPy, Matplotlib, h5py, PIL, SciPy.
  4. Developed a web app to manage portfolio of stock with real-time prices using IEX API -SQLite, Flask, HTML, CSS, JS.
  5. Built a terminal-run card game called Declare - Python.

Education :
Computer Science and Data Science Masters (Self-Learned)
Major Electives:
Computer Science: MIT 6.0001, MIT 6.0002, Automate the Boring Stuff with Python, Harvard CS50, Harvard CS50 - Web Development, Intro. to Algorithms, Design and Analysis of Algorithms, Software Development Lifecycle, GIT and GitHub, Flask for Python, The Python Mega Course, Data Structures, Analysis of Algorithms, The Odin Project etc.
Data Science: Stanford CS229, Stanford Machine Learning (CS229A), Data Science path on Codecademy.com, Data Science and Machine Learning Bootcamp, Neural Networks and Deep Learning, Deep Learning with PyTorch: Zero to GANs, Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization

Work Experience

Freelance Software Developer / Data Scientist
Palrigraphy (Self Employed) | Dec 2019 - Present
HTML/CSS
Python
Java
jQuery
Flask
Google Maps
PostgreSQL
Heroku
Amazon S3
Bootstrap
Leaflet
uWSGI
JavaScript
Mapbox
Google Sheets
Geographic Information System

Single-handedly planned, built, launched and managed world-class software innovations for corporate clients.

  • Puranmal Sons, India, manufacturer and distributor of PVC Plumbing Systems in Central India:
    Designed a Customer Loyalty Program and developed web and Android app, over 2500 DAU, rated 4.6 on play store
    Automate BI reporting by building interactive GIS sales dashboard using Leaflet.js, Folium, GeoPandas, Google Sheets API
    Reduced receivables by >20% by building a bot that sends regular outstanding reminders to debtors using CAPIWHA API
  • New Escape Room Designs, LLC, USA, developer and licensor of Escape Room Game Designs:
    Reduced time from 4hr+ to 2 clicks by building GIS-enabled web application for new-license territory-discovery process
  • Developed a Fake News Classifier using TfIdf Vectorizer and Passive Aggressive Classifier – ML, pandas, scikit-learn
  • Created a Spam Email Classifier using Naïve Bayes and XGBoost Classifiers – ML, NumPy, seaborn
  • Designed a Cat Image Classifier using multi-layer neural network – DL, NumPy, Matplotlib, h5py, PIL, SciPy
  • Developed a web app to manage portfolio of stock with real-time prices using IEX API – SQLite, Flask, HTML, CSS, JS
  • Built GUI Automator for creating playlists in YouTube Music (Python, PyAutoGUI)
  • Built a terminal-run card game called Declare – Python
Principal Data Scientist / Manager Data Scientist
Money View (Whizdm Innovation Pvt Ltd) | Apr 2021 - Jul 2024
Python
SQL
Machine learning
Development
Team Building
Data Science
NLP
Google Cloud Platform
Management
Recruitment
Training
GPT
AI
AWS
LLM

At MoneyView, I have been instrumental in pioneering transformative initiatives in ML model engineering for credit underwriting and unstructured data parsing, leveraging advanced techniques such as RegEx, NLP, and GPT. My strategic leadership in building cutting-edge ensemble ML models and establishing model development pipelines has significantly enhanced our underwriting processes, resulting in substantial risk reduction and operational efficiency gains. My hands-on experience in developing and deploying multiple ML models across various data sources has equipped me with a robust analytical prowess and a number-driven approach, essential for identifying problems and crafting effective solutions.

My expertise extends beyond technical proficiency to encompass project management, team leadership, and collaboration with business teams. I have successfully managed cross-functional teams, mentored junior colleagues, and conducted training sessions on Python, Automation, and Machine Learning. My track record of receiving consecutive promotions and accolades, including the 'Delivery Champion' award at MoneyView, reflects my dedication to excellence and results-driven approach.

You can read about a few of my top projects in my resume. If you want to learn more about how I did these projects and many more, reach out to me.

Projects

Web Application for Customer Loyalty Program for manufacturer and distributor of PVC Plumbing Systems
HTML/CSS
Python
Java
SQL
Node.js
Flask
PostgreSQL
Heroku
Amazon S3
Pandas
Leaflet
uWSGI
JavaScript
Google Sheets
Problem - Engage with customer and create pull for product. Solution - Designed a Customer Loyalty Program and developed web and Android app, rated 4.7 on play store Impact - Over 3500 daily active users, increased sales by >20% YOY, new dealer appointment increased >35% YOY.
Web App to Manage Internal Sales Maps
2020
HTML/CSS
Python
jQuery
Flask
Google Maps
PostgreSQL
Heroku
Bootstrap
JavaScript
Cartography
Geographic Information System
Problem - We (Client) offer our customers an exclusive area where we agree not to sell the licensed design to another escape room company. We describe this exclusive area as a 2.5-hour drive radius. We currently use a couple of Google Map layers to manage all our sales and research availability to new customers. With 12 products (and 12 map layers) about 600+ previous sales, this can be time-consuming. Solution - A Python (Flask) web-application hosted on Heroku and DB on PostgreSQL was built. First page will ask for a location to be entered (city / full address / ZIP). Upon submitting the address, second page will load, which will give a list of products that can be licensed for the given address using 2.5-hour drive-time boundary condition. The app will have added functionality of map. Your query will return a map with drive-time for all* data points from the queried address. Impact - Cut time to respond to customer query from over 4 hours to 2 clicks

Education

Self Studies
Master's degreeComputer Science and Machine Learning
Apr 2019 - Mar 2021
Indian Institute of Management Indore (IIM Indore)
MBABusiness Management
Jun 2015 - Mar 2017

Certifications & Awards

CS50 and CS50 Web Developer
Harvard University | Nov 2020
Computer Science MIT 6.0001, MIT 6.0002, MIT 6.006 and MIT 6.046J
MIT OpenCourseWare | Sep 2020