Skip to content
View pathik1511's full-sized avatar

Block or report pathik1511

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pathik1511/README.md

Portfolio LinkedIn Kaggle Email Profile Views


👨‍💻 About Me

Data Scientist with 5+ years of experience driving product and business decisions through experimentation, causal inference, and statistical modeling at scale.

  • 🏢 Currently Data Scientist @ Walmart Connect — building Brand WAMM models, NLP pipelines & causal inference frameworks
  • 🎓 M.S. Computer Science — California State University, Long Beach (2021–2023)
  • 🤖 Deep expertise in NLP, Transformer models, MLOps and multi-cloud AI deployments
  • 📊 Track record: 60% ↓ tagging effort · 40% ↓ runtime · 25% ↑ marketing effectiveness
  • 🏆 Active Kaggle competitor — sharing notebooks, datasets & competition solutions
  • 📍 Farragut, Tennessee · Open to Data Scientist & ML Engineer roles

🛠 Tech Stack

ML & AI Python PyTorch TensorFlow Scikit-learn R

NLP & GenAI HuggingFace LangChain spaCy OpenAI

Data & Cloud BigQuery Airflow AWS GCP Azure SQL

MLOps & DevOps Docker Kubernetes MLflow Git

BI & Visualisation Looker Power BI Tableau


📊 GitHub Stats

  

🏆 Kaggle

I'm actively competing in ML challenges and sharing reproducible work with the Kaggle community:

  • 🏁 Competitions — End-to-end ML/DL solutions with leaderboard results
  • 📓 Notebooks — EDA walkthroughs, feature engineering guides & model experiments
  • 📦 Datasets — Curated open datasets published for the community
  • 💬 Discussions — Tips, insights & collaboration with fellow Kagglers

⭐ Visit kaggle.com/pathik1511 for my latest notebooks and competition results.


🚀 Featured Projects

Project Description Stack
☁️ ATS Resume Screener AI-powered Applicant Tracking System using Google Gemini for resume scoring Python · Gemini · NLP
🧠 Kidney Disease Classification End-to-end deep learning pipeline with MLflow experiment tracking PyTorch · MLflow · DVC
🐔 Chicken Disease Detection End-to-end ML project with CI/CD pipeline and cloud deployment Python · DVC · Docker
☕ Coffee Sales Forecasting Sales prediction & inventory optimisation with ML models Python · Scikit-learn · Pandas
📝 Text-to-SQL Natural language to SQL query generation using LLMs Python · LLM · SQL
🌿 Cassava Leaf Disease Computer vision model for plant disease classification TensorFlow · CNN · Kaggle

📈 Impact Highlights

60% reduction in manual tagging effort   → spaCy & Hugging Face pipelines (Walmart Connect)
40% runtime reduction                    → Production WAMM framework in Python/SQL
25% boost in marketing effectiveness     → Sentiment analysis, Naïve Bayes (Syntrons)
15% creative effectiveness improvement   → NLP on customer reviews (Walmart Connect)
35% fraud detection improvement          → Deep learning fraud detection (Syntrons)
50% data processing speed increase       → PySpark big data optimisation

📬 Let's Connect

Portfolio LinkedIn Kaggle Email

Open to Data Scientist & ML Engineer roles

Pinned Loading

  1. -Coffee-Bean-Store-Sales-Prediction-and-Inventory-Optimization -Coffee-Bean-Store-Sales-Prediction-and-Inventory-Optimization Public

    Sales prediction and inventory optimisation for a coffee store using ML forecasting models

    Jupyter Notebook 1

  2. commonlit-evaluate-student-summaries commonlit-evaluate-student-summaries Public

    NLP model to evaluate quality of student-written summaries — Kaggle competition

    Jupyter Notebook

  3. Farmfood Farmfood Public

    HTML

  4. OSICS OSICS Public

    Online Student Information and Course System — academic project

    Python