Data Science Portfolio

Explore production-grade machine learning projects with proven business impact. From fraud detection to medical imaging, each project demonstrates real-world value.

17 Production Projects
7+ Industry Domains
End-to-End ML Pipelines
RSF
WAPE
13.5%

Retail Sales Forecasting

LightGBM achieving 13.5% WAPE for 16-day forecasting across 54 stores with 56 engineered features

LightGBMXGBoostStreamlitPlotly+2 more
CCF
Net Savings
$131K

Credit Card Fraud Detection

Real-time fraud detection using XGBoost and SHAP achieving 97% ROC-AUC and $131K savings per 100K transactions

XGBoostSHAPFastAPIStreamlit+4 more
CXC
ROC-AUC
92.1%

COVID-19 X-ray Classification

Deep learning for medical imaging with 92% ROC-AUC and Grad-CAM explainability for clinical decision support

PyTorchResNet50EfficientNetDenseNet+4 more
ECC
ROC-AUC
98.1%

E-Commerce Customer Churn Prediction

Cost-optimized machine learning with 98% ROC-AUC and 60% cost reduction through threshold optimization

CatBoostXGBoostLightGBMSMOTE+4 more
FSA
Accuracy
81.2%

Financial Sentiment Analysis

NLP platform with BERT-MPNet achieving 81% accuracy on 4,846 expert-annotated financial sentences

BERTsentence-transformersXGBoostRandom Forest+5 more
MBA
AOV Increase
20-35%

Market Basket Analytics

Association rule mining and customer segmentation with 105K+ rules generating 20-35% increase in average order value

FP-GrowthAprioriK-MeansRFM Analysis+3 more
SMI
Prediction Accuracy
98.33%

Stock Market Intelligence Platform

LSTM deep learning and technical analysis platform with 98.33% prediction accuracy across 7,195 US stocks

LSTMTensorFlowKerasTA-Lib+3 more
RVA
Time Reduction
90%

Retail Vision Analytics

Computer vision for fashion retail with 92% accuracy achieving 90% reduction in manual data entry time

PyTorchCNNFashion MNISTStreamlit+2 more
JCP
Cost Reduction
20%

Job Change Prediction (HR Analytics)

Cost-optimized HR analytics with 79.7% ROC-AUC and 20% cost reduction through business-aligned ML

LightGBMCatBoostXGBoostSHAP+3 more
ENI
F1-Score
92%

Enterprise NER Intelligence (CoNLL-2003)

Named Entity Recognition with BERT achieving 92% F1-score for financial, legal, and healthcare applications

BERTTransformersBi-LSTM-CRFTensorFlow+2 more
S5I
Forecast Horizon
90 days

S&P 500 Intelligent Forecasting

Time series forecasting and portfolio optimization with Prophet and Modern Portfolio Theory for S&P 500

ProphetPyPortfolioOptModern Portfolio TheoryStreamlit+2 more
TSA
ROC-AUC
88.3%

Twitter Sentiment Analysis

NLP platform with 88.3% ROC-AUC analyzing 400K tweets for brand monitoring and market intelligence

Logistic RegressionSVMNaive BayesRandom Forest+4 more
HPP
R² Score
90.4%

House Price Prediction

CatBoost regression achieving 90.4% R² accuracy with 223 engineered features and SHAP explainability

CatBoostSHAPStreamlitPlotly+3 more
CIP
R² Score
99.78%

Car Insurance Premium Analytics

Stacking ensemble achieving 99.78% R² accuracy with real-time premium prediction and explainable AI

XGBoostLightGBMCatBoostScikit-learn+4 more
WCA
Adjusted Rand Index
0.898

Wine Clustering Analysis

GMM vs K-Means comparison achieving 0.898 ARI with automatic cluster selection and uncertainty quantification

Scikit-learnGMMK-MeansDBSCAN+5 more
NBS
ROC-AUC
91.6%

Naive Bayes Spam Detection

MATLAB-based email classifier achieving 91.6% ROC-AUC with hyperparameter optimization and bootstrap validation

MATLABNaive BayesLogistic RegressionUCI Spambase+2 more
CSA
mAP
37.0

COCO Smart Analytics

Computer vision platform with Faster R-CNN detecting 80+ object categories for retail, smart cities, and security

PyTorchFaster R-CNNResNet-50-FPNTorchvision+3 more