Data Scientist
Specializing in Machine Learning, Deep Learning, and serverless MLOps with a strong foundation in financial analytics. Master's candidate at UBC with a unique dual MBA-Tech background.
I'm a Data Scientist and Machine Learning Engineer specializing in building, validating, and deploying machine learning models, serverless MLOps pipelines, and financial optimization systems.
My background is uniquely positioned at the intersection of computer engineering and financial analytics. I hold a dual B.Tech in Computer Engineering & MBA in Finance from NMIMS, and am completing my Master of Data Science at the University of British Columbia (UBC).
I build production-grade AI systems, MLOps solutions, and quantitative tools—like my Paper Trader AI platform running 3 live strategies, and my automated property pre-fill pipeline built with foundation models at Square One Insurance.
Automated prediction pipeline. Developed for Square One Insurance to pre-fill property attributes on insurance quote forms, reducing customer abandonment. Masked building footprints using Microsoft Building Footprints with fallback custom U-Net models, extracting features using Clay/DINOv2 mapping to an XGBoost classification head.
Performance: Achieved 96.83% precision at 95% confidence threshold with 35.46% spatial coverage. Runs 26x faster than Swin/Clay backbones.
# Running self-supervised feature extraction
$ python predict.py --input satellite_img.tif
Feature Extraction: 26x Speedup vs Clay
Building Footprints:MS Footprints + U-Net
Calibration: 96.83% Precision (35.46% Coverage)
ACTUARIAL THRESHOLD MET: Pre-fill payloads generated.
I built a trading system that runs itself. Three strategies compete head-to-head on the S&P 500: momentum, XGBoost, and LSTM. Backtested on 9 years of data, now running live with automated daily trades.
Live results (Oct 2025 – May 2026): Momentum leads at +18.8% Alpha over SPY, 1.54 Sharpe. Paper trading only.
# Paper Trader AI - Live Performance
$ python main.py --status
System: LIVE since Oct 2025
Leader: Momentum +18.8% Alpha (vs SPY +3.94%)
Sharpe: 1.54 | Excess: +14.86%
All 3 models running autonomously.
View live dashboard for real-time updates →
High-performance product search assistant. Built an end-to-end Retrieval-Augmented Generation system using hybrid lexical-semantic retrieval (BM25 + FAISS) and Reciprocal Rank Fusion. Highly optimized data storage and indexing structures to minimize server footprint.
Performance: Indexed 94K+ products, reducing startup latency by 96% and lowering peak RAM from 6 GB to <2 GB.
# Optimizing hybrid search index
$ python index.py --products 94k
Retrieval: BM25 + FAISS Hybrid
RAM Usage: < 1.84 GB (was 6.1 GB)
Startup Time: -96% Latency Cut
RRF ranking fusion initialized for 94K+ products.
I'm actively seeking opportunities in Data Science and AI/ML Engineering. Whether you have a role in mind, a project to discuss, or just want to connect, I'd love to hear from you.