Designed a YOLOv8-based model to detect trash disposal actions in real-time. Deployed model with TensorRT on Jetson Nano, achieving 22 FPS inference speed for smart city waste monitoring applications.
YOLOv8TensorRTJetson Nano
Trained an EffNetB2 + scratch ViT model on 20% of the Food101 dataset (101 classes). Designed scalable training with regularization and tuned LR schedules, achieving 97.22% accuracy via ViT.
PyTorchViTEffNetB2
📄 ViT Research Replication
Recreated the Vision Transformer architecture from scratch based on the original research paper. Implemented patch embedding, multi-head self-attention, and positional encoding to study scaling laws and convergence.
PyTorchAttention Models
Used MediaPipe for real-time skeleton keypoint extraction during batting motions. Developed form evaluation algorithms achieving 92% consistency with expert scoring. Built a Streamlit dashboard for playback-based biomechanical feedback.
MediaPipeStreamlitOpenCV
🛰️ Satellite Forecasting Pipeline
Processed 5TB+ SEVIRI satellite imagery for temporal-spatial forecasting tasks. Used Xarray + Dask for parallel data handling, improving I/O performance by 40%. Containerized data workflows using Docker for reproducibility.
XarrayDockerDask