YOLOv8-OBB Real-Time Object Detection Service
End-to-end Computer Vision pipeline: a 2-class oriented bounding-box (OBB) detector for bottles vs cans, fine-tuned on Colab T4 GPU and served externally through a FastAPI REST API.
- Fixed dataset class imbalance — discovered a 93% / 7% bottle/can split and implemented a stratified re-split (80/10/10) from scratch
- Fine-tuned YOLOv8-OBB — AdamW + cosine LR + mosaic/mixup augmentation → confidence 0.82–0.94 in real-world tests
- Designed the FastAPI server — /predict (JSON) + /predict/visualize (PNG) endpoints with auto-generated Swagger docs
- Real-time post-processing — temporal smoothing (STABILITY_FRAMES=3) + area filter to suppress false positives from webcam streams