Healthcare Image Processing
An end-to-end pipeline that turns Healthcare video into deduplicated frames, object detections, and standard annotations—ready for downstream ML—with production-minded metrics and container packaging.
Overview
The pipeline extracts frames from video, removes near-duplicates with perceptual hashing, runs YOLOv8 detection, emits COCO-format annotations, and publishes operational metrics. It is designed for repeatable runs, clear reporting, and integration with common monitoring stacks.
Capabilities
Frame intelligence
Configurable perceptual-hash deduplication cuts redundant storage and compute while preserving clinically distinct frames.
Detection & labels
YOLOv8 inference with tunable confidence; outputs map cleanly into COCO-style annotations.json for training and evaluation tooling.
Observability
Prometheus counters and histograms for throughput, detections, and timing—suitable for dashboards and alerting.
Operational packaging
Dockerfile, Makefile targets, YAML configuration, and a markdown pipeline report summarising stats, timing, and class distribution.
Visual summary
Technology stack
- Python 3.9+
- YOLOv8
- OpenCV / FFmpeg
- Perceptual hashing
- Prometheus
- Docker
- pytest
Need a similar vision or video ML pipeline?
Get in touch