detrflow – End-to-End RT-DETR Object Detection Pipeline

Overview

detrflow is a fully end-to-end object detection pipeline built on RT-DETR (Real-Time Detection Transformer), covering everything from data loading and training to evaluation, serving, and deployment.

The project demonstrates a complete applied ML workflow — from pretrained baseline evaluation to a live inference API — and is accompanied by a peer-reviewed arXiv publication on knowledge distillation for RT-DETR.

Key Results

MetricValue
COCO val2017 AP (baseline)47.9
BackboneRT-DETR (ResNet-50)
Evaluation datasetCOCO val2017 (5,000 images)

Features

  • COCO val2017 Evaluation — Full benchmark against the standard detection suite with reproducible results
  • FastAPI Inference API — REST endpoint for single-image and batch inference
  • HuggingFace Space Deployment — Live demo hosted and publicly accessible
  • Benchmark Scripts — Latency and throughput profiling utilities
  • Model Card — Documented limitations, intended use, and evaluation results

Tech Stack

PyTorch · RT-DETR · HuggingFace Transformers · FastAPI · COCO API · Docker

This project builds on the RT-DETR architecture and serves as the empirical foundation for the accompanying knowledge distillation study. The distillation experiments explore teacher-student training dynamics on top of the pretrained RT-DETR backbone, with ablations reported in the paper.