YOLO Object Detection with Tonic and Axum

📝 Overview

This project implements a real-time object detection pipeline using a YOLOv8m model. A Rust-based gRPC service (using Tonic and Ort) handles inference, while an Axum-based middleware streams video from a webcam, collects predictions, and serves them over WebSocket. A simple HTML page provides live visualization of the detections, and a predict_image POST endpoint allows direct inference on JPEG images.

🛠️ Tech Stack

Tonic and Ort for serving YOLO model inference over gRPC with ONNX Runtime
Axum and Tonic for handling video streaming and client HTTP requests
OpenCV for processing video frames in the middleware
HTML + JavaScript for real-time visualization

🏗️ Architecture

️📦 Installation

To run the project, simply execute the following command at the root directory:

make all

This downloads the necessary images, runs the service, and automatically opens a webpage for visualization.

⚡ Running with CUDA and TensorRT support

If you have set up CUDA correctly, you can run the project with GPU acceleration by executing:

make all-cuda

⏱️ Performance

The system is designed for real-time processing using parallel inference sessions to maximize throughput.
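The idea of parallel inference sessions can be sketched with the standard library alone. This is a minimal illustration, not the project's actual implementation: `Session`, `infer`, and `run_pool` are hypothetical names, and the placeholder "inference" only returns the frame length where a real service would call an ONNX Runtime session.

```rust
use std::sync::mpsc;
use std::sync::{Arc, Mutex};
use std::thread;

/// Hypothetical stand-in for a model session (assumption, not the project's type).
struct Session {
    id: usize,
}

impl Session {
    /// Placeholder "inference": returns the number of bytes in the frame.
    fn infer(&self, frame: &[u8]) -> usize {
        frame.len()
    }
}

/// Runs every frame through a pool of `workers` sessions and returns
/// (frame_index, session_id, result) tuples sorted by frame index.
fn run_pool(frames: Vec<Vec<u8>>, workers: usize) -> Vec<(usize, usize, usize)> {
    let (job_tx, job_rx) = mpsc::channel::<(usize, Vec<u8>)>();
    let job_rx = Arc::new(Mutex::new(job_rx));
    let (res_tx, res_rx) = mpsc::channel();

    // Always start at least one worker so queued frames get processed.
    let handles: Vec<_> = (0..workers.max(1))
        .map(|id| {
            let job_rx = Arc::clone(&job_rx);
            let res_tx = res_tx.clone();
            thread::spawn(move || {
                let session = Session { id };
                loop {
                    // The lock is held only while receiving, not while inferring.
                    let job = job_rx.lock().unwrap().recv();
                    match job {
                        Ok((idx, frame)) => {
                            let out = session.infer(&frame);
                            res_tx.send((idx, session.id, out)).unwrap();
                        }
                        Err(_) => break, // job channel closed: no more frames
                    }
                }
            })
        })
        .collect();
    drop(res_tx); // keep only the workers' clones alive

    for (idx, frame) in frames.into_iter().enumerate() {
        job_tx.send((idx, frame)).unwrap();
    }
    drop(job_tx); // signals the workers to stop once the queue drains

    for handle in handles {
        handle.join().unwrap();
    }
    let mut results: Vec<_> = res_rx.iter().collect();
    results.sort_by_key(|r| r.0);
    results
}
```

Each worker owns its own session, so frames are processed concurrently and results may finish out of order. The sketch sorts them by frame index before returning.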

TensorRT: Generating predictions for a single frame typically takes about 42 ms (median) and 90 ms (99th percentile). The detections you see therefore correspond to a video frame captured roughly 42 ms earlier, and up to about 90 ms earlier in the slowest cases.
CPU: Generating predictions for a single frame typically takes about 250 ms (median) and 450 ms (99th percentile). The detections you see therefore correspond to a video frame captured roughly 250 ms earlier, and up to about 450 ms earlier in the slowest cases.
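For readers who want to compute comparable median and 99th-percentile figures from their own timings, here is a small sketch using the nearest-rank method. The README does not say how its numbers were measured, so the method and the `percentile` function name are assumptions for illustration.

```rust
use std::time::Duration;

/// Nearest-rank percentile: the smallest sample such that at least `p` percent
/// of the samples are less than or equal to it.
/// Returns None for an empty slice or a `p` outside 0..=100.
fn percentile(samples: &[Duration], p: f64) -> Option<Duration> {
    if samples.is_empty() || !(0.0..=100.0).contains(&p) {
        return None;
    }
    let mut sorted = samples.to_vec();
    sorted.sort();
    // The rank is 1-based; clamping maps p = 0 to the first sample.
    let rank = ((p / 100.0) * sorted.len() as f64).ceil() as usize;
    Some(sorted[rank.clamp(1, sorted.len()) - 1])
}
```

Calling `percentile(&latencies, 50.0)` gives the median and `percentile(&latencies, 99.0)` the tail latency, where `latencies` would hold one `Duration` per processed frame.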

These figures depend on the hardware. The results above were obtained on an RTX 2060 with 8 GB of VRAM for the TensorRT measurements and an Intel® Core™ i5-9600K × 6 for the CPU measurements.

🐧 OS Compatibility

Currently, the project only works on Linux as it relies on mounting the /dev/video1 device. If you are using a different video input, you may need to modify the source code accordingly.
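If you do adapt the source for another input, one option is to make the device path configurable instead of hard-coding it. The sketch below is an assumption about how that could look, not the project's code: `resolve_device`, `check_device`, and the `VIDEO_DEVICE` environment variable mentioned in the comment are hypothetical names.

```rust
use std::path::{Path, PathBuf};

/// Default device named in this README.
const DEFAULT_DEVICE: &str = "/dev/video1";

/// Picks the video device: a non-empty override if one is given, else the default.
/// A caller could pass `std::env::var("VIDEO_DEVICE").ok().as_deref()`
/// (VIDEO_DEVICE is a hypothetical variable name).
fn resolve_device(override_path: Option<&str>) -> PathBuf {
    match override_path.map(str::trim) {
        Some(p) if !p.is_empty() => PathBuf::from(p),
        _ => PathBuf::from(DEFAULT_DEVICE),
    }
}

/// Returns the path if the device node exists, or a readable error otherwise.
fn check_device(path: &Path) -> Result<&Path, String> {
    if path.exists() {
        Ok(path)
    } else {
        Err(format!("video device {} not found", path.display()))
    }
}
```

Checking the path at startup turns a missing webcam into a clear error message rather than a failure later in the capture loop.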

📄 License

This project utilizes the YOLOv8m model, which is distributed under the AGPL-3.0 license. Please refer to the Ultralytics website for the full license details.

This project is intended for educational and demonstration purposes. Its licensing is subject to the licensing terms of the YOLOv8m model.

Disclaimer: Users are responsible for ensuring their compliance with the YOLOv8m model's license.

jordandelbar/yolo-tonic
