OCR Plate (C++ / ONNX Runtime / OpenCV)

Dự án nhận diện phương tiện, phân loại hãng xe và OCR biển số bằng C++ với ONNX Runtime + OpenCV.

Hỗ trợ input: --image, --folder, --video.

1) Pipeline

Luồng xử lý mỗi lần infer (1 ảnh / 1 frame infer):

Vehicle detection (YOLO) trên ảnh gốc.
Crop vehicle ROI từ bbox vehicle.
Chạy song song 2 nhánh:
- Brand branch: batch classify hãng xe cho car.
- Plate branch:
  - plate detect theo từng vehicle (multi-thread).
  - map bbox plate về ảnh gốc.
  - crop + preprocess plate (multi-thread).
  - OCR (batch; nếu model OCR fix batch=1 thì fallback multi-thread theo từng ảnh).
Merge kết quả vehicle + plate + OCR text/conf.
Draw overlay và xuất ảnh/video.

flowchart TD
    A[Input image/folder/video] --> B[Vehicle Detection YOLO]
    B --> C[Crop vehicle ROIs]
    C --> D[Brand branch - batch]
    C --> E[Plate branch - MT detect/map/preprocess + OCR]
    D --> F[Merge]
    E --> F
    F --> G[Draw overlay + Save/Show]

ONNX Runtime được vendor sẵn trong third_party/onnxruntime.

2) Tracking (video)

Trong mode video, hệ thống dùng tracking-by-detection để gán track_id ổn định cho vehicle theo thời gian.

Hiện tại tracker được implement theo hướng ByteTrack-like:

Predict bbox mỗi frame (để chạy tốt khi chỉ infer mỗi N frame).
Data association 2-stage (high-score rồi low-score) + assignment toàn cục (giảm id switch).

Ngoài track_id, hệ thống duy trì map nghiệp vụ: track_id -> {brand, plate} và chỉ “chốt” khi đủ ngưỡng:

brand: brand_conf > kTrackBrandAcceptConf
plate: plate_det_conf > kTrackPlateDetAcceptConf và ocr_conf_avg > kTrackPlateOcrAcceptConf
brand/plate đều có budget số lần thử; quá ngưỡng thì dừng predict nhánh tương ứng (kTrackBrandMaxAttempts, kTrackPlateMaxOcrAttempts)

Nếu một track đã đủ brand/plate hợp lệ, các frame sau sẽ bỏ qua phần nhận diện tương ứng để giảm compute.

Ở mode video, sau khi chọn vùng làm việc, hệ thống cho phép chọn thêm một đường ranh (2 điểm). Chỉ các track đã đi qua đường này mới bắt đầu chạy predict brand và OCR plate.

3) Yêu cầu

Linux (khuyến nghị Ubuntu 22.04)
CMake + compiler hỗ trợ C++23
OpenCV dev

setup.sh cài sẵn:

build-essential
cmake
pkg-config
libopencv-dev

4) Quick start

./setup.sh
./build.sh
cd build
../run.sh --image ../img/1.jpeg

Nếu script chưa executable:

chmod +x setup.sh build.sh run.sh

5) Build

Mặc định build 2 target: main và benchmark.

./build.sh

Tùy chọn:

--build-type <type> (mặc định Release)
--jobs <n>
--clean
--target <name> (lặp được)

Ví dụ:

./build.sh --build-type Debug --jobs 8
./build.sh --clean --target benchmark

Output:

out/build/bin/main
out/build/bin/benchmark

Ghi chú: nếu bạn từng build trên môi trường khác (ví dụ cache compiler trỏ tới *.exe), hãy chạy ./build.sh --clean để tạo cache mới.

6) Chạy

Main:

../run.sh --image ../img/1.jpeg
../run.sh --folder ../img
../run.sh --video ../video.mp4 --show --nosave

Benchmark:

../run.sh --benchmark --image ../img/1.png --warmup 5 --runs 20

Video note:

infer theo chu kỳ app_config::kVideoInferEveryNFrames (mặc định 5), các frame giữa chu kỳ tái dùng overlay gần nhất.

7) Cấu hình

Thiết lập trong include/ocrplate/core/app_config.h:

Model paths: kVehicleModelPath, kPlateModelPath, kBrandCarModelPath, kOcrModelPath
Thresholds: kVehicleConfThresh, kPlateConfThresh, kNmsIouThresh, kOcrConfAvgThresh
Video/tracking: kVideoInferEveryNFrames, các biến kTracker*, các biến kTrack*AcceptConf
Budget retry: kTrackBrandMaxAttempts, kTrackPlateMaxOcrAttempts

8) Docker

docker build -t ocr-plate .
docker run --rm -v "$PWD/img:/app/img" ocr-plate --image /app/img/1.jpeg
docker run --rm -v "$PWD/img:/app/img" --entrypoint /app/benchmark ocr-plate --image /app/img/1.jpeg --warmup 3 --runs 10

9) Cấu trúc project (production-like)

Public headers nằm dưới include/ocrplate/* (module hoá theo folder), source nằm dưới src/*:

src/app/*: entrypoints + CLI parsing
src/pipeline/*: pipeline infer/draw
src/services/*: model runners (YOLO/OCR/Brand/ONNX)
src/utils/*: util chung (parallel, decode, report, preprocess)
src/tracking/*: tracking + identity store

Các module được build thành library targets trong CMakeLists.txt:

ocrplate_utils, ocrplate_tracking, ocrplate_services, ocrplate_pipeline

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Plate (C++ / ONNX Runtime / OpenCV)

1) Pipeline

2) Tracking (video)

3) Yêu cầu

4) Quick start

5) Build

6) Chạy

7) Cấu hình

8) Docker

9) Cấu trúc project (production-like)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
docs		docs
img		img
include/ocrplate		include/ocrplate
model		model
src		src
third_party/onnxruntime		third_party/onnxruntime
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
CMakePresets.json		CMakePresets.json
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build.sh		build.sh
run.sh		run.sh
setup.sh		setup.sh

Folders and files

Latest commit

History

Repository files navigation

OCR Plate (C++ / ONNX Runtime / OpenCV)

1) Pipeline

2) Tracking (video)

3) Yêu cầu

4) Quick start

5) Build

6) Chạy

7) Cấu hình

8) Docker

9) Cấu trúc project (production-like)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages