OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision(opencv.org)
841 points by ternaus 5 days ago | 147 comments
tl;dr: OpenCV 5 ships a rewritten graph-based DNN engine that boosts ONNX operator coverage from ~22% to over 80%, adds dynamic shapes, attention/MatMul fusion, and built-in tokenizer + KV-cache support for running LLMs and VLMs (Qwen, Gemma, GPT) directly via the Net API. Benchmarks show it matching or beating ONNX Runtime on CPU for models like YOLOv8, DINOv2, and OWLv2, while the old engine remains available behind the same API for backward compatibility. The release also modernizes the core with FP16/BF16 types, 0D/1D Mat support, a redesigned hardware acceleration layer (Intel IPP, Arm KleidiCV, Qualcomm FastCV, RISC-V), and split 3D/calibration modules; pip release is slated for June 2026.
HN Discussion:
  • OpenCV excels at basic image/video loading regardless of its CV features
  • Confirmed real-world performance improvements in OpenCV 5 validate the release claims
  • Investing in their own ONNX engine is misguided versus wrapping existing runtimes
  • ~OpenCV still has performance and shape-flexibility limitations compared to alternatives
  • Practical questions and adjacent use cases (Pyodide, production deployment, mobile)