技術摘要(英)
This technology is applied in AI applications that require both object detection and semantic segmentation tasks simultaneously. It is capable of detecting pedestrians, cyclists, two-wheeled vehicles, four-wheeled vehicles, lane markings, and drivable areas. The core of the solution utilizes a unified architecture (Backbone with multi-task branches) that enables both object detection and semantic segmentation functionalities. The backbone architecture can be adapted to various platform computational capabilities, offering different model complexities such as Yolov8, Yolov9, Yolov10, and Yolov11, with corresponding variants including n, s, m, l, and x models based on the desired complexity.