docs: update readme

Signed-off-by: badai-nguyen <[email protected]>
autowarefoundation · Nov 20, 2023 · 8b787e3
1 parent 9148eb4
commit 8b787e3
Showing 1 changed file with 60 additions and 21 deletions.
diff --git a/perception/tensorrt_yolox/README.md b/perception/tensorrt_yolox/README.md
@@ -2,13 +2,13 @@
 
 ## Purpose
 
-This package detects target objects e.g., cars, trucks, bicycles, and pedestrians on a image based on [YOLOX](https://github.com/Megvii-BaseDetection/YOLOX) model.
+This package detects target objects (e.g., cars, trucks, bicycles, and pedestrians) in an image and, via a semantic segmentation head, segments classes such as vehicles (cars, trucks, buses), pedestrians, buildings, vegetation, roads, and sidewalks, based on the [YOLOX](https://github.com/Megvii-BaseDetection/YOLOX) model.
 
 ## Inner-workings / Algorithms
 
 ### Cite
 
-<!-- cspell: ignore Zheng, Songtao, Feng, Zeming, Jian -->
+<!-- cspell: ignore Zheng, Songtao, Feng, Zeming, Jian, semseg -->
 
 Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, Jian Sun, "YOLOX: Exceeding YOLO Series in 2021", arXiv preprint arXiv:2107.08430, 2021 [[ref](https://arxiv.org/abs/2107.08430)]
 
@@ -22,10 +22,12 @@ Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, Jian Sun, "YOLOX: Exceeding YOLO Se
 
 ### Output
 
-| Name          | Type                                               | Description                                        |
-| ------------- | -------------------------------------------------- | -------------------------------------------------- |
-| `out/objects` | `tier4_perception_msgs/DetectedObjectsWithFeature` | The detected objects with 2D bounding boxes        |
-| `out/image`   | `sensor_msgs/Image`                                | The image with 2D bounding boxes for visualization |
+| Name             | Type                                               | Description                                                         |
+| ---------------- | -------------------------------------------------- | ------------------------------------------------------------------- |
+| `out/objects`    | `tier4_perception_msgs/DetectedObjectsWithFeature` | The detected objects with 2D bounding boxes                         |
+| `out/image`      | `sensor_msgs/Image`                                | The image with 2D bounding boxes for visualization                  |
+| `out/mask`       | `sensor_msgs/Image`                                | The semantic segmentation mask                                      |
+| `out/color_mask` | `sensor_msgs/Image`                                | The colorized image of semantic segmentation mask for visualization |
 
 ## Parameters
 
@@ -40,20 +42,32 @@ Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, Jian Sun, "YOLOX: Exceeding YOLO Se
 
 ### Node Parameters
 
-| Name                          | Type   | Default Value | Description                                                                                                                                                                                                                              |
-| ----------------------------- | ------ | ------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| `model_path`                  | string | ""            | The onnx file name for yolox model                                                                                                                                                                                                       |
-| `label_path`                  | string | ""            | The label file with label names for detected objects written on it                                                                                                                                                                       |
-| `precision`                   | string | "fp16"        | The inference mode: "fp32", "fp16", "int8"                                                                                                                                                                                               |
-| `build_only`                  | bool   | false         | shutdown node after TensorRT engine file is built                                                                                                                                                                                        |
-| `calibration_algorithm`       | string | "MinMax"      | Calibration algorithm to be used for quantization when precision==int8. Valid value is one of: Entropy",("Legacy" \| "Percentile"), "MinMax"]                                                                                            |
-| `dla_core_id`                 | int    | -1            | If positive ID value is specified, the node assign inference task to the DLA core                                                                                                                                                        |
-| `quantize_first_layer`        | bool   | false         | If true, set the operating precision for the first (input) layer to be fp16. This option is valid only when precision==int8                                                                                                              |
-| `quantize_last_layer`         | bool   | false         | If true, set the operating precision for the last (output) layer to be fp16. This option is valid only when precision==int8                                                                                                              |
-| `profile_per_layer`           | bool   | false         | If true, profiler function will be enabled. Since the profile function may affect execution speed, it is recommended to set this flag true only for development purpose.                                                                 |
-| `clip_value`                  | double | 0.0           | If positive value is specified, the value of each layer output will be clipped between [0.0, clip_value]. This option is valid only when precision==int8 and used to manually specify the dynamic range instead of using any calibration |
-| `preprocess_on_gpu`           | bool   | true          | If true, pre-processing is performed on GPU                                                                                                                                                                                              |
-| `calibration_image_list_path` | string | ""            | Path to a file which contains path to images. Those images will be used for int8 quantization.                                                                                                                                           |
+| Name                                   | Type   | Default Value | Description                                                                                                                                                                                                                              |
+| -------------------------------------- | ------ | ------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `model_path`                           | string | ""            | The onnx file name for yolox model                                                                                                                                                                                                       |
+| `label_path`                           | string | ""            | The label file with label names for detected objects written on it                                                                                                                                                                       |
+| `precision`                            | string | "fp16"        | The inference mode: "fp32", "fp16", "int8"                                                                                                                                                                                               |
+| `build_only`                           | bool   | false         | If true, shut down the node after the TensorRT engine file is built                                                                                                                                                                      |
+| `calibration_algorithm`                | string | "MinMax"      | Calibration algorithm to be used for quantization when precision==int8. Valid value is one of: ["Entropy", ("Legacy" \| "Percentile"), "MinMax"]                                                                                         |
+| `dla_core_id`                          | int    | -1            | If a positive ID value is specified, the node assigns the inference task to the DLA core                                                                                                                                                 |
+| `quantize_first_layer`                 | bool   | false         | If true, set the operating precision for the first (input) layer to be fp16. This option is valid only when precision==int8                                                                                                              |
+| `quantize_last_layer`                  | bool   | false         | If true, set the operating precision for the last (output) layer to be fp16. This option is valid only when precision==int8                                                                                                              |
+| `profile_per_layer`                    | bool   | false         | If true, profiler function will be enabled. Since the profile function may affect execution speed, it is recommended to set this flag true only for development purpose.                                                                 |
+| `clip_value`                           | double | 0.0           | If positive value is specified, the value of each layer output will be clipped between [0.0, clip_value]. This option is valid only when precision==int8 and used to manually specify the dynamic range instead of using any calibration |
+| `preprocess_on_gpu`                    | bool   | true          | If true, pre-processing is performed on GPU                                                                                                                                                                                              |
+| `calibration_image_list_path`          | string | ""            | Path to a file which contains path to images. Those images will be used for int8 quantization.                                                                                                                                           |
+| `yolox_s_plus_opt_param_path`          | string | ""            | Path to parameter file                                                                                                                                                                                                                   |
+| `is_publish_color_mask`                | bool   | false         | If true, publish color mask for result visualization                                                                                                                                                                                     |
+| `is_roi_overlap_segment`               | bool   | true          | If true, overlay detected object ROIs onto the semantic segmentation mask, giving ROIs higher priority                                                                                                                                   |
+| `overlap_roi_score_threshold`          | float  | 0.3           | Minimum existence_probability for a detected ROI to replace the segmentation result                                                                                                                                                      |
+| `roi_overlay_segment_label.UNKNOWN`    | bool   | true          | If true, ROIs of unknown objects are overlaid onto the semantic segmentation mask.                                                                                                                                                       |
+| `roi_overlay_segment_label.CAR`        | bool   | true          | If true, ROIs of car objects are overlaid onto the semantic segmentation mask.                                                                                                                                                           |
+| `roi_overlay_segment_label.TRUCK`      | bool   | true          | If true, ROIs of truck objects are overlaid onto the semantic segmentation mask.                                                                                                                                                         |
+| `roi_overlay_segment_label.BUS`        | bool   | true          | If true, ROIs of bus objects are overlaid onto the semantic segmentation mask.                                                                                                                                                           |
+| `roi_overlay_segment_label.TRAILER`    | bool   | true          | If true, ROIs of trailer objects are overlaid onto the semantic segmentation mask.                                                                                                                                                       |
+| `roi_overlay_segment_label.MOTORCYCLE` | bool   | true          | If true, ROIs of motorcycle objects are overlaid onto the semantic segmentation mask.                                                                                                                                                    |
+| `roi_overlay_segment_label.BICYCLE`    | bool   | true          | If true, ROIs of bicycle objects are overlaid onto the semantic segmentation mask.                                                                                                                                                       |
+| `roi_overlay_segment_label.PEDESTRIAN` | bool   | true          | If true, ROIs of pedestrian objects are overlaid onto the semantic segmentation mask.                                                                                                                                                    |
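
As a rough illustration of how `is_roi_overlap_segment`, `overlap_roi_score_threshold`, and the `roi_overlay_segment_label.*` flags might interact, the following Python sketch paints qualifying ROIs over a segmentation mask. It is not the node's implementation (the package itself is C++): the `Roi` class, the `overlay_rois` function, and the label-to-index mapping are assumptions made for illustration only, with indices taken from the semantic class table in this README.

```python
from dataclasses import dataclass

# Hypothetical mapping from detection label to segmentation class index.
# The indices follow the semantic class table in this README; the mapping
# itself (e.g. PEDESTRIAN -> person) is an assumption, not the node's logic.
LABEL_TO_SEG_INDEX = {
    "CAR": 13,
    "TRUCK": 14,
    "BUS": 15,
    "MOTORCYCLE": 17,
    "BICYCLE": 18,
    "PEDESTRIAN": 11,
}


@dataclass
class Roi:
    """Hypothetical 2D detection: label, existence probability, pixel box."""
    label: str
    score: float
    x: int
    y: int
    width: int
    height: int


def overlay_rois(mask, rois, enabled_labels, score_threshold=0.3):
    """Return a copy of `mask` with qualifying ROIs painted over it.

    `mask` is a 2D list of class indices. An ROI qualifies when its score
    is at least `score_threshold` and its label is enabled.
    """
    out = [row[:] for row in mask]  # leave the input mask untouched
    rows = len(out)
    cols = len(out[0]) if rows else 0
    for roi in rois:
        if roi.score < score_threshold:
            continue  # below the minimum existence probability
        if not enabled_labels.get(roi.label, False):
            continue  # overlay disabled for this label
        seg_index = LABEL_TO_SEG_INDEX.get(roi.label)
        if seg_index is None:
            continue  # no assumed segmentation class for this label
        # Clamp the box to the mask bounds before painting.
        for r in range(max(roi.y, 0), min(roi.y + roi.height, rows)):
            for c in range(max(roi.x, 0), min(roi.x + roi.width, cols)):
                out[r][c] = seg_index
    return out
```

For example, `overlay_rois(mask, rois, {"CAR": True, "TRUCK": False})` would paint only car ROIs whose score is at least 0.3 and leave every other pixel of the mask unchanged.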
 
 ## Assumptions / Known limits
 
@@ -69,6 +83,31 @@ The label contained in detected 2D bounding boxes (i.e., `out/objects`) will be
 If other labels (case insensitive) are contained in the file specified via the `label_file` parameter,
 those are labeled as `UNKNOWN`, while detected rectangles are drawn in the visualization result (`out/image`).
 
+The semantic segmentation mask is a grayscale image in which each pixel value is the index of one of the following classes:
+
+| index | semantic name |
+| ----- | ------------- |
+| 0     | road          |
+| 1     | sidewalk      |
+| 2     | building      |
+| 3     | wall          |
+| 4     | fence         |
+| 5     | pole          |
+| 6     | traffic_light |
+| 7     | traffic_sign  |
+| 8     | vegetation    |
+| 9     | terrain       |
+| 10    | sky           |
+| 11    | person        |
+| 12    | ride          |
+| 13    | car           |
+| 14    | truck         |
+| 15    | bus           |
+| 16    | train         |
+| 17    | motorcycle    |
+| 18    | bicycle       |
+| 19    | others        |
+
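A minimal Python sketch of reading such a mask, assuming it is already available as a 2D list of integer indices. The `SEMANTIC_NAMES` list mirrors the table above; the function names, the fallback of out-of-range indices to `others`, and any colors passed to `colorize` are illustrative assumptions, not part of the package.

```python
# Class names in index order, copied from the table above.
SEMANTIC_NAMES = [
    "road", "sidewalk", "building", "wall", "fence", "pole",
    "traffic_light", "traffic_sign", "vegetation", "terrain", "sky",
    "person", "ride", "car", "truck", "bus", "train", "motorcycle",
    "bicycle", "others",
]


def index_to_name(index):
    """Look up a class name; out-of-range indices fall back to 'others'
    (an assumption of this sketch)."""
    if 0 <= index < len(SEMANTIC_NAMES):
        return SEMANTIC_NAMES[index]
    return "others"


def count_classes(mask):
    """Count pixels per semantic name in a 2D mask of class indices."""
    counts = {}
    for row in mask:
        for index in row:
            name = index_to_name(index)
            counts[name] = counts.get(name, 0) + 1
    return counts


def colorize(mask, color_map, default=(0, 0, 0)):
    """Turn a mask into a 2D list of (r, g, b) tuples.

    `color_map` maps class index to a color; it is supplied by the caller,
    and indices missing from it are drawn with `default`.
    """
    return [[color_map.get(index, default) for index in row] for row in mask]
```

Calling `count_classes(mask)` gives a quick per-class pixel histogram, which can help when checking whether the mask looks plausible for a given scene.
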
 ## Onnx model
 
 A sample model (named `yolox-tiny.onnx`) is downloaded by ansible script on env preparation stage, if not, please, follow [Manual downloading of artifacts](https://github.com/autowarefoundation/autoware/tree/main/ansible/roles/artifacts).
@@ -146,7 +185,7 @@ Please refer [the official document](https://github.com/Megvii-BaseDetection/YOL
 
 ## Label file
 
-A sample label file (named `label.txt`)is also downloaded automatically during env preparation process
+A sample label file (named `label.txt`) and a semantic segmentation color map file (named `semseg_color_map.csv`) are also downloaded automatically during the environment preparation process
 (**NOTE:** This file is incompatible with models that output labels for the COCO dataset (e.g., models from the official YOLOX repository)).
 
 This file represents the correspondence between class index (integer outputted from YOLOX network) and