Name		Name	Last commit message	Last commit date
parent directory ..
Detic		Detic
detectron2		detectron2
segment-anything		segment-anything
test_data		test_data
README.md		README.md
detic_sam.py		detic_sam.py
run_detic_sam.sh		run_detic_sam.sh
setup.sh		setup.sh
setup_alta.sh		setup_alta.sh

README.md

Detic + SAM for objects recognition and segmentation

Description

It's the DTSAM (Detic + SAM) python package for objects recognition and segmentation.

Specifically, it uses Detic for objects recognition to get the bounding boxes of objects and then uses SAM for object segmentation to get the masks of objects.

The main script is: detic_sam.py.

Details of Detic and SAM can be found in their respective repositories:

Installation

First add the following to your bash profile (assuming you have CUDA 11+): export CUDA_PATH=/usr/local/cuda-11.7/

Next, be sure to use python 3.8. If you have a higher version of python, then install 3.8 and use this.

Either run ./setup.sh (make sure the python command uses python 3.8 in this case!) or follow the steps manually.

Details can be checked in setup.sh or setup_alta.sh

Usage

Run run_detic_sam.sh There are four parameters:

IMAGE_PATH: The image path to run the detection on.
CLASSES: The object classes that you want to detect.
DEVICE: The device to run the detection on (e.g. 'cuda:0' or 'cpu').
THRESHOLD: Detection threshold.

More details can be checked in dtsam.py

Troubleshooting

Segmentation Fault (core dumped)

This stems from an issue with the detectron2 installation. Torch and detectron2 are closely linked: you need to make sure you've installed the torch version with the right CUDA extension corresponding to the detectron2 version (as well as your own system setup). Find the correct detectron2 installation command here. Then, find the corresponding torch version compatible withthat and make sure you have that.

PIL.Image.LINEAR doesn't exist.

  File "/home/nkumar/detic-sam/venv/lib/python3.10/site-packages/detectron2/data/transforms/transform.py", line 46, in ExtentTransform
  def __init__(self, src_rect, output_size, interp=Image.LINEAR, fill=0):
  AttributeError: module 'PIL.Image' has no attribute 'LINEAR'. Did you mean: 'BILINEAR'?

Then simply edit the offending file to change Image.LINEAR to Image.BILINEAR.

Acknowledgements

Thank bdaiinstitute for providing the initial code !

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dtsam_package

dtsam_package

README.md

Detic + SAM for objects recognition and segmentation

Description

Installation

Usage

Troubleshooting

Acknowledgements

Files

dtsam_package

Directory actions

More options

Directory actions

More options

Latest commit

History

dtsam_package

Folders and files

parent directory

README.md

Detic + SAM for objects recognition and segmentation

Description

Installation

Usage

Troubleshooting

Acknowledgements