VisiOSC

A set of Python scripts that capture video input, recognize body landmarks, and send their coordinates via Open Sound Control (OSC) protocol, so that they can be used in software like Max/MSP, TouchDesigner, Pure Data or any OSC-capable DAW or plugin. Google MediaPipe library and models are used for computer vision tasks.

Usage

First of all, install the necessary dependencies with:

pip install -r requirements.txt

Then, refer to the following guides for using the scripts. Coordinates for x and y are always inside [0.0, 1.0] range, with the origin being the left bottom corner (camera image is automatically mirrored).

For each script, the following options can be specified:

Option	Description
`-a ADDRESS, --address ADDRESS`	Address to send OSC messages (default: `127.0.0.1`)
`-d DEVICE, --device DEVICE`	Index of the video device to use (default: `0`, if you have multiple video input devices you might have to try different values)
`-m MODE, --mode MODE`	Send a simplified and named list of the landmarks (`0`, default) or send all the numbered landmarks (`1`)
`-p PORT, --port PORT`	Port to send OSC messages (default: `8000`)
`-s WIDTH HEIGHT, --size WIDTH HEIGHT`	Width and height of the capture window (default: `640 480`)

Hands Tracking

The script hands_track.py recognizes 21 hand landmarks for each hand. If two right or two left hands appear in the video stream, only one will be detected (you can easily modify this behavior by editing the code). The default mode does not pass to OSC stream the 21 landmarks, but the coordinates of the tips of the fingers, the wrist and the palm. This behavior can be changed with the appropriate option (see options table). Launch it with:

python hands_track.py

Face Tracking

The script face_track.py recognizes 478 face landmarks on the face (one face is detected, this can optionally be changed in the code). The default mode does not pass to OSC stream the 478 landmarks, but the coordinates of the main parts of the face (chin, lips, mouth, cheeks, nose, eyes, eyebrows, forehead). This behavior can be changed with the appropriate option (see options table). Launch it with:

python face_track.py

Pose Tracking

The script pose_track.py recognizes 33 pose landmarks on whole body. The default mode does not pass to OSC stream the numbered 33 landmarks, but the named body parts minus some redundant ones but plus the torso and mouth centres. This behavior can be changed with the appropriate option (see options table). Launch it with:

python pose_track.py

Pose tracking works even if a whole figure is not detected, and tries to guess the coordinates of the out-of-screen body parts. For this reason, some coordinates may be outside [0.0, 1.0] range. The script has a -o / --out option that if set to 0 filters out all out-of-screen coordinates, if set to 1 clamps all the coordinates inside [0.0, 1.0] range and if set to 2 (which is the default) passes all the coordinates as they are.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
models		models
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
_config.yml		_config.yml
draw_landmarks.py		draw_landmarks.py
face_track.py		face_track.py
hands_track.py		hands_track.py
pose_track.py		pose_track.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VisiOSC

Usage

Hands Tracking

Face Tracking

Pose Tracking

About

Releases

Packages

Languages

License

valeriorlandini/visiosc

Folders and files

Latest commit

History

Repository files navigation

VisiOSC

Usage

Hands Tracking

Face Tracking

Pose Tracking

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages