A generative system that listens to an audio input and "reacts" by playing output similar to the input, creating a "duet" between the human and the machine. Similar to the Continuator, but duet-able (:
Requirements:
- Python 3.11
- Poetry

```shell
poetry install
```

For Essentia, activate your virtual environment and from there:

```shell
pip install essentia
brew install sdl2
pip install basic-pitch
```

For PyAudio:

```shell
brew install portaudio
```
We have implemented three different audio-to-MIDI wrappers built on the APIs of Essentia, Aubio, and Basic Pitch (Spotify). Essentia and Basic Pitch are not well suited for real-time applications. Which one to use is a matter of configuration; the default is `AudioToMidiWithAubio`.
Example configuration for streaming an audio input signal:

```python
settings = DuetableSettings()
settings.buffer_length = 4

app = Duetable(
    converter=AudioToMidiWithAubio(down_sample=1),
    settings=settings,
    # device_name="U46",
    regenerator=MuptRegenerator(),
)
app.read()
```
By default, audio is read from the built-in microphone (`'MacBook Pro Microphone'`). To change this, set the correct `device_name`. On start, the stream class logs all available input devices to the console.
To run the app from the console together with a server for the Max/MSP UI:

```shell
poetry run python src/max_msp_ui/max_osc_ui.py
```
Multiple parameters can be provided at runtime to configure the system. These are:
- `recording_strategy` - how the MIDI buffer is filled. Options are:
  - `NOTES` - record until the number of notes reaches `buffer_length`
  - `TIME` - record until the elapsed time reaches `buffer_time`
  - `NOTES_ONCE` - record only once, until the number of notes reaches `buffer_length`
  - `TIME_ONCE` - record only once, until the elapsed time reaches `buffer_time`
- `record_when_playing` - whether the system should record while playing
- `append_to_play_buffer` - if set, newly detected notes are appended to the previous ones
- `upper_meter` - upper meter for the regenerated melody
- `lower_meter` - lower meter for the regenerated melody
- `bpm` - tempo for the regenerated melody
- `sleep_with_note` - whether the MIDI player should sleep for the note duration or for the time calculated from tempo and meter
- `regenerator` - method for regenerating the recorded MIDI notes; possible values are:
  - `MuptRegenerator` - regenerates the melody using MuPT (https://arxiv.org/abs/2404.06393)
  - `HttpMuptRegenerator` - regenerates the melody using the MuPT API
  - `MuptWithMarkovChainRegenerator` - the first regeneration is executed with the MuPT API, all further iterations with a Markov chain
  - `MarkovChainRegenerator` - regenerates the melody using a Markov chain
  - `DummyRegenerator` - returns the same melody as the input
- `n_bars` - number of bars to generate with the MuPT API
- `temperature` - temperature for melody generation with the MuPT API
- `model_size` - LLM model type for the MuPT API; possible values: `small`, `large`
- `loop_playback` - if set, loops the playback of the regenerated melody (disables the question-answer melody style)
- `transformers` - list of transformers to apply to the input melody after regeneration; possible values are:
  - `RandomMuteTransformer` - mutes random notes in the melody; accepts a function to determine the probability of muting
  - `SimpleTransposeTransformer` - transposes melody notes by a random number of semitones; accepts a function to determine the range of transposition
  - `SimpleTimeTransformer` - changes the duration of a note by a random factor; accepts a function to determine the range of the time change
  - `MidiRangeTransformer` - clips the MIDI range of the melody; accepts from/to MIDI values
  - `FixedRangeTransformer` - creates a pitch-wise correlation between input data and output
  - `ApproachNotesTransformer` - creates an approach note for each note in the melody
- `converter` - plugin responsible for audio-to-MIDI conversion; possible values are:
  - `AudioToMidiWithEssentia` - uses Essentia for audio-to-MIDI conversion (not fully implemented yet!)
  - `AudioToMidiWithAubio` - uses Aubio for audio-to-MIDI conversion
  - `AudioToMidiWithBasicPitch` - uses Basic Pitch for audio-to-MIDI conversion (uses temporary files because the library does not support buffers; needs a better implementation!)
  - `AudioToMidiDummy` - no MIDI detection; returns random notes
- `device_name` - name of the audio input device; default: `'MacBook Pro Microphone'`
- `open_output` - method for setting up the output MIDI device; can route to many devices at once

Low-level settings:
- `frames_per_buffer` - frames per buffer for audio-to-MIDI conversion and audio in; default 512
- `sample_rate` - sample rate for audio-to-MIDI conversion and audio in; default 44100
- `win_s` - window size for audio-to-MIDI conversion; default 1024
- `hop_s` - hop size for audio-to-MIDI conversion and audio in; default 128
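The runtime parameters above can be combined into a fuller configuration. The following is a hedged sketch only: the attribute and class names are taken from this document, but setting them this way is an assumption about the project's API:

```python
settings = DuetableSettings()
settings.buffer_length = 8        # notes per buffer (NOTES strategy)
settings.bpm = 120                # tempo for the regenerated melody
settings.upper_meter = 4          # upper meter
settings.lower_meter = 4          # lower meter
settings.frames_per_buffer = 512  # documented default
settings.sample_rate = 44100      # documented default

app = Duetable(
    converter=AudioToMidiWithAubio(down_sample=1),
    settings=settings,
    regenerator=MarkovChainRegenerator(),
)
app.read()
```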
For usage examples, please check the files `act_1.py`, `act_2.py`, `act_3.py`, and `act_3_1.py`, which were used during the demo performance.
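To illustrate the idea behind `MarkovChainRegenerator`, here is a minimal first-order Markov chain over MIDI pitches. This is a hedged sketch of the general technique, not the project's actual implementation; the function names are ours:

```python
import random

def build_transitions(pitches):
    """Count first-order transitions between consecutive MIDI pitches."""
    table = {}
    for a, b in zip(pitches, pitches[1:]):
        table.setdefault(a, []).append(b)
    return table

def regenerate(pitches, length, seed=None):
    """Walk the transition table to produce a new melody of `length` notes."""
    rng = random.Random(seed)
    table = build_transitions(pitches)
    note = pitches[0]
    out = [note]
    while len(out) < length:
        # fall back to any input note if the current pitch has no successor
        note = rng.choice(table.get(note, pitches))
        out.append(note)
    return out

melody = [60, 62, 64, 62, 60, 67, 64, 62]  # recorded MIDI pitches
print(regenerate(melody, 8, seed=42))
```

The output stays in the pitch vocabulary of the input, which is why a Markov chain makes a cheap, real-time-friendly fallback after an initial MuPT regeneration.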
Endpoint: POST `http://<SERVER_URL>/generate/`

Request params:
- `prefix` - start of the melody in ABC notation
- `n_bars` - number of bars to generate after the prefix (default: 2)
- `temperature` - temperature for sampling (default: 1.0)
- `n_samples` - number of samples to generate from (default: 3)
- `model` - type of the model, `['large', 'small']` (default: `large`)

Example request body:
```json
{
  "prefix": "X:1<n>L:1/8<n>Q:1/8=200<n>M:4/4|: BGdB",
  "n_bars": 4,
  "temperature": 0.7,
  "n_samples": 5,
  "model": "small"
}
```
Example response:

```json
{
  "status_code": 200,
  "headers": {
    "date": "Wed, 19 Jun 2024 15:20:15 GMT",
    "server": "uvicorn",
    "content-length": "97",
    "content-type": "application/json"
  },
  "content": {
    "melody": "X:1\nL:1/8\nQ:1/8=200\nM:4/4\n BGdB Af g2 | g2 dc BG G2 | cAFA cAFA | BGdB A2 G2 |\n"
  }
}
```
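As a sketch, the request above can be assembled and sent from Python using only the standard library. The endpoint path and field names are from the docs above; `SERVER_URL` and the helper name are assumptions:

```python
import json
import urllib.request

SERVER_URL = "http://localhost:8000"  # assumed; replace with your deployment

def build_generate_request(prefix, n_bars=2, temperature=1.0,
                           n_samples=3, model="large"):
    """Assemble the JSON body for POST /generate/ with the documented defaults."""
    return {
        "prefix": prefix,
        "n_bars": n_bars,
        "temperature": temperature,
        "n_samples": n_samples,
        "model": model,
    }

body = build_generate_request(
    "X:1<n>L:1/8<n>Q:1/8=200<n>M:4/4|: BGdB",
    n_bars=4, temperature=0.7, n_samples=5, model="small",
)
print(json.dumps(body))

# To actually send it (requires a running server):
# req = urllib.request.Request(
#     f"{SERVER_URL}/generate/",
#     data=json.dumps(body).encode(),
#     headers={"Content-Type": "application/json"},
# )
# melody = json.loads(urllib.request.urlopen(req).read())["content"]["melody"]
```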