Skip to content

dataset #5

Answered by jhauret
andyye1999 asked this question in Q&A
Apr 5, 2023 · 1 comments · 5 replies
Discussion options

You must be logged in to vote

Hi,

Our approach can be applied to any body-conduction microphone including bone, in-ear, and throat microphones.

If your dataset is limited, you can try to perform a pre-training on simulated data (just as we did by applying a low-pass filter on clean speech with roughly the same characteristics as the mic you're tackling).

The size of the finetuning dataset is a question that we are currently exploring. We are planning to record 50 hours of air and body-conducted speech. We believe it will be sufficient with well-adapted hyperparameters.

Good luck with your application!

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@andyye1999
Comment options

@jhauret
Comment options

@andyye1999
Comment options

@andyye1999
Comment options

@andyye1999
Comment options

Answer selected by jhauret
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants