dataset #5

andyye1999 · 2023-04-05T14:19:28Z

andyye1999
Apr 5, 2023

Hello author, after reading your article, I thought of my current application scenario which involves mapping bone conduction speech signals to air-conducted speech signals. However, the dataset is currently limited. How does the size of the dataset affect the performance of an EBNE network?

Answered by jhauret

Apr 5, 2023

Hi,

Our approach can be applied to any body-conduction microphone including bone, in-ear, and throat microphones.

If your dataset is limited, you can try to perform a pre-training on simulated data (just as we did by applying a low-pass filter on clean speech with roughly the same characteristics as the mic you're tackling).

The size of the finetuning dataset is a question that we are currently exploring. We are planning to record 50 hours of air and body-conducted speech. We believe it will be sufficient with well-adapted hyperparameters.

Good luck with your application!

View full answer

jhauret · 2023-04-05T15:34:57Z

jhauret
Apr 5, 2023
Maintainer

Hi,

Our approach can be applied to any body-conduction microphone including bone, in-ear, and throat microphones.

If your dataset is limited, you can try to perform a pre-training on simulated data (just as we did by applying a low-pass filter on clean speech with roughly the same characteristics as the mic you're tackling).

The size of the finetuning dataset is a question that we are currently exploring. We are planning to record 50 hours of air and body-conducted speech. We believe it will be sufficient with well-adapted hyperparameters.

Good luck with your application!

5 replies

andyye1999 Apr 5, 2023
Author

I have a dataset of about 5 hours, do you think the model will converge?

jhauret Apr 5, 2023
Maintainer

Idk, deep learning is an experimental science, so you should give it a try. With adequate pre-training, it might work. 😉

andyye1999 Apr 5, 2023
Author

i am working on it and hope it can work
I am very interested in your research and would like to express my gratitude.

andyye1999 Apr 6, 2023
Author

Hi,

Our approach can be applied to any body-conduction microphone including bone, in-ear, and throat microphones.

If your dataset is limited, you can try to perform a pre-training on simulated data (just as we did by applying a low-pass filter on clean speech with roughly the same characteristics as the mic you're tackling).

The size of the finetuning dataset is a question that we are currently exploring. We are planning to record 50 hours of air and body-conducted speech. We believe it will be sufficient with well-adapted hyperparameters.

Good luck with your application!

I previously used a mapping method similar to the speech enhancement model DCCRN and found that it could only map up to 4000Hz. It was unable to map frequencies between 4000Hz and 8000Hz. Currently, I am trying your model.

andyye1999 Apr 6, 2023
Author

The size of the finetuning dataset is a question that we are currently exploring. We are planning to record 50 hours of air and body-conducted speech. We believe it will be sufficient with well-adapted hyperparameters.

I have found some datasets data1 But in the country where I am located, it is not possible to download such a large dataset :( You can download this dataset and take a look at its data quality. Interestingly, we have previously discovered the same dataset. data2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dataset #5

{{title}}

Replies: 1 comment 5 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

dataset #5

andyye1999 Apr 5, 2023

Replies: 1 comment · 5 replies

jhauret Apr 5, 2023 Maintainer

andyye1999 Apr 5, 2023 Author

jhauret Apr 5, 2023 Maintainer

andyye1999 Apr 5, 2023 Author

andyye1999 Apr 6, 2023 Author

andyye1999 Apr 6, 2023 Author

andyye1999
Apr 5, 2023

Replies: 1 comment 5 replies

jhauret
Apr 5, 2023
Maintainer

andyye1999 Apr 5, 2023
Author

jhauret Apr 5, 2023
Maintainer

andyye1999 Apr 5, 2023
Author

andyye1999 Apr 6, 2023
Author

andyye1999 Apr 6, 2023
Author