How can I use the trained model to do speech enhancement? #2

tuliang1996 · 2019-03-07T08:24:15Z

Just like using a noisy speech as input, such as a wav file, outputting enhanced speech。
But I have not found any function about enhanced。

lifelongeek · 2019-03-07T09:14:50Z

You can add '--mode test --load_path PATH_TO_PRETRAINED_MODEL' to the training script.

For example,
python main.py --mode test --trainer AAS --DB_name chime --rnn_size 500 --rnn_layers 4 --ASR_path ../AM_training/models/librispeech_final.pth.tar --load_path /data/kenkim/AAS_enhancement/model.pth.tar

If you request, I will upload pre-trained model as well.

tuliang1996 · 2019-03-07T09:25:43Z

Thank you for your reply.
I found the following code in main.py,
If (config.mode == 'train'):
         Trainer.train() # VAE
     Elif(config.mode == 'test'):
         Trainer.test()
     Elif(config.mode == 'visualize'):
         Trainer.visualize()
So, I went to the trainer_AAS.py file to find these functions, I found the function train() but I didn't find the function test(). This makes me confused.
I will train a model myself first. If I have some problems, I will Come back for your help.

tuliang1996 · 2019-03-11T08:55:05Z

sorry to disturb you.
Can I add '--mode test --load_path PATH_TO_PRETRAINED_MODEL' to other models, such as FSEGAN?
and use the default 300 epochs and 20 batch size for the CHiME-4 dataset on 1080ti devices，how long will i spent?
Or can you tell me the time details about your training?

lifelongeek · 2019-03-12T08:39:42Z

For train/test FSEGAN
You can use '--mode test --trainer FSEGAN --load_path PATH_TO_PRETRAINED_MODEL' for test FSEGAN model. I found that main.py does not link to trainer_FSEGAN.py so i just added now.
Training time
For the results in the paper, I train the model with maximum epoch = 100. In my case, it takes roughly 3 days on Titan machine per experiment.
Although maximum epoch may be depends on model, learning algorithm and problem complexity, maximum epoch 100 might be too large for current setting. You can observe loss curve, and if there seems 'clear overfitting' on validation data, you can stop training.

tuliang1996 · 2019-03-12T12:20:31Z

Thank you
I think I need your pre-training model.
And use Chinese speech data for transfer learning.

lifelongeek · 2019-05-16T06:26:09Z

Sorry for late upload. Check main page :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can I use the trained model to do speech enhancement? #2

How can I use the trained model to do speech enhancement? #2

tuliang1996 commented Mar 7, 2019

lifelongeek commented Mar 7, 2019

tuliang1996 commented Mar 7, 2019

tuliang1996 commented Mar 11, 2019

lifelongeek commented Mar 12, 2019 •

edited

Loading

tuliang1996 commented Mar 12, 2019

lifelongeek commented May 16, 2019

How can I use the trained model to do speech enhancement? #2

How can I use the trained model to do speech enhancement? #2

Comments

tuliang1996 commented Mar 7, 2019

lifelongeek commented Mar 7, 2019

tuliang1996 commented Mar 7, 2019

tuliang1996 commented Mar 11, 2019

lifelongeek commented Mar 12, 2019 • edited Loading

tuliang1996 commented Mar 12, 2019

lifelongeek commented May 16, 2019

lifelongeek commented Mar 12, 2019 •

edited

Loading