Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ctc_fsmn_base 模型的自训练问题---和阿里官方的模型对比 #185

Open
ltcxjtu opened this issue Feb 10, 2025 · 0 comments
Open

Comments

@ltcxjtu
Copy link

ltcxjtu commented Feb 10, 2025

自己训练的base模型没有官方给的好;自己拿wenet_speech+kespeech数据(重新用paraformer洗过一遍,还有10000+h,)训练一个base模型,在问问的数据集上字准大概在20%,但阿里开源出来的base模型字准在30%,感觉base模型训练有一些门道,是不是现有的训练方法还是没有达到最优? 训练数据是不一样的;可参考ali的base https://modelscope.cn/models/iic/speech_charctc_kws_phone-xiaoyun

@ltcxjtu ltcxjtu changed the title ctc_fsmn_base ctc_fsmn_base 模型的自训练问题---和阿里官方的模型对比 Feb 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant