Authors: Kenichi Kumatani, Robert Gmyr, Felipe Cruz Salinas, Linquan Liu, Wei Zuo
DOI:
Keywords: End-to-end model, Mixture of experts, Transformers
Abstract: … The sparsely-gated Mixture of Experts (MoE) can magnify a … More specifically, we apply the sparsely-gated MoE technique to two …