Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleApril 2024
Mixture-of-experts with expert choice routing
- Yanqi Zhou,
- Tao Lei,
- Hanxiao Liu,
- Nan Du,
- Yanping Huang,
- Vincent Y. Zhao,
- Andrew Dai,
- Zhifeng Chen,
- Quoc Le,
- James Laudon
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsNovember 2022, Article No.: 515, Pages 7103–7114Sparsely-activated Mixture-of-experts (MoE) models allow the number of parameters to greatly increase while keeping the amount of computation for a given token or a given sample unchanged. However, a poor expert routing strategy can cause certain experts ...
- doctoral_thesisJanuary 2020
Towards Robust Representation Learning and Beyond
AbstractDeep networks have reshaped the computer vision research in recent years. As fueled by powerful computational resources and massive amount of data, deep networks now dominate a wide range of visual benchmarks. Nonetheless, these success stories ...