This repository contains the code required to reproduce the expert pruning and merging methods used in the paper "REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression". Expert pruning and ...