A PyTorch implementation of a cost-aware, gated BERT pool (MPC-Router) for efficient Natural Language Understanding. This project implements a Mixture of Experts (MoE) architecture with asymmetric ...