Max Ryabinin 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) %!s(int64=4) %!d(string=hai) anos
..
__init__.py 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) %!s(int64=4) %!d(string=hai) anos
common.py 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) %!s(int64=4) %!d(string=hai) anos
custom_experts.py ca5c7610ae Add tool for custom user experts (#189) %!s(int64=4) %!d(string=hai) anos
dropout.py ca5c7610ae Add tool for custom user experts (#189) %!s(int64=4) %!d(string=hai) anos
lr_schedule.py 3024d381c5 Support learning rate schedulers in ExpertBackend (#196) %!s(int64=4) %!d(string=hai) anos