Max Ryabinin 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) 4 years ago
..
layers 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) 4 years ago
__init__.py 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) 4 years ago
checkpoints.py 3024d381c5 Support learning rate schedulers in ExpertBackend (#196) 4 years ago
connection_handler.py 916c3db52d Move compression-related code to hivemind.utils.compression (#213) 4 years ago
dht_handler.py f132294edb Extract expert-specific methods from DHT (#192) 4 years ago
expert_backend.py ca5c7610ae Add tool for custom user experts (#189) 4 years ago
expert_uid.py 3024d381c5 Support learning rate schedulers in ExpertBackend (#196) 4 years ago
runtime.py 6128cbbd51 Add gradient clipping support to ExpertBackend (#214) 4 years ago
task_pool.py 6f8f192150 Improve Runtime exception handling (#207) 4 years ago