Name | Commit | Last commit message | Updated
layers | 62652e1717 | Add Switch Transformers-like RemoteMixtureOfExperts (#228) | 4 years ago
__init__.py | 62652e1717 | Add Switch Transformers-like RemoteMixtureOfExperts (#228) | 4 years ago
checkpoints.py | 3024d381c5 | Support learning rate schedulers in ExpertBackend (#196) | 4 years ago
connection_handler.py | 916c3db52d | Move compression-related code to hivemind.utils.compression (#213) | 4 years ago
dht_handler.py | f132294edb | Extract expert-specific methods from DHT (#192) | 4 years ago
expert_backend.py | ca5c7610ae | Add tool for custom user experts (#189) | 4 years ago
expert_uid.py | 3024d381c5 | Support learning rate schedulers in ExpertBackend (#196) | 4 years ago
runtime.py | 6128cbbd51 | Add gradient clipping support to ExpertBackend (#214) | 4 years ago
task_pool.py | 6f8f192150 | Improve Runtime exception handling (#207) | 4 years ago