| .. |
|
layers
|
62652e1717
Add Switch Transformers-like RemoteMixtureOfExperts (#228)
|
4 years ago |
|
__init__.py
|
62652e1717
Add Switch Transformers-like RemoteMixtureOfExperts (#228)
|
4 years ago |
|
checkpoints.py
|
3024d381c5
Support learning rate schedulers in ExpertBackend (#196)
|
4 years ago |
|
connection_handler.py
|
916c3db52d
Move compression-related code to hivemind.utils.compression (#213)
|
4 years ago |
|
dht_handler.py
|
f132294edb
Extract expert-specific methods from DHT (#192)
|
4 years ago |
|
expert_backend.py
|
ca5c7610ae
Add tool for custom user experts (#189)
|
4 years ago |
|
expert_uid.py
|
3024d381c5
Support learning rate schedulers in ExpertBackend (#196)
|
4 years ago |
|
runtime.py
|
6128cbbd51
Add gradient clipping support to ExpertBackend (#214)
|
4 years ago |
|
task_pool.py
|
6f8f192150
Improve Runtime exception handling (#207)
|
4 years ago |