.. |
layers
|
62652e1717
Add Switch Transformers-like RemoteMixtureOfExperts (#228)
|
4 jaren geleden |
__init__.py
|
f0c5627139
Improve error handling, remove deprecated functionality (#261)
|
4 jaren geleden |
checkpoints.py
|
3024d381c5
Support learning rate schedulers in ExpertBackend (#196)
|
4 jaren geleden |
connection_handler.py
|
2328ba9262
Fix device in Switch-MoE, overhaul Server architecture (#256)
|
4 jaren geleden |
dht_handler.py
|
f0c5627139
Improve error handling, remove deprecated functionality (#261)
|
4 jaren geleden |
expert_backend.py
|
2328ba9262
Fix device in Switch-MoE, overhaul Server architecture (#256)
|
4 jaren geleden |
expert_uid.py
|
f0c5627139
Improve error handling, remove deprecated functionality (#261)
|
4 jaren geleden |
runtime.py
|
42b9b6cef8
Use logging in benchmarks, fix libp2p-related issues (#280)
|
4 jaren geleden |
task_pool.py
|
200fbecdbf
Refactor MPFuture to use a single pipe/thread per process (#298)
|
4 jaren geleden |