justheuristic 200fbecdbf Refactor MPFuture to use a single pipe/thread per process (#298) 4 years ago
..
layers 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) 4 years ago
__init__.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
checkpoints.py 3024d381c5 Support learning rate schedulers in ExpertBackend (#196) 4 years ago
connection_handler.py 2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256) 4 years ago
dht_handler.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
expert_backend.py 2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256) 4 years ago
expert_uid.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
runtime.py 42b9b6cef8 Use logging in benchmarks, fix libp2p-related issues (#280) 4 years ago
task_pool.py 200fbecdbf Refactor MPFuture to use a single pipe/thread per process (#298) 4 years ago