justheuristic 200fbecdbf Refactor MPFuture to use a single pipe/thread per process (#298) 4 years ago
..
layers 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) 4 years ago
__init__.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
checkpoints.py 3024d381c5 Support learning rate schedulers in ExpertBackend (#196) 4 years ago
connection_handler.py 2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256) 4 years ago
dht_handler.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
expert_backend.py 2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256) 4 years ago
expert_uid.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
runtime.py 42b9b6cef8 Use logging in benchmarks, fix libp2p-related issues (#280) 4 years ago
task_pool.py 200fbecdbf Refactor MPFuture to use a single pipe/thread per process (#298) 4 years ago