justheuristic 200fbecdbf Refactor MPFuture to use a single pipe/thread per process (#298) 4 years ago
layers 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) 4 years ago
__init__.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
checkpoints.py 3024d381c5 Support learning rate schedulers in ExpertBackend (#196) 4 years ago
connection_handler.py 2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256) 4 years ago
dht_handler.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
expert_backend.py 2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256) 4 years ago
expert_uid.py f0c5627139 Improve error handling, remove deprecated functionality (#261) 4 years ago
runtime.py 42b9b6cef8 Use logging in benchmarks, fix libp2p-related issues (#280) 4 years ago
task_pool.py 200fbecdbf Refactor MPFuture to use a single pipe/thread per process (#298) 4 years ago