justheuristic 785e115d89 wip: implement grad wrt logits 5 лет назад
..
__init__.py c58d08cc06 remove run_and_await_k completely, rename gating_function to moe 5 лет назад
expert.py 6fb99c8746 wip: parallel fault-tolerant moe backward pass 5 лет назад
moe.py 785e115d89 wip: implement grad wrt logits 5 лет назад