Commit History

Author SHA1 Message Date
  justheuristic b20f3ee985 grad logits wrt actual logits 5 years ago
  justheuristic be3119b12e add basic moe correctness test 5 years ago
  justheuristic 662357fcb3 reweigh grads correctly 5 years ago
  justheuristic 153ab20232 change order of grads 5 years ago
  justheuristic 284250d00c change order of grads 5 years ago
  justheuristic c5ee3d6041 only return grad w.r.t. inputs 5 years ago
  justheuristic 05e7c92f3d unpack tuple 5 years ago
  justheuristic 5cbcf79b00 list -> tensor 5 years ago
  justheuristic c8889bde96 list -> tensor 5 years ago
  justheuristic 8030c075c9 use lists for gatehr 5 years ago
  justheuristic 49e4459ec8 do not .detach non-tensor parameters 5 years ago
  justheuristic 97c4003e5c enumerate 5 years ago
  justheuristic 60af3952c9 flag to remove optimizer 5 years ago
  justheuristic 9a4e306f39 flag to remove optimizer 5 years ago
  justheuristic 80ab75583f wip: parallel fault-tolerant moe backward pass 5 years ago
  justheuristic 2b2ddf8280 wip: parallel fault-tolerant moe backward pass 5 years ago
  justheuristic 6fb99c8746 wip: parallel fault-tolerant moe backward pass 5 years ago
  justheuristic ebe07eebfd typo 5 years ago
  justheuristic 88d1bdc025 unused imports 5 years ago
  justheuristic c58d08cc06 remove run_and_await_k completely, rename gating_function to moe 5 years ago
  justheuristic 4a33e155b6 remove run_and_await_k completely 5 years ago
  justheuristic 5016002186 remove dependency on run_and_await_k, rename GatingFunction to RemoteMixtureOfExperts 5 years ago
  justheuristic e71bb5428f change print time for network 5 years ago
  justheuristic ba1533b7bb add lifetime option for server and dht 5 years ago
  justheuristic 7c8d091633 deduplicate args 5 years ago
  justheuristic 7fe3b8d7a5 clarify network shutdown 5 years ago
  justheuristic 3ceb24d07d separate dht script 5 years ago
  justheuristic fbea1907a8 unused imports 5 years ago
  justheuristic 40351c59cc verbose printing 5 years ago
  justheuristic 7f4530b210 verbose printing 5 years ago