Commit History

Autor SHA1 Mensaxe Data
  justheuristic 676066baed wip: implement grad wrt logits %!s(int64=5) %!d(string=hai) anos
  justheuristic 87b2f8b635 wip: implement grad wrt logits %!s(int64=5) %!d(string=hai) anos
  justheuristic 3d93ff1400 background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic 7bdb5d919b background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic f611f004f9 background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic 87a5916d74 background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic 384ccc1115 background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic cbeb07205f background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic d703c8d4c5 background_server is now a contextmanager %!s(int64=5) %!d(string=hai) anos
  justheuristic 9a8320c106 pep8 %!s(int64=5) %!d(string=hai) anos
  justheuristic aa0743c587 pep8 %!s(int64=5) %!d(string=hai) anos
  justheuristic 6605b00d05 safer shutdown order %!s(int64=5) %!d(string=hai) anos
  justheuristic f9798a474a unified prefix scheme %!s(int64=5) %!d(string=hai) anos
  justheuristic cbf1c42df1 unified prefix scheme %!s(int64=5) %!d(string=hai) anos
  justheuristic dfa9dfaae2 move to notes %!s(int64=5) %!d(string=hai) anos
  justheuristic 8931c56f73 move to notes %!s(int64=5) %!d(string=hai) anos
  justheuristic b20f3ee985 grad logits wrt actual logits %!s(int64=5) %!d(string=hai) anos
  justheuristic be3119b12e add basic moe correctness test %!s(int64=5) %!d(string=hai) anos
  justheuristic 662357fcb3 reweigh grads correctly %!s(int64=5) %!d(string=hai) anos
  justheuristic 153ab20232 change order of grads %!s(int64=5) %!d(string=hai) anos
  justheuristic 284250d00c change order of grads %!s(int64=5) %!d(string=hai) anos
  justheuristic c5ee3d6041 only return grad w.r.t. inputs %!s(int64=5) %!d(string=hai) anos
  justheuristic 05e7c92f3d unpack tuple %!s(int64=5) %!d(string=hai) anos
  justheuristic 5cbcf79b00 list -> tensor %!s(int64=5) %!d(string=hai) anos
  justheuristic c8889bde96 list -> tensor %!s(int64=5) %!d(string=hai) anos
  justheuristic 8030c075c9 use lists for gatehr %!s(int64=5) %!d(string=hai) anos
  justheuristic 49e4459ec8 do not .detach non-tensor parameters %!s(int64=5) %!d(string=hai) anos
  justheuristic 97c4003e5c enumerate %!s(int64=5) %!d(string=hai) anos
  justheuristic 60af3952c9 flag to remove optimizer %!s(int64=5) %!d(string=hai) anos
  justheuristic 9a4e306f39 flag to remove optimizer %!s(int64=5) %!d(string=hai) anos