Commit History

Autor SHA1 Mensaxe Data
  xtinkt eedfae4bd4 Merge branch 'decentralized_lr_scheduler' of https://github.com/learning-at-home/hivemind into decentralized_lr_scheduler %!s(int64=4) %!d(string=hai) anos
  xtinkt be77054fad sync peer in constructor id needed %!s(int64=4) %!d(string=hai) anos
  Anton Sinitsin edfc3c005b Update hivemind/client/averaging/training.py %!s(int64=4) %!d(string=hai) anos
  Anton Sinitsin 26ac1c7299 Update hivemind/optim/averaged.py %!s(int64=4) %!d(string=hai) anos
  Anton Sinitsin fe41e0d95d Update hivemind/optim/averaged.py %!s(int64=4) %!d(string=hai) anos
  xtinkt 017fbe43a2 change DecentralizedState to TrainingState %!s(int64=4) %!d(string=hai) anos
  xtinkt c27925de26 fix some comments %!s(int64=4) %!d(string=hai) anos
  xtinkt 42423cfe6b fix some pr issues %!s(int64=4) %!d(string=hai) anos
  xtinkt 517018b136 added scheduler state loading and saveing to averager %!s(int64=4) %!d(string=hai) anos
  xtinkt 937e622b8b fix some issues in pr %!s(int64=4) %!d(string=hai) anos
  xtinkt 15f1e9b0b5 fix some issues in pr %!s(int64=4) %!d(string=hai) anos
  xtinkt 14404e4168 fix some issues in pr %!s(int64=4) %!d(string=hai) anos
  xtinkt 97cec29e13 Added info about new params. Changed some names. %!s(int64=4) %!d(string=hai) anos
  xtinkt de01a9cb15 add decentralized learning rate scheduler and epochs abstraction %!s(int64=4) %!d(string=hai) anos
  Aleksandr Borzunov 08ee017f0f Add nltk to ALBERT example's requirements (#251) %!s(int64=4) %!d(string=hai) anos
  Roman Zhytar e833a7efb9 Decentralized adaptive optimizers (#243) %!s(int64=4) %!d(string=hai) anos
  Aleksandr Borzunov 18add2c04b Implement combining validators (#249) %!s(int64=4) %!d(string=hai) anos
  Max Ryabinin 0a1fdb172f Fix incorrect data types/values in RemoteSwitchMixtureOfExperts (#246) %!s(int64=4) %!d(string=hai) anos
  Max Ryabinin dfbc401196 Add Dockerfile, refactor tests (#245) %!s(int64=4) %!d(string=hai) anos
  justheuristic ddb5389e66 Fix server hanging in certain cases when connection is lost (#247) %!s(int64=4) %!d(string=hai) anos
  Aleksandr Borzunov a3feafa907 Add DHT schema validator (#227) %!s(int64=4) %!d(string=hai) anos
  Michael Diskin 2314e7ebd5 fix metrics (#240) %!s(int64=4) %!d(string=hai) anos
  Alexey Bukhtiyarov 27ea94e3f9 Add example for collaborative ALBERT training (#226) %!s(int64=4) %!d(string=hai) anos
  Max Ryabinin 62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228) %!s(int64=4) %!d(string=hai) anos
  justheuristic 3d6a242e30 Ensure version-consistent result rounding in load_balance_peers (#230) %!s(int64=4) %!d(string=hai) anos
  Roman Zhytar 8c3bd93e87 Statistics averaging (#229) %!s(int64=4) %!d(string=hai) anos
  Vsevolod-pl 91d17a4ebc Delta gradients transmission (#225) %!s(int64=4) %!d(string=hai) anos
  romakail ca5c7610ae Add tool for custom user experts (#189) %!s(int64=4) %!d(string=hai) anos
  justheuristic 32b87bf3fe Reset gradient buffers when synchronizing with peers (#222) %!s(int64=4) %!d(string=hai) anos
  justheuristic b906ae94ed better zero_grad behavior in CollaborativeOptimizer (#221) %!s(int64=4) %!d(string=hai) anos