Commit History

Author | SHA1 | Message | Date
Aleksandr Borzunov | 020c068344 | Log collaboration step to Wandb, store metrics only if peer is synchronized (#267) | 4 years ago
Aleksandr Borzunov | 9bb775fe04 | Log correct loss in examples/albert/run_first_peer.py (#265) | 4 years ago
Alexey Bukhtiyarov | 01103cf991 | Add state checkpointing and uploading in coordinator (#241) | 4 years ago
Aleksandr Borzunov | 3bde6188fe | Protect training progress and metrics with signatures and DHT schema validation (#250) | 4 years ago
Michael Diskin | 2314e7ebd5 | fix metrics (#240) | 4 years ago
Alexey Bukhtiyarov | 27ea94e3f9 | Add example for collaborative ALBERT training (#226) | 4 years ago