justheuristic
|
21fda756c1
Account for multi-gpu devices in examples/albert (#309)
|
4 роки тому |
Aleksandr Borzunov
|
020c068344
Log collaboration step to Wandb, store metrics only if peer is synchronized (#267)
|
4 роки тому |
Michael Diskin
|
afc59d2a6b
Log more stats for user, move performance stats to examples (#257)
|
4 роки тому |
Alexey Bukhtiyarov
|
01103cf991
Add state checkpointing and uploading in coordinator (#241)
|
4 роки тому |
Aleksandr Borzunov
|
3bde6188fe
Protect training progress and metrics with signatures and DHT schema validation (#250)
|
4 роки тому |
justheuristic
|
ddb5389e66
Fix server hanging in certain cases when connection is lost (#247)
|
4 роки тому |
Michael Diskin
|
2314e7ebd5
fix metrics (#240)
|
4 роки тому |
Alexey Bukhtiyarov
|
27ea94e3f9
Add example for collaborative ALBERT training (#226)
|
4 роки тому |