justheuristic
|
ac8a1a4308
is the new black
|
3 years ago |
justheuristic
|
0a20d44ae1
remove staleness timeout
|
3 years ago |
justheuristic
|
171993fd9d
ignore_stale_updates
|
3 years ago |
justheuristic
|
f842b91d11
ignore_stale_updates
|
3 years ago |
justheuristic
|
2462796b77
staleness update
|
3 years ago |
justheuristic
|
4dddc75d16
ignore_stale_updates
|
3 years ago |
justheuristic
|
f4b8c2cbb5
rollback
|
3 years ago |
justheuristic
|
e12740fcb3
ignore_stale_updates
|
3 years ago |
justheuristic
|
ad82b1ca20
ignore_stale_updates
|
3 years ago |
justheuristic
|
927e31f27d
ignore_stale_updates
|
3 years ago |
justheuristic
|
ef5bb9f0d5
ignore stale updates
|
3 years ago |
justheuristic
|
251cd3450e
fix a bug that incorrectly accounted for step tolerance in CollaborativeOptimizer
|
3 years ago |
justheuristic
|
9f5fc866c8
fix a bug that incorrectly accounted for step tolerance in CollaborativeOptimizer
|
3 years ago |
Alexander Borzunov
|
b84f62bc08
Make log handlers configurable, shorten entries (#378)
|
4 years ago |
justheuristic
|
2500e3dd58
Fix "Too many open files" and load state freezing (#371)
|
4 years ago |
Michael Diskin
|
bedfa6eefb
Reorder imports with isort (#326)
|
4 years ago |
Alexander Borzunov
|
0774937a93
Refactor naming and serialization for PeerIDs (#339)
|
4 years ago |
Alexander Borzunov
|
3f691fced4
Convert averager to libp2p backend (#323)
|
4 years ago |
justheuristic
|
11db5fd56f
Refactor for v0.9.10 and fix example (#319)
|
4 years ago |
Max Ryabinin
|
2f07a556e6
Reformat code with Black (#274)
|
4 years ago |
Michael Diskin
|
cc8d39c2ea
Update readthedocs with hivemind.optim (#288)
|
4 years ago |
Max Ryabinin
|
5233b6c085
Split hivemind.client into hivemind.averaging and hivemind.moe (#304)
|
4 years ago |
Michael Diskin
|
86f3c0dd0d
Support auxiliary peers in CollaborativeOptimizer (#279)
|
4 years ago |
foksly
|
e58f65db33
Support auxiliary participants in AllReduceProtocol (#260)
|
4 years ago |
Michael Diskin
|
afc59d2a6b
Log more stats for user, move performance stats to examples (#257)
|
4 years ago |
Alexey Bukhtiyarov
|
01103cf991
Add state checkpointing and uploading in coordinator (#241)
|
4 years ago |
Aleksandr Borzunov
|
3bde6188fe
Protect training progress and metrics with signatures and DHT schema validation (#250)
|
4 years ago |
justheuristic
|
ddb5389e66
Fix server hanging in certain cases when connection is lost (#247)
|
4 years ago |
Alexey Bukhtiyarov
|
27ea94e3f9
Add example for collaborative ALBERT training (#226)
|
4 years ago |
Roman Zhytar
|
8c3bd93e87
Statistics averaging (#229)
|
4 years ago |