AIForce/hivemind

作者	SHA1 备注	提交日期
xtinkt	eedfae4bd4 Merge branch 'decentralized_lr_scheduler' of https://github.com/learning-at-home/hivemind into decentralized_lr_scheduler	4 年之前
xtinkt	be77054fad sync peer in constructor id needed	4 年之前
Anton Sinitsin	edfc3c005b Update hivemind/client/averaging/training.py	4 年之前
Anton Sinitsin	26ac1c7299 Update hivemind/optim/averaged.py	4 年之前
Anton Sinitsin	fe41e0d95d Update hivemind/optim/averaged.py	4 年之前
xtinkt	017fbe43a2 change DecentralizedState to TrainingState	4 年之前
xtinkt	c27925de26 fix some comments	4 年之前
xtinkt	42423cfe6b fix some pr issues	4 年之前
xtinkt	517018b136 added scheduler state loading and saveing to averager	4 年之前
xtinkt	937e622b8b fix some issues in pr	4 年之前
xtinkt	15f1e9b0b5 fix some issues in pr	4 年之前
xtinkt	14404e4168 fix some issues in pr	4 年之前
xtinkt	97cec29e13 Added info about new params. Changed some names.	4 年之前
xtinkt	de01a9cb15 add decentralized learning rate scheduler and epochs abstraction	4 年之前
Aleksandr Borzunov	08ee017f0f Add nltk to ALBERT example's requirements (#251)	4 年之前
Roman Zhytar	e833a7efb9 Decentralized adaptive optimizers (#243)	4 年之前
Aleksandr Borzunov	18add2c04b Implement combining validators (#249)	4 年之前
Max Ryabinin	0a1fdb172f Fix incorrect data types/values in RemoteSwitchMixtureOfExperts (#246)	4 年之前
Max Ryabinin	dfbc401196 Add Dockerfile, refactor tests (#245)	4 年之前
justheuristic	ddb5389e66 Fix server hanging in certain cases when connection is lost (#247)	4 年之前
Aleksandr Borzunov	a3feafa907 Add DHT schema validator (#227)	4 年之前
Michael Diskin	2314e7ebd5 fix metrics (#240)	4 年之前
Alexey Bukhtiyarov	27ea94e3f9 Add example for collaborative ALBERT training (#226)	4 年之前
Max Ryabinin	62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228)	4 年之前
justheuristic	3d6a242e30 Ensure version-consistent result rounding in load_balance_peers (#230)	4 年之前
Roman Zhytar	8c3bd93e87 Statistics averaging (#229)	4 年之前
Vsevolod-pl	91d17a4ebc Delta gradients transmission (#225)	4 年之前
romakail	ca5c7610ae Add tool for custom user experts (#189)	4 年之前
justheuristic	32b87bf3fe Reset gradient buffers when synchronizing with peers (#222)	4 年之前
justheuristic	b906ae94ed better zero_grad behavior in CollaborativeOptimizer (#221)	4 年之前

更新的提交更旧的提交

提交历史 查找

提交历史