AIForce/hivemind

Autor	SHA1 Mensaje	Fecha
Michael Diskin	afc59d2a6b Log more stats for user, move performance stats to examples (#257)	hace 4 años
Max Ryabinin	2328ba9262 Fix device in Switch-MoE, overhaul Server architecture (#256)	hace 4 años
justheuristic	94b9db0d37 Fix random freezes in averager.step, improve error handling (#254)	hace 4 años
Alexey Bukhtiyarov	01103cf991 Add state checkpointing and uploading in coordinator (#241)	hace 4 años
Aleksandr Borzunov	3bde6188fe Protect training progress and metrics with signatures and DHT schema validation (#250)	hace 4 años
Aleksandr Borzunov	08ee017f0f Add nltk to ALBERT example's requirements (#251)	hace 4 años
Roman Zhytar	e833a7efb9 Decentralized adaptive optimizers (#243)	hace 4 años
Aleksandr Borzunov	18add2c04b Implement combining validators (#249)	hace 4 años
Max Ryabinin	0a1fdb172f Fix incorrect data types/values in RemoteSwitchMixtureOfExperts (#246)	hace 4 años
Max Ryabinin	dfbc401196 Add Dockerfile, refactor tests (#245)	hace 4 años
justheuristic	ddb5389e66 Fix server hanging in certain cases when connection is lost (#247)	hace 4 años
Aleksandr Borzunov	a3feafa907 Add DHT schema validator (#227)	hace 4 años
Michael Diskin	2314e7ebd5 fix metrics (#240)	hace 4 años
Alexey Bukhtiyarov	27ea94e3f9 Add example for collaborative ALBERT training (#226)	hace 4 años
Max Ryabinin	62652e1717 Add Switch Transformers-like RemoteMixtureOfExperts (#228)	hace 4 años
justheuristic	3d6a242e30 Ensure version-consistent result rounding in load_balance_peers (#230)	hace 4 años
Roman Zhytar	8c3bd93e87 Statistics averaging (#229)	hace 4 años
Vsevolod-pl	91d17a4ebc Delta gradients transmission (#225)	hace 4 años
romakail	ca5c7610ae Add tool for custom user experts (#189)	hace 4 años
justheuristic	32b87bf3fe Reset gradient buffers when synchronizing with peers (#222)	hace 4 años
justheuristic	b906ae94ed better zero_grad behavior in CollaborativeOptimizer (#221)	hace 4 años
justheuristic	2359906253 Add gradient buffers to CollaborativeOptimizer (#220)	hace 4 años
mponty	0080028e25 Add uniform compression (#202)	hace 4 años
ploshkin	9d2a40714c Prevent DecentralizedSGD from accidentally skipping a fraction of training batches (#218)	hace 4 años
Max Ryabinin	916c3db52d Move compression-related code to hivemind.utils.compression (#213)	hace 4 años
Alexey Bukhtiyarov	7bb6565674 Add CollaborativeOptimizer, TrainingAverager (#215)	hace 4 años
justheuristic	053c7c7d13 Disentangle DecentralizedAverager components, add weights (#217)	hace 4 años
Max Ryabinin	ca6d87a837 Replace FeedforwardBlock with a correct implementation (#211)	hace 4 años
Max Ryabinin	1d364b7c32 Convert SerializerBase to an abstract class (#212)	hace 4 años
Max Ryabinin	6128cbbd51 Add gradient clipping support to ExpertBackend (#214)	hace 4 años

Posterior Anterior

Historial de Commits Buscar

Historial de Commits